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DESCPIFTIOM 

STE2 0- RELATED PROTEIN KINASES 

5 RELATED APPLICATIONS 

The iDresent application claims priority zo U.S. 
Provisional Patenr Application Serial No. 60/05 1,784 by 
Plowman and Martinez, entitled STE2 0 -Related Protein 
kinases, filed April 14, 1998 (Lyon Sc Lyon Docket No. 
10 232/279) , hereby incorporated by reference herein in its 

entirety, including any drawings, tables, or figures. 

FIELD OF THE I>JVENTION 
The oresent invention relates to novel kinase 
15 polypeptides, nucleotide sequences encoding the novel kinase 

polypeptides, as well as various products and methods useful 
for the diagnosis and treatment of various kinase-related 
diseases and conditions. 

20 BACKGROUND OF THE INVENTION 

The following description of the background of the 
invention is orovided to aid in understanding the invention, 
but is not admitted to be or to describe prior art to the 
invention. 

25 Cellular signal transduction is a fundamental mechanism 

whereby external stimuli that regulate diverse cellular 
processes are relayed to the interior of cells. One of the 
key biochemical mechanisms of signal transduction involves 
the reversible ohosphorylat ion of proteins, v;hicr: enables 

30 regulation of the activity of mature proteins by altering 

their structure and function. 

The best characterized protein kinases m eukaryotes 
phosphoryiate proteins on the hydroxyl moiety of serine, 
threonine and tvrosme residues. These kinases largely fall 
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into cwc groups, those specific for phosphoryiacing serines 
and chrsonines, and chose specific for phosphorylat ing 
tyrosines. Some kinases, referred co as "dual specificity" 
kinases, are able to phosphorylace on Tyrosine as well as 
5 serine/ threonine residues. 

Prccein kinases can also be characceri zed by cheir 
location v;ithin che cell. Some kinases are transmembrane 
recepcor- rype proteins capable of directly altering their 
catalytic activity in response to the external environment 

10 such as the binding of a ligand. Others are non-receptor- 

tvpe oroteins lacking any transmembrane domain. "^hey can be 
found in a variety of cellular compartments from, che inner 
surface of . the cell memb-rane to the nucleus. 

r4an\' kinases are involved in regulatory cascades 

15 v/herein their substrates may include other kinases whose 

activities are regulated by their phosphorylation state. 
Ultimately the activity of some downstream, effector is 
modulated by phosphorylation resulting fromi activation of 
such a pat hw a y . 

20 Protein kinases are one of the largest fam.ilies of 

eukaryotic proteins with several hundred known members. 
These oroteins share a 250-300 am.ino acid domiain that can be 
subdivided into 12 distinct subdom.ains that comprise the 
common catalytic core' structure. These conserved protein 

25 motifs have recently been exploited using PCR-based cloning 

strategies leading to a significant expansion of the known 
kinases . 

Multiple alignment of the sequences in the catalytic 
dom:ain of protein kinases and subsequent parsimony analysis 
30 permits the segregaticn of related kinases into distinct 

branches or subfam.ilies including: tyrosine kinases, 
c vc lie- nuc leotide - depe nde n t ki na s e s , ca 1 c i umt/ c a 1 modul i n 
kinases, cyclin-dependent kinases and MA?-kinases, serine- 
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chreonine kinase receptors, and several ocher less cierinec. 

subf arrdlies . 

SUMMARY Q? THB INVEN'TIOK 
5 Through che use' of a targeted ?CR cloning scracegy and 

of a "motif extraction" bioinf orraat ics script, mamnialian 
members of the STE2 0- kinase family have been identified as 
part of the present invention. Multiple alignment and 
■ parsimony analysis of the catalytic domain of all of these 
10 STE20- family members reveals that these proteins cluster 

into 5 distinct subgroups. Classification in this manner 
has proven highly accurate not only in predicting motifs 
present i-n the rem.aining non- catalyt ic portion or eacn 
protein, but also in their regulation, substrates, and 
15 signaling pathways. The present invention includes the 

partial or com.plete sequence of nev; members of the STE2Q-__ 
family, their classification, predicted or deduced protein 
structure, and a strategy for elucidating their oiologic and 
therapeutic relevance. ." . " 

20 Thus, a first aspect of the invention features an 

isolated, enriched, or purified nucleic acid molecule .. 
encoding a kinase polypeptide selected from tne group 
consisting of STLK2 , STLK3 , 3TLK4 , STLK5 , STLK6 , STLK7 , ZCi , 
ZC2, ZC3, ZC4, KHS2, SULUi , SULU3 , GEK2 , PAK4 , and PAK5 . 
25 Bv "isolated" in reference to nucleic acid is .meant a 

polym.er of nucleotides coniugated to each other, including 
DMA and RMA, that is isolated from a natural source or that 
is synthesized. The isolated nucleic acid of the present 
invention is unique m the sense that it is not found in a 
30 pure or separated state in nature. Use of the term 

J II ^ - = Mi'-a"! v 0''"-"ur r i no seauence has 

iscj.ated inQi'.-c;ue=> •_naL. n.a.'_uj.ci. — j ^ 

been removed rrom its normal ce^-iuic^r '..i.e., , 

environm.ent . Thus, the sequence m.ay be in a ceii-free 
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solution or plao^d m a different cellular environmenc . The 
cernrt does not irriciy thac the sequence is rhe only nuc^eozice 
chain present, bu" nhat in is essentially free (about 90 - 
95% pure at lease) of non-nucleocide material nacurally 
5 associaced with it, and thus is distinguished from isolaced 

chromosomes . 

By the use of the uerm "enriched" in reference zo 
nucleic acid is m.eanc chat the specific DNA or RNA sequence 
conscitu-es a signi f icane ly higher fraction (2-5 fold' of 

10 the tccal DNA or RNA present in the ceils or solution of 

interest than in normal or diseased cells or in the :;ells 
from v/hich the seauence was taken. This could be caused by 
a. person by preferential reduction in. the am.ount of other 
DNA or RNA present, or by a preferential increase in the 

15 am.ounz of the specific DNA or RNA sequence, or by a 

combination of the tv/o . However, it should be noted that 
enriched does not imply that there are no other DNA or RNA 
sequer.c^s present, just that the relative amtount c-f zhe 
sequence of incerest has been significantly increased. The 

20 term, "significant" is used to indicate that the level of 

increase is useful to the person making such a.n increase, 
and generally means an increase relative co other nucleic 
acids of about at least 2 fold, m.ore preferably at least 5 
to 10 fold or even more. The term, also does not im.ply that 

25 there is no DNA or RNA from other sources. The other source 

DNA. rciay , for example, comprise DNA from a yeast or bacterial 
genome, or a cloning vector such as pUC19. This termi 
distinguishes from naturally occurring events, such as viral 
infection, or tumor type growths, in v/hich the level of one 

30 mRNA may be naturally increased relative to other species of 

miRNA . That is, the termt is mteant to cover only those 
situatiions in which a person has intervened to elevace the 
proportion of the desired nucleic acid. 

4 
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Iz is alsc advantageous for sorae purposes chat ?, 
nucleocicie sequence be in purified form. The cerm 
"curified" in reference to nucleic acid does not require 
absolute purity (such as a homogeneous preparation) . 
Instead, it represents an indication that the sequence is 
relativeiv more pure than in the natural environment 
{compared to the natural level this level should be at least 
2-5 fold greater, e.g., m terms of mg/mL) . Individual 
clones isolated from a cDNA library may be purified to 
electrophoretic homogeneity. The claimed DNA molecules 
obtained from these clones could be obtained directly from 
total DNA or from total RNA. The cDNA clones are not 
naturally.. occurring, but rather are preferably obtained via 
manipulation of a parcially purified naturally occurring 
substance (messenger RNA) . The constructicn of a cDNA 
library from mRNA involves the creation of a synthetic 
substance (cDNA) and pure individual cDNA clones can be 
isolated from the synthetic library by clonal selection of 
the cells carrying 'the cDNA 1 ibrary .. . . Thus , the process 
which includes the construction of a cDNA library from m.RNA 
and isolation of distinct cDNA clones yields an 
approximately lo'-fold purification of nhe native m.essage. 
Thus, purification of at least one order of magnitude, 
preferably two or three orders, .and more preferably four or 
five orders of magnitude is expressly contemplated. 

By a "kinase polypeptide" is meant 3 2 (preferably 40, 
more preferably 45, most preferably 55) or more contiguous 
amino acids set forth in the amino acid sequence of SEQ ID 
NO: 5, SEQ ID NO : 6 , or SEQ ID NO : 7 , or the corresponding 
full-length ammo acid sequence; 250 (pre ferabi y 2 55 , more 
preferably 260, most preferably 270} or more contiguous 
ammo acids set forth in the ammo acid sequence SEQ ID 
NO: 13, SEQ ID NO : 14 , SEQ ID NC : 1 5 , or SEQ ID NO: 105, or the 

5 
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corresponding full-iengch amino acid sequence; 27 
(preferably 30, more preferably 40 ^ most preferably 45 zr 
more contiguous amino acids sec forch in che amino acid 
sequence SEQ ID NO :1c; 16 (prererably 20, more preferably 
25; most preferably 35) or more contiguous am.ino acids sen 
forch in che amino acid sequence SEQ ID NO: 22, SEQ ID NO: 23, 
SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID NO: 31, or SEQ ID NO: 103 
or che corresponding full-length amino acid sequence; 6 
(preferably 10, more preferably 15, m.ost preferably 25) or 
m.ore contiguous amino acids set forth in the am-ino acid 
sequence of SEQ ID NO: 97 or SEQ ID NO: 99, 22 (preferably 30, 
more preferably 35, most preferably 45) or more contiguous 
am.-mo acids sec forth in che amino acid sequence of SEQ ID 
NO: 101, or che corresponding full-length amiino acid 
sequetnce ; 78 (preferably 30, more preferably S5, mosc 
preferably 90) or more conciguous amino acids sec forth in 
the amino acid sequence SEQ ID NO: 107 or functional 
derivatives chereof as described herein. For sequences for 
v/hich che full-length sequence is noc given, che remaining 
sequences can be determined using methods v;ell-knov;n co 
chose in che art and are mcended to be included in che 
invention. In certain aspects, poiypepcides of 100, 200, 
3 00 or miore amdno acids are preferred. The kinase 
polypeptide can be encoded by a full-length nucleic acid 
sequence or any portion of the full-length nucleic acid 
sequence, so long as a funccional activity of the 
polypeptide is i^etamed, not to include fragm.encs containing 
only amino acids 1-22 of SEQ ID NO: 13 or only amino acids 1- 
33 of SEQ ID NO : 107 . 

The ammo acid sequence will be substancially similar 
CO the sequence shewn in SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID 
NO: 7, SEQ ID NO : 1 3 , SEQ ID NO: 14, SEQ ID NO : 1 5 , SEQ ID 
NO: IS, SEQ ID NO: 22, SEQ ID NO: 23,' SEQ ID NO: 24, SEQ ID 
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N-0:29, SEQ ID NO : 3 1 , SEQ ID NO:97, SEQ ID NO: 39, SEQ ID 
NO: 100, SEQ ID NO: 102, SEQ ID NO: 105, cr SEQ ID NO: 107, cr 
zhe corresponding full -length amino acid sequence, or 
fragmencs thereof, not to include fragments consist mg ' only 
of the amino acid sequences 1-22 of SEQ ID NO: 15 or 1-33 of 
SEQ ID NO: 107. A sequence that is . substantially si-iiar to 
the seauence of SEQ ID NO : 5 , SEQ ID NO : 5 , SEQ ID NO : 7 , SEQ 
ID NO:i3, SEQ ID NO:14, SEQ ID NC:15, SEQ ID N0:1S, SEQ ID 
MO:22, SEQ ID NO : 2 3 , SEQ ID NO:24, SEQ ID NO : 2 9 , SEQ ID 
■NO:31, SEQ ID NO:97, SEQ ID NO:99, SEQ ID NO:101, SEQ ID 
NO:103, SEQ ID NO:105, or' SEQ ID NO:i07 v-li preferably nave 
at least 90% identity (more preferably at least 95% and most 
preferably 99-100%) to the sequence of SEQ ID NO : 5 , SEQ ID 
NO:.5, SEQ ID NO:7, SEQ ID NO:13, SEQ ID N0:14, SSQ ID NO:15, 
SEQ ID NO:1.3, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ 
ID NO:29, SEQ ID NO : 3 1 , SEQ ID NO:97, SEQ ID NO : 9 9 , SEQ ID 
N0:101, SEQ ID NO:i03, SEQ ID NO:105, or SEQ ID NO:107. 

3y "identity" is meant a property of sequences that 
measures their similarity or relationship ... Identity is 
.measured by dividing the number of identical residues by the 
total number of residues and gaps and multiplying the . 
product by 100. "Gaps" are spaces m an alignm.ent that are 
the result of additcns or delecions of am>ino acids. Thus, 
two copies of exactly the same sequence have 100% identity, 
but sequences that are less highly conserved, and have 
deletions, additions, or replacements, may have a lower 
dearee of identity. Those skilled in the art will recognize 
that several computer program.s are available for determining 
sequence identity using standard parameters, for example 
Blast (Altschul, et al . (1997) Nucleic Acids Res. 25:3339- 
3402), Blast2 (Altschul, ec al . (1990. J. mol . biol . 
215:403-410 . and Smith-waterman (Smith, et ai . :19S1) J. 
Mol. Biol. 147:155-197). 
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In preferred embcdipn.encs , che invention features 
isolated, enriched, or purified nucleic acid molecules 
encoding a kinase polypeptide comprising a nucleotide 
sequence that : (a) encodes a polypeptide having the amino 
acid sequence set forch in 5EQ ID NO:5, SEQ ID NO : 6 , SEQ ID 
I\T0:7, SEQ ID MO: 13, SEQ ID NO: 14, SEQ ID NO : 1 B , SEQ ID 
NO: 18, SEQ ID NO : 2 2 , SEQ ID NO : 2 3 , SEQ ID NO : 2 4 , SEQ ID 
NO: 29, SEQ ID NO : 3 1 , SEQ ID NO:?7, SEQ ID NO: 99, SEQ ID 
N0:101, SEQ ID N0:103, SEQ ID NO:105. or SEQ ID NO:107; (b) 
is che coiTiplemenc of che nucleotide sequence of (a) ; (c) 
hybridizes under highly scringenc conditions co che 
nucleotide molecule of (a) and encodes a nacurally occurring 
kinase polypeptide; (d) encodes a kinase polypeptide having 
the ammo acid sequence of SEQ ID NO : 5 . SEQ ID NO : 6 , 5SQ ID 
NO: 7, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO : 1 5 , SEQ ID 
NO: IS, SEQ ID NO : 2 2 , SEQ ID NO : 2 3 , SEQ ID NO: 24, SEQ ID 
NO:29. SEQ ID NO : 3 1 , SEQ ID NO:97, SEQ ID NO:99, SEQ ID NO: 
103, SEQ ID NO: 105, or SEQ ID NO: 107, except that it lacks 
one or more, but not all, of nhe following segments of am.ino 
acid residues: 1-21, 22-274, or 275-416 of SEQ ID NO : 5 , 1- 
31, 32-303, 309-489 or 490-516 of SEQ ID NO : 5 , 1-178 or 179- 
414 of SEQ ID N0:7, 1-22, 23-289, 290-526, 527-640, 641-896, 
or 397-1239 of SEQ ID N0:13, 1-255, 255-442, 443-626, 627- 
954, or 955-1297 of SEQ ID NO:14, 1-255, 256-476, 477-680, 
681-933, or 984-1326 of SEQ ID NO:15, 1-13, 14-273, 274-346, 
347-534, or 535-394 of SEQ ID N0:1S, 1-21, 22-277, 273-427, 
423-637, 638-751, or 752-393 of SEQ ID NO:22, 1-66, 67-215, 
216-425, 426-539, 540-786, or 787-337 of SEQ ID NO : 2 3 , 1-25, 
26-273, 274-422, 423-632, or 633-748 of $EQ ID NO:24, 1-51, 
52-224, 225-393, 394-658, or 659-681 of SEQ ID NO : 2 9 , 1-25, 
26-281, 234-430, 431-640, 641-754, 755-901, or 902-1001 of 
SEQ ID NO:31, 1-10, 11-321, or 322-373 of SEQ ID NO:97, 1- 
57, 53-359, or 370-413 of SEQ ID NQ:99, 1-52, 53-173, 174- 
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307, 30S-572, or 573-591 of SEQ ID NO:103,- 1-24, 25-289, 
290-397, 3SS-62S, 529-572, or 373-1227 of SEQ ID MO:105, or 
i-33, 34-294, 295-337, 338-472, 473-724, or 725-SS3 of SEQ 
ID NO: 107; (e) is the cotTipiement of che nucleocide sequenrs 
of (d) ; (f ) encodes a polypepcide having che amine acid 
sequence set forth in SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID NO : 7 , 
SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: IS, SEQ 
ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 
•NO: 31; SEQ ID NO: 97, SEQ ID NO: 99, SEQ ID NO: 103, SEQ ID 
NO: 105, or SEQ ID NO: 107 from amino acid residues 1-21, 22- 
274, or 275-416 of SEQ ID NO : 5 , 1-31, 32-303, 309-439, or 
490-515 of SSQ ID NO : 5 , 1-178 or 179-414 cf SEQ ID NO : 7 , 23- 
239, 290-526, 52 7 - 54 0 , . 64 1 - S 96 , cr 397-1239 of SEQ ID NO:13, 
1-255, 256-442, 443-626, 627-954, or 955-1297 of SEQ ID 
N0:14, 1-255, 256-476, 477-680, 681-983, or 984-1326 of SEQ 
ID N0:15, 1-13, 14-273, 274-346, 347-534, or 535-394 of SEQ 
ID NO:lS, 1-21, 22-277, 278-427, 42S-637, 638-751, or 752- 
89-8 of SEQ ID NO:22, 1-66, 67-215, 216-425, 426-539, 540- 
786, or 737-887 of SEQ ID NO:23., 1-25, 26-272, 274-422, 423 
632, or 633-748 of SEQ ID NO:24, 1-51, 52-224, 225-393, 394 
658, or 659-631 of SEQ ID NO:29, 1-25, 26-281, 282-430; 431 
640, 641-754, 755-901, or 902-1001 cf SEQ ID NO : 3 1 , 1-10, 
11-321, or 322-373 of SSQ ID NO:97, 1-57, 58-369, or 370-41 
of SEQ ID NO:99, 1-52, 53-173, 174-307, 303-572, or 573-591 
of SEQ ID NO: 103. 1-24, 25-289, 290-357, 393-628, 529-872, 
or 873-1227 of SEQ ID N0:105,or 1-33, 34-294, 295-337, 338- 
472, 473-724, or 725-968 of SEQ ID NO: 107; ^g) is the 
complement of the nucleotide sequence of (f);(h) encodes a 
DOlypeptide having the amino acid seausr^.ce set forth in SEQ 
ID NO: 5, SSQ ID NO : 6 , SEQ ID NO : 7 , SEQ ID NO : 1 3 , SEQ ID 
NO: 14, SEQ ID NO: 15, SEQ ID NO: IS, SEQ ID NO: 22, SSQ ID 
NO: 22, SSQ ID NO: 24, SEC ID NO : 2 9 , SEQ ID NO : 3 1 , SSQ ID 
NO: 97, SSQ ID NO: 99, SEQ ID NO: 101, SSQ ID HO: 103, SEQ ID 
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NO: 105, or SEQ ID NO: 107, except thac it lacks one cr more 
cf the dcmains selected fron"; the group consisting of a 11- 
terminal dorriain, a catalytic domain, a C-terminal doiriain, 
coiled-coil structure region, a proiine-rich region, a 
spacer region, an insert, and a C-terminal tail; cr (i) is 
the complement of the nucleotide sequence of (h) . 

The termt " comiplement " refers to two nucleotides that 
can form, msultiple favorable interactions v.-ith one another. 
For example, adenine is complementary to thymiine as they can 
form zvjo hydrogen bonds. Similarly, guanine and cytosine 
are complem.ent ary since they can formi three hydrogen bonds. 

A nucleotide sequence is the complement of another 
nucleotide., sequence if all of the nucleotides of the first 
sequence are ccmplem.entary to all of the nucleotides of the 
second sequence. 

The termi "dom.ain" refers to a region of a polypeptide 
which contains a particular function. For instance, M- 
termiinal or C-term.inal dom.ains of signal transduction 
proteins can serve functions including, but not lim.ited to, 
binding molecules that localize the signal transduction 
molecule to different regions of the cell or binding other 
signaling molecules directly responsible for propagating a 
particular cellular signal. Somie domains can be expressed 
separately from the rest of the protein and function by 
them:selves, v/hile others m.ust rem^ain part of the intact 
protein to retain function. The latter are termed 
functional regions cf proteins and also relate to domains. 

The term. ''M- 1 erm.mal domiain" refers to the 
extracatalyt ic region located betv;een the initiator 
methionine and the catalytic domain of the protein kinase. 
The N-termiinal dom.am can be identified follov^ing a Sm.ith- 
Watermian alignment of the protein sequence against the non- 
redundanz protein database to define the M- terminal boundary 
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of the catalytic domain. Depending on its length, the N- 
terminal domain may or may not play a regularory role in 
kinase function. An example of a procein kinase whose N- 
term.mal dom:ain has been shown to play a regulatory role is 
PAK6 5, -which ccncains a CRIB motif used for Cdc4 2 and rac 
binding (Burbelo, P.D. et al . (1995) J. Biol. Chem. 270, 
29 0 71-290740) . 

The N- terminal domain spans amino acid residues 1-21 of 
the seauencB sec forth in SEQ ID Nc : 5 , amino acid residues 
1-11 of the. sequence sec forth m SEQ ID NO : 6 , amino acid 
residues 1-22 of the sequence set forth in SEQ 1^) NO: 13, 
amino acid residues 1-13 of the sequence set forth in SEQ ID 
IS!p:lS. amino acid residues 1-21 of che sequence sec forth in 
SEQ ID NO: 22, ammo acid residues 1-25 of the sequence set 
forth in SEQ ID NO: 24, amdno acid residues 1-51 of the 
sequence set forth in SEQ ID NO: 29. amino acid residues 1-25 
of the sequence set forth m SEQ ID NO: 31, am:ino acid 
residues 1-57 of the sequence set forth m SEQ ID NO: 99, 
ammo acid residues 1-52. of che\secueace set .forth m SEQ ID 
NO: 103, amino acid residues 1-24 of the sequence set forth 
m SEQ ID NO: 10 5, or am.ino acid residues 1-33 of che 
sequence sec forch m SEQ ID NO: 107. 

The cerm "cacalycic domain" refers co a region of the 
procein kinase chat is cypically 25- 300 ammo acids long and 
is responsible for carrying out the phosphace transfer 
reaction from: a high-energy phosphace donor molecule such as 
AT? or GTP co icself (autophosphorylacion) or to other 
proceins (exogenous phospnorylac ion) . The cacalytic domain 
of orotein kinases is made up of 12 subdomiains thac concain 
highly conserved ammo acid res idues , . and are responsible 
for proper polypeptide folding and for catalysis. The 
cacalytic domain can be idencified following a Smich- 

M 
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Waterman alignment of the protein sequence against che ncn- 
redundanc pronein database. 

Tne catalytic domain spans amino acid residues 22-274 
of the sequence se- forth in 5EQ ID NO : 5 , residues 32-308 of 
the sequence set forth m 3EQ ID NO : 6 , residues 1-178 of the 
sequence sec forth in SSQ ID NO : 7 , residues 23-2 9 9 of the 
sequence set forth in SEQ ID NO: 13, residues 1-255 of the 
sequence set forth in SEQ ID NO: 14, residues 1-255 of the 
sequence set forth in SEQ ID NO: 15, residues 14-273 of the 
sequence set forth in SEQ ID NO: 18, residues 22-2 77 of the 
sequence set forth m SEQ ID NO:22, residues 1-65 of the 
sequence set forth m SEQ ID NO:23, residues 26-273 of the 
sequence set forth m SEQ ID NO: 24, residues 3 94-653 of the 
sequence set forth in SEQ ID NO: 29, residues 26-281 of the 
sequence set forth in SEQ ID NO: 31, residues 1-278 of the 
sequence set forth in SEQ ID NO: 97, residues 58-369 of the 
sequence set forth m SEQ ID NO:99, residues 1-1C3 of the 
sequence set forth m SEQ ID NO: 101, residues 308-572 of the 
sequence set forth in SEQ ID NO: 103, residues 25--239 of the 
sequence set forth m SEQ ID NO:105, or residues 34-294 of 
the sequence set forth in SEQ ID NO:lC7. 

The term "catalytic activity", as used herein, defines 
tne rate at which a kinase catalytic domain phospncryiates a 
substrate. Catalytic activity can be measured, for example, 
by determining the amount of a substrate converted to a 
phosphorylated product as a function of tim.e. Catalvtic 
activity can be measured by mtethods of the invention by 
holding tim.e constant and determining the concentration of a 
phosphorylated substrate after a fixed period of time. 
Phosphorylation of a substrate occurs at the active-site of 
a protein kinase. The active-site is normally a cavitv m 
which the substrate binds to the protein kinase and is 
phosphorylated . 
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The zsrm "substrate" as used herein refers to a 
molecule phosphcrylaced by a kinase of the invention. 
Kinases phosphcryiate substrates on serine/ threonine or 
tyrosine amino acids. The molecule may be another protein 
5 or a polypeptide. 

The term "C- terminal domain" refers to the region 
located betv;een the catalytic domain or the last (located 
closest tC' the C-terminus) functional domain and the 
carboxy- terminal amino acid residue of the protein kinase. 
10 By "functional" domain is meant any region of the 

oolvoeotide chat may play a regulatory or catalytic role as 
predicted from amdno acid sequence hom.ology to other 
proteins or by the presence of amino acid sequences that may 
give rise to specific structural conf orm.at ions (i.e. corled- 
15 coils> . The C- terminal domain can be identified by using a 

Smith-vvaterman alignment of the protein sequence against the 
ncn- redundant protein database to define the C- terminal 
boundarv of the catalytic domiain or of any functional C- 
terminal . extracatalytic dom.ain. Depending on its length and 
20 am;ino acid composition, the C-termtinal dom.ain m.ay or may not 

plav a regulatory role in kinase function. An example of a 
protein kinase v/nose C-terminal doma^in may play a regulatory 
role is ?AK3 which contains a hecerotrimeric subunic- 
binding site near its C-terminus (Leeuv;, T. et al .(1998) 
25 Nature, 391, 191-195) . 

The C-terminal domain spans amino acid residues 275-416 
of the seauence set forth in SSQ ID NO : 5 , residues 309-439 
of the sequence set forth in SEQ ID NO : 6 , residues 179-414 
of the sequence set forth in SEQ ID NO : 7 , residues S97-1239 
30 of the secuence set forth m SEQ ID NO-:!!, residues 955-129~ 

of the seauence set forth in SEQ ID NO: 14, residues 984-1326 
of the seauence set forth in SEQ ID NO-: 15, residues 535-394 
of the sequence sez forth in SEQ ID NO:lc, residues 752-398 
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of -he sequence set forth m SEQ ID NO:22, residues 2^9-3" 
of Che sequence set forch in SEQ ID NO:97, residues 370-418 
of the sequence set forth in SEQ ID MO: 99, or residues 673- 
1227 of Che sequence set forch in SEQ ID NO: 105. 

The term "signal transduction pathway" refers co the 
rrolecuies that propagate an extracellular signal through che 
cell membrane to become an intracellular signal. This 
signal can then stimulate a cellular response. The 
polypepcide molecules involved in signal cransduccion 
processes are typically receptor and non-receptor procein 
tyrosine kinases, receptor and non- receptor procein 
phosphacases, SRC homology 2 and 3 domains, phosphocyrcsme 
binding proteins (SRC homology 2 (SH2) and phosphotyrosme 
binding (?T3 and PH) domain containing procems) , prolme- 
rich binding proteins (SH3 dom.ain containing proteins) , 
nucleotide exchange factors, and transcription factors. 

The term- "coiled-coil structure region" as used herein, 
refers to a polypeptide sequence that has a high probability 
of adopting a coiled-coil structure as predicted by computer 
algorithms such as COILS {Lupas, A. (1996) Mech. En-ymolcgv 
266:513-525). Coiled-coils are formed by two or three 
amphipachic ca-helices m parallel. Coiled-ccils can bind to 
coiled-coil domains of other polypeptides resulting m homo- 
or heterodimers (Lupas, A. (1991) Science 2 52:1162-1164). 
Coiied-ccil-dependent oligomerizacion has been snox^in to be 
necessary for protein function including catalytic activity 
of serine/threonine kinases (Roe, J. et al . (1997) J. Biol. 
Chem. 272:5333-5345) , 

Tne coiled-coil structure region spans am.mo acid 
residues 290-525 of the sequence sec forch in SEQ ID NO: 13, 
residues 256-442 of the sequence set forth in SEQ ID NO: 14, 
resiaues 256-476 of che sequence sec forth in SEQ ID NO: 15, 
residues 428-537 of che sequence sec forch in SEQ ID NO: 22, 

1^ 
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residues 216-425 or 540-786 of the sequence set fcrch in SEQ 
ID NO:23, residues 423-632 of the sequence set forth m SEQ 
ID NC:24, residues 431-640 or 755-901 of the sequence set 
forth in SEQ ID NO : 3 1 , residues 291-39S or 629-66S of che 
sequence set forth in SEQ ID NO: 105, or residues 473-724 or 
725-963 of the sequence set forch m SEQ ID NO: 107. 

The cern- " prol ine - rich region" as used herein, refers 
CO a reoion of a procein kinase whose proline concent over a 
given amino acid length is higher than the average content 
of this aiTiinC' acid found in proteins ( i . e . , >1C%) . Proline- 
rich regions are easily discernabie by visual inspection of 
arr.ino acid sequences and quantitated by standard computer 
seqwBnce: analysis programs such as the DNAScar program 
EditSeq. Prolme-rich regions have been demonstrated to 
parcicipate in regulatory protein -protein interactions. 
Among these interactions, those that are most relevant to 
this invention involve the "PxxP"" proline rich motif found 
m certain protein kinases (i.e., human PAKl) and the SK3 
douiainof the adaptor molecule Nek (Galisteo, M.L.-et al . 
(1S96) J. Biol- Chem. 2 71:20997-21000). Other regulatory 
interactions involving "Pxx?" prol ine -rich motifs include 
the WW domain (Sudol, M. (19 96) Prog. Biochys . Mol . 3ic. 
65 : 113 - 132) . 

The proline-rich region spans amiino acid residues 527- 
640 of the sequence set forth in SEQ ID NO: 13, residues 443- 
526 of the sequence set forth in SEQ ID NO: 14, residues 477- 
6S0 of the sequence set forth m SEQ ID NO: 15, residues 347- 
534 of the sequence set forth m SEQ ID NO : 1 3 , res idues 39S- 
625 of the sequence set forth in SEQ ID NO:105, c^r residues 
338-472 of the sequence set forth in SEQ ID NO: 107. 

The term "spacer region" as used herein, refers to a 
region of the protein kinase located between predicted 
functional doT;ains . The spacer region has no detectable 
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hoTiCiogy to any amino acid sequence in the database, and can 
be identified by using a Smich-Vv'ac erman alignment of rhe 
procein sequence againsc che non- redundant procein database 
CO define the C- and N-cerminai boundaries of the flanking 
functional domains. Spacer regions may or may not pj-ay a 
fundamental role in protein kinase function. Precedence for 
the regulatory role' of spacer regions in kinase function is 
orovided by che role of the src kinase spacer in incer- 
dom.ain interactions (Xu, W. et al . (1997) Nature 335:595- 
502) . 

The spacer region spans amino acid residues 64 1-S5S of 
the sequence set forth in SEQ ID NO: 13, residues 527-954 of 
the sequence sec forth in SEQ ID NO: 14, residues 681-583 of 
rhe sequence set forth in SEQ ID NO: 15, residues 274-346 of 
Che sequence sec forth m SEQ ID NO: 18, residues 278-427 or 
63S-751 of t-he sequence set forch in SEQ ID NO: 22, residues 
67-215 or 425-539 of che sequence set forth in SEQ ID NO:23, 
residues 274-422 or 633-748 of the sequence set forch in SEQ 
ID NO:24, r-sidues 225-393 of the sequence set forth in SEQ 
ID NO:25, residues 232-430 or 641-754 of the sequence set 
for-h in SEQ ID NO: 31, residues 174-307 of the sequence set 
forch m SEQ ID NO: 103, residues 56S-872 of the sequence set 
iorch in SEQ ID NO: 105, cr residues 295-337 of the sequence 
set forth in SEQ ID NO: 107. 

The term "insert" as used herein refers no a portion of 
a protein kinase that is absent from a close homolog . 
Inserts may or m.ay not by che product alternative splicing 
of exons. Inserts can be identified by using a Sm.ith- 
Waterman sequence alignment of the protein sequence against 
the non -redundant protein database, or by means or a 
multiple sequence alignment of homologous sequences usi.ng 
the DNAStar program Megalign. Inserts may piay a fu.nctional 
role by presenting a nev; interface for protein-protein 
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interaccicns; cr by incerfering v/ith such inc eract ions . 
Inser-s span amino acid residues 52-224 of che sequence seu 
forth in SEQ ID NO: 29 or residues 53 - 173 of che sequence sec. 
forth in SEQ ID NO: 103. 
5 The cerrri "C-cerminal tail" as used herein, refers cc a 

C-terrr.inal domain of a protein kinase, that by homology 
e:-:cends or orocrudes pasc the C-term.inal amino acid of its 
closest homolog. C-terminal cails can be identified by 
using a Smith -Wacerman sequence alignm.ent cf "he procein 

10 secuence against the non- redundant procein dacabase, or by 

means of a multiple sequen'ce alignmienc of homologous 
sequences using che DNAStar program Megalign. Depending on 
Its length., a C-terminal tail mtay or may not play a 
regulacor'y role in kinase funccion. 

15 The C-cerm.inal tail spans amino acid residues 490 -516 

of the sBQuence set forch m SEQ ID NO : 6 , residues 7S7-387 
of che sequence set forth in SEQ ID NO : 2 3 , residues 659-681 
cf -che sequence set forth in SEC ID NO : 2 9 , residues 994-1053 
of the" sequence . set. .forth in SEQ". ID NO : 3 1 , or residues 573- 

20 59T of che sequence sec forth in SEQ ID NO: 103. 

Various low or high stringency hybridization conditions 
may oe used depending upon che specificity and seleccivity 
desired. These conditions are v7ell-knov;n co chose' skilled 
in the arc. Under stringent hybridization conditions only 

25 highly complementary nucleic acid sequences hybridize. 

Preferablv, such conditions prevent hybridization of nucleic 
acids having m.ore than 1 or 2 miismiacches out of 20 
contiguous nucleotides, mo^re preferably, such conditions 
prrevent hvbr idi zac ion of nucleic acids having more than 1 or 

30 2 mismatches ouc of 50 contiguous nucleotides, m.osc 

preferably, such conditions prevent hybridization of nucleic 
acids having more than 1 or 2 mism.acches ouc of 100 
conciguous nucleotides. In some instances, the conditions 

\1 
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may- prever-c hybrid! zacion of nucleic acids having more rhan 

5 mismatches in nhe full -length sequence. 

By scringent hybridizacion assay conditions is meanc 
hybridizar ion assay condicions at least as stringent: as rhe 

following: hybridization m 50% formamide, 5X S3C, 50 m.M 
NaH.PO,, pK 6,8, 0.5% SDS , 0.1 mg/mL sonicaced salmon sperm 
DNA, and 5X Denhart solution ac 42 °C overnight; v/ashmc with 
2X SSC, 0.1% SD3 ac 45 ; and washing with 0 . 2X SSC, 0.1% 
SDS az 4 5 ""C. under some of the most stringent hybridization 
assay condicions, the second wash can be done v/ith 0 . IX SSC 
at a temperature up to 70 °C (Berger et al . (1937) Guide to 
Molecular Clonmo Techniques pg 421, hereby incorporated by 
ref erence "herein including, any figures, tables, or 
crawmgs . ; . Hov/ever, other applications may require che use 
of condicions falling becween these sets of condicions. 
Mechods of decermining che condicions required to achieve 
desired hybridizations are . v/el 1 - known to those v/ith ordinary 
skill in the art, and are based on several factors, 
including but not limited co, the sequences to be hybridized 
and che samples co be cesced. 

In other preferred embicdiment s , the invention features 
isolaced, enriched, or purified nucleic acid molecules 
encoding kinase . polypept ides , further comprising a vector or 
promoter effective to iniciace transcription in a hose cell. 

The invention also features recombinant nucleic acid, 
preferably in a cell or an organism.. The recombinant 
nucleic acid mtay contain a sequence sec forch in SEQ ID 
M0:1, SEQ ID i^C:2, SEC ID NO : 3 , SEQ ID NO : 9 , SEQ ID NO: 10, 
SEQ ID NO: IT, SEQ ID NO: 17, SEQ ID NO : 1 9 , SEQ ID NO : 2 0 , SEQ 
ID NO: 21, SEQ ID NO : 2 7 , SEQ ID NO: 56, SEQ ID NO: 93, SEQ ID 
NO: 100, SEQ ID NO: 102, SEQ ID NO: 104, or SEQ ID NO: 106, or a 
runctional derivacive cherecf and a vector or a prom.oter 
-^feccive to iniciace transcription in a hose cell. The 
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recorr.binant nucleic acid can alcernac ively contain a 

transcript icnal initiation region functional in a cell, a 
sequence corrplementary to an RNA sequence encoding a kinase 
polypepride and a transcriptional termination region 
5 functione.l in a cell. Specific vectors and hose cell 

combinations are discussed herein. 

The term "vector" relates to a single or double- 
stranded circular nucleic acid molecule that can be 
transfected into cells and replicated -within or 

10 independently of a cell genomie . A circular double ~ stranded 

nucleic acid molecule can be cut and thereby linearized upon 
treatm.ent with restriction enzymes. An assortment of 
nucleic acid vectors, restriction enzym.es, and the knowledge 
of the nucleotide sequences cut by restriction enzymes are 

15 readily available to those skilled in the art. A nucleic 

acid molecule encoding a kinase can be inserted into a 
vector by cutting the vector v/ith restriction enzym.es and 
ligating the two pieces together. 

The term. " trans fee t ing " defines a number, of miethods to 

20 insert a nucleic acid vector or other nucleic acid m.olecules 

into a cellular organism,. These miethods involve a variety 
of techniques, such a? treating the cells vyith high 
concentrations of salt, an electric field, detergent, or 
DMSC to render the outer membrane or v/all of the cells 

25 permeable to nucleic acid miolecules of interest or use of 

various viral transduction strategies. 

The termi "icromtOter " as used herein, refers to nucleic 
acid sequence needed for gene sequence expression. Promioter 
regions vary fromi organism to organism^, but are well known 

30 to persons skilled in the art for different organism.s. For 

example, in orokaryotes, the promoter region contains both 
the prcm;Oter- (which directs tlie initiation of RNA 
t ranscr lot ion ) as well as the DKiA sequences v;hich, when 
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transcribed inro RMA, v/ill signal synthesis iniciacion. 

Such regions v;ill normally include chose 5 ' -non-coding 
sequences involved wich initiation of t ranscripc ion and 
translation, such as the TATA box, capping sequence, CAAT 
5 sequence, and the like. 

In preferred embodiments, the isolated nucleic acid 
comprises, consists essentially of, or consists of a nucleic 
acid sequence set forth in SEQ ID NO : 1 , SEQ ID NO : 2 , SEQ ID 
NO: 3, SEQ ID NO : 9 , SEQ ID NO: 10, SEQ ID NO : 1 1 , SEQ ID NO : 1 7 , 

10 SEQ ID NO: 19, SEQ ID NO: 2.0, SEQ ID NO : 2 1 , SEQ ID NO : 2 " , SEQ 

ID NO:96, SEQ ID NO:93, SEQ ID NO:100 SEQ ID NO:102, SEQ ID 
NO: 104, or- SEQ ID NO: 106, or the corresponding full -length 
sequence, encodes the amino acid sequence of SEQ ID NO : 5 , 
SEQ ID NO: 6, SEQ ID NO : 7 , SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID 

15 NO: 15, SEQ ID NO : 1 3 , SEQ ID NO: 22, SEQ ID NO : 2 3 , SEQ ID 

NO: 24, SEQ ID NO : 2 9 , SEQ ID NO : 3 1 , SEQ ID NO: 97, SEQ ID 
NO: 99, SEQ ID NO : 1 C 1 , SEQ ID NO : 1 0 3 , SEQ ID NO: 105, or SEQ 
ID-NO:lC7, or the corresponding full-length amino acid 
sequence, a functional derivative thereof or at least 40, 

20 45, 50, 60, 100, 200, or 300 contiguous amino acids of SEQ 

ID NO: 5, SEQ ID NO : 6 , or SEQ ID NO : 7 , or of the 
corresponding full-length amino acid sequence; at least 250, 
255, 275, 300, or 400 contiguous amino acids of SEQ ID 
NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, or of the corresponding 

25 full-length amino acid sequence; at least 27, 30, 35, 40, 

50, 100, 200, or 3 00 contiguous amino acids of SEQ ID NO: 18; 
at least 16, 25, 35, 50, 100, 200, or 3 00 contiguous am.mo 
acids of SEQ ID NO: 22, SEQ ID NO : 2 3 , SEQ ID NO: 24, SEQ ID 
NO:29, SEQ ID NO : 3 1 , or SEQ ID NO:103, or of the 

30 corresponding full-length amino acid sequence; 6 (preferably 

10, more preferably 15, .most preferably 25) or more 
contiguc^us am.ino acids set forth in the am.ino acid sequence 
of SEQ ID NO: 97 or SEQ ID NO: 99, or the. corresponding full- 
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iengch ammo acid sequence; 22 (preferably 3C, more 
preferably 35, nncsc preferably 45) or more conciguous amino 
acids set forth in the amino acid sequence of 3EQ ID NO: 101, 
or che corresponding full-length amiino acid sequence; or ao 
5 least 30, 85, 90, 100, 200, or 300 contiguous am.ino acids of 

SEQ ID NO: 107, c-r functional derivatives thereof. The 
kinase polypeptides, selected fromi the group consisting of 
STLK2, STLK3, STLK4 , STLK5 , 3TLK6 , STLK7 , ZOl , ZC2 , Z03 , 
ZC4 , KHS2 , SULUl , SULU3 , GSK2 , PAK4 , and ?AK5 , com:prise , 

10 consist essentially of, or consist of at lease at least 40, 

45, 50, 60, 100, 200, or 300 contiguous am.ino acids of SEQ 
ID N0:5, SEQ ID NO : 6 , or SEQ ID NO : 7 ; at least 250, 255, 
275, 300,, or 400 contiguous amino acids of SEQ ID NO: 13, SEQ 
ID N0:14, SEQ ID NO : 1 5 , or SEQ ID NO : 1 0 5 ; ac least 27, 30, 

15 35, 40, 50, 100, 200, or 300 contiguous amiinc- acids of SEQ 

ID NO:13;. at least 35, 40, 45, 50, 100, 200, or 300 
contiguous amino acids of SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID 
NO:24, SEQ ID NO:29', SEQ ID NO : 3 1 or SEQ ID NO : 1 0 3 ; 6 
{oreferablv. 10, .miore^ preferably .1 5, .most preferably 25) or 

20 more contiguous amino acids set forth in the am.ino acid 

sequence of SEQ ID NO:57 or SEQ ID NO:99; 22 (pre f erably; 3 0 , 
more preferably 35, most preferably 45) or more contiguous 
amiino acids set forth in the amino acid sequence of SEQ ID 
NO: 101; or at least 30, 35, 90, 100, 200, or 300 conciguous 

25 am.ino acids of SEQ ID NO: 107, or the ■ corresponding full- 

length sequences or deriv^atives thereof. The nucleic acid 
mtay be isolated from a natural source by cDNA cloning or by 
subtract ive hybridization. The natural source may be 
mamm.alian, preferably human, blood, sem^en, or tissue, and 

30 the nucleic acid may be synthesized by the triester method 

or by using an automated DNA synthesizer. 

The term, ''miamimial" refers preferably to such organism.s 
as mdce, racs, rabbits, guinea pigs, sheep, arid goats, more 

2J 



BNSDOCiD: <WO 9953036A2_L> 



wo 99/53036 PGT/US99/08 1 50 

preferably co cacs, dogs, iTionkeys, and apes, and rrost 

pi'ef erably co humans . 

In yet ocher preferred embodiments, the nucleic acid i 
a conserved or unique region, for example these useful for: 
the design of hybridization probes to facilitace 
identification and cloning of additional polypeptides, the 
design of PCR probes co facilitate cloning of additional 
polypepc ides , obtaining antibodies to polypeptide regions, 
and designing antisense oligonucleotides. 

By "conserved nucleic acid regions", are meanc regions 
present on cwo or more nucleic acids encoding a kinase 
polypeptide, cc which a particular nucleic acid sequence ca 
hvbr idi ze . under lower stringency conditions. Exa.mples of 
lower stringency conditions suitable for screening for 
nucleic acid encoding kinase polypeptides are provided in 
Abe, et al . {J. Biol. Chem. 19:13361-13366; 1992), hereby 
incorporated by reference herein in its entirety, including 
any drawings, figures, or tables. Preferably, conserved 
regions differ by no more than 5 out of 2 0 nucleotides, eve 
more oreferr-/-. ly 2 out of 20 nucleocides or mosc preferably 
out of 2C nucleocides. 

3y "unique nucleic acid region" is meant a sequence 
present in a nucleic acid coding for a kinase polypeptide 
that is noc present in' a sequence coding for any other 
naturally occurring polypeptide. Such regions preferably 
encode 32 (preferably 40, more preferably 45, most 
preferably 55) or more contiguous amino acids set forch in 
the amino acid sequence of SEQ ID NC : 5 , SEQ ID NO : £ , or SEQ 
ID MO : 7 , or the corresponding full-length amino acid 
sequence; 250 (preferably 255, more preferably 260, most 
preferably 270) or mtore ccnciguous amino acids set forth in 
the amiino acid sequence SEQ ID KG: 13, SEQ ID NO: 14, or SEQ 
ID NO: 15, or SEQ ID NO: 105, or the corresponding full-lengc 
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amine acid sequence; 2 7 (preferably 30, more preferably 4 0, 
mcsc preferably 45) or more contiguous amino acids set forth 
in che am,ino acid sequence SEQ ID NO: 13; 15 (preferably 20, 
more preferably 25, most .preferably 35) or more contiguous 
5 amino acids set forth in the amino acid sequence SEQ ID 

NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 
NO: 31,. or SEQ ID NO:lC3, or the corresponding full-lengch 
amdno acid sequence; 6 (preferably 10, more preferably 15, 
most preferably 25) or more contiguous am.ino acids set forth 

10 in the am.ino acid sequence of SEQ ID NO: 97 or SEQ ID NO: 99, 

22 (preferably 30, more preferably 35, mosc preferably 45) 
or more contiguous amdno acids set forth in the am.ino acid 
secuence of SEQ ID N0:1C1, or the corresponding full-length 
amino acid sequence; or 73 (preferably SO, more preferably 

15 35, most preferably 90) or more contiguous ammo acids set 

forth m the amino acid sequence SEQ ID NO: 107, or 
functional derivatives cherecf . In particular, a unique 
nucleic acid region is preferably of m.am.mial lan origin. 
• • A- second • aspect of the . invention features a nucleic 

20 acid probe for the detection of nucleic acid encoding a 

kinase polypeptide in a sam.ple, wherein said polypeptide is 
selected from the group consisting of STLK2 , STLK3 , STLK4 , 
STLK5, STLKb, STLK7 , ZCl , ZC2 , 2C3 , ZC4 , KHS2 , SULUl , SULU3 , 
GEK2 , PAK4 , and PAK5 . Preferably, the nucleic acid probe 

25 encodes a kinase pclypepcide thac is a fragment of the 

protein encoded by the am.ino acid sequence set forth in SEQ 



ID NO 
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01 , 


SEQ ID NO: 


1 
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cf SEQ ID NO: 107- The nucleic acid probe concains a 
nucleo~ide base sequence chac v;ill hybridize cc a sequence 
set forth in SEQ ID NO : 1 . SEQ ID NO : 2 , SEQ ID NO : 3 , SEQ ID 
NO: 9, SEQ ID NO : 1 0 , SEQ ID NO : 1 1 , SEQ ID NO : 1 7 , SEQ ID 
5 NO: 19, SEQ ID NO : 2 0 , SEQ ID NO : 2 1 , SEQ ID NO : 2 7 , SEQ ID 

NO: 96, SEQ ID NO: 93, SEQ ID NO: 100, SEQ ID NO: 102, SEQ ID 
NO: 104, or SEQ ID NO: 106, or the corresponding full-lengch 
secuence, or a funccional derivative thereof. 

In oref erred embodiments, the nucleic acid probe 

10 hybridizes to nucleic acid encoding at lease 5, 12, 75, 90, 

105, 120, 150, 200, 250, 300 or 350 contiguous amino acids 
cf the sequence set forth in SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ 
ID NO: 7, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID 
NO: 18, SEQ ID NO: 22, SEQ ID NO : 2 3 , SEQ ID NO: 24, SEQ ID 

15 NO: 29, SEQ ID NO : 3 1 SEQ ID NO: 97, SEQ ID NO : 9 9 , SEQ ID 

NO: 101, SEQ ID NO: 103, SEQ ID NO: 10 5, or SEQ ID NO: 107, or 
the corresoonding full-length amino acid sequence, or 
f unct ional der ivac ives thereof . 

Methods for using the probes include detecting the 

20 presence or am.ounu of kinase RNA in a sample by concaccing 

the sample with a nucleic acid probe under conditions such 
that hybridization occurs and dececcing the presence cr 
amount of ■ the probe bound ro kinase RNA. The nucleic acid 
duDle:-: form.ed between che probe and a nucleic acid sequence 

25 coding for a kinase polypeptide may be used in the 

identification of the sequence of the nucleic acid detected 
(Nelson et ai . , in Nonisotopic DNA Probe Techniques, 
Academic Press, San Diego, Kricka, ed,, p. 275, 1992, hereby- 
incorporated by reference herein in its entirety, including 

30 any drav;ing3, figures, or cables; . Kics for performing such 

m^ethods may be conscruGzed to include a container m.eans 
having disposed therein a nucleic acid probe. 
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In a Lhird aspscc , che invencion describes a 

reccTTLbinanc cell cr cissue comprising a nucleic acid 
molecule encoding a kinase polypeptide selected fron^. the 
group consisting of STLK2 , STLK3 , STLK4 , STLK5 , STLK5 , 
5 3TLK7, ZCl, ZC2, ZC3 , ZC4 , KH32 , 3ULU1 , SULU3 . GEF:2 , ?AK4 , 

and ?AK5 . In such cells, the nucleic acid may be under nhe 
control of the genomic regulatory elements, or m.ay be under 
the control of exogenous regulatory elements including an 
exocenous oromcter. By "exogenous" it is m.eanc a promoter 
10 that is not normally coupled in vivo transcriptionally to 

the codinc? seouence for the kinase polypeptides. 

The oolvpeptide is preferably a fragment of the protein 
encoded by the amtino.acid sequence set forth m SEQ ID NO : 5 , 
SEQ ID NO: 5, SEQ ID NO : 7 , SEQ ID NO : 1 3 , SEQ ID NO: 14. SEQ ID 
15 N0:15, SEQ ID NO : 1 3 , SEQ ID NO : 2 2 , SEQ ID NO : 2 3 , SEQ ID 

NO: 24, SEQ ID NO : 2 9 , SEQ ID NO : 3 1 , SEQ ID NO : 3 7 , SEQ ID 
NO: 99, SEQ ID NO: 101, SEQ ID NO: 103, SEQ ID NO: 105, or .SEQ 
ID NO:lC7, or the corresponding full-length amdno acid 
sequence, not to include fragm.ents consisting only of amiino 
20 acids 1-22 of SEQ ID NO: 13 or amino acids 1-33 of SEQ ID 

NO:107. 3y "fragment," is m.eant an am.ino acid sequence 
present in a kinase polypeptide. Preferably, such a 
sequence comprises at least 32, 45, 50, 50, 100, 200, or 300 
contiguous ammo acids of SEQ ID NO : 5 , SEQ ID NO : 6 , or SEQ 
25 ID NO : 7 , or of the corresponding full-length ammo acid 

sequence; at least 250, 255, 275, 300, or 400 contiguous 
amino acids of SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15,. OR 
SEQ ID NO: 10 5, or of the corresponding full-length amino 
acid sequence; at least 2-, 30, 35, 40, 50, 100, 200, or 300 
30 contiguous amino acids of SEQ ID NO: 13; at least 16, 25, 35, 

50, 100, 200, cr 300 contiguous amino acids of SEQ ID NO: 22, 
SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID NO : 3 1 or 
SEQ ID NO: 103, or of the corresponding full-length ammo 



BNSCWDCID: <WO 9953036A2J_> 



wo 99/53036 PCT/US99/08150 

acid sequence; 6 (preferably IG, more preferably 15, mos- 
oreferably 2 5) or more ccnciguous amino acids sec lorzn m 
Che ammo acid sequence of SEQ ID NO: 97 or SEQ ID NO: 99, 22 
(preferably 30, more preferably 35, m.ost preferably 45) or 
5 more contiguous amino acids set forth in the amino acid 

sequence of SEQ ID MO: 101; at lease 78, 35, 90, 100, 200, or 
3 00 concigucus am.ino acids of SEQ ID NO: 107, or the 
corresponding full-length am.ino acid sequence; or a 
f unccional derivative thereof . 
10 In a fourth aspecc , the invention features an isolated, 

enriched, or purified kinase polypeptide selected from, che 
group consisting of STLK2 , STLK3 , 5TLK4 , STLK5 , STLK6 , 
STLK7, ZCl, ZC2, ZC3 , ZC4 , KHS2 , SULUl , SULU3 , GEK2 , PAK4 , 
and PAK5 . 

15 By "isolated" in reference to a pciypepcide is m^eanc a 

polym.er of amiino acids (2 or more amiino acids) conjugated co 
each other, including polypeptides chac are isolated from a 
nacural source or chac are synthesized. The isolated 
polypeptides of che presenc invencicn are unique m the 

20 sense thac they are noc found in a pure or separaced state 

in nature. Use of che -erm: "isolated" indicaces that a 
naturally occurring sequence has been removed from, its 
normal cellular environment . Thus, the sequence may be-in a 
cell-free solucion or placed in a different cellular 

25 environm:ent . The termi does not imply that the sequence is 

che only amino acid chain present, but thac it is 
essenciallv free (abouc.30 - 95% pure ac lease) of ncn-ammo 
acid material naturally associated with it. 

3y one use of one cerm "enriched" in reference to a 

30 pciypepcide is meano chat zhe specific am.ino acid sequence 

constituoes a significantly higher fraction {2-5 fold) of 
one coral amino acid sequences present in che cells or 
solucion of incerest than m normal or diseased cells or in 

M 
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the cells from v;hich the sequence was caken. This could be 
caused by a person by preferencial reduction in the amount 
of ocher amino acid sequences present, or by a preferential 
increase in the amount of the specific amino acid sequence 
5 of incerest, or by a combination of che two. However, it 

should be noted that enriched does not imply than chere are 
no other amino acid sequences presenc, just thac the 
relative amount of the sequence of interest has been 
significantly increased. The term significant here is used 

10 to indicate that che level of increase is useful cc "he 

oerscn making such an increase, .and generally means an 
increase relative to ocher amiino acid sequences of about at 
least 2-fold; more preferably ac least 5- to 10-fold or even 
more. The term also does not im.ply that there is no am.ino 

15 acid sequence from other sources. The other source of amiino 

acid sequences may, for example, Gompi~ise amtino acid 
sequence encoded by a yeast or bacterial genome, or a . 
cloning vector such as pUC19. The term^ is meant: to cover 
onlv chose sicuations in which m.an has intervened to 

20 increase the proportion of the desired am.ino acid sequence: 

It is '.also advantageous for som.e purposes that an amano 
acid secuence be in purified formi. The cerm. " pur i f ied" , in 
reference to a polypepcide does noc require absolute purity 
(such as a homogeneous preparation) ; instead, it represents 

25 an indication that the sequence is relatively purer than in 

che natural environm.ent . Compared to the natural level this 
level should be at least 2-5 fold greater (e.g., in cerm.s of 
m.g/miL) . Purification of ac lease one order of magnitude, 
oreferably two or three orders, and more preferably four or 

30 five orders of magnitude is expressly contemplated. The 

subscance is creferablv free of concamination at a 
functionally significanc level, for example 90%, 95%, or 99% 
pure , 

1'^ 
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In preferred erJoodimencs , the kinase polypepcice is a 

fragmenc of che protein encoded by uhe amino acid sequence 
set forth in SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID NO : 7 , SEQ ID 
NO: 13, SEQ ID NO : 14 , SEQ ID NO: 15, SEQ ID NO: IS, SEQ ID 
5 NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 

NO: 31, SEQ ID NO: 97, SEQ ID NO: 99, SEQ ID NO: 101, SEQ ID 
NO: 103, SEQ ID NO: 105, or SEQ ID NO: 107, or tihe 
corresponding full-lengch amino acid sequences, not cc 
include fragrr.encs consisting only of amino acids 1-22 of SEQ 

10 ID NO: 13 or amino acids 1-33 of SEQ ID NO: 107. Preferably, 

che kinase polypeptide contains at least 32, 45, 50, 60, 
100., 200, or 300 contiguous amino acids of SEQ ID NO : 5 , SEQ 
ID NO : 6 , or SEQ ID NO : 7 , or the coi'responding full-length 
am.inc acid seauence; at lease 250, 255, 275, 300, or 400 

15 contiguous amino acids of SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID 

NO: 15, or SEQ ID NO: 105, or the corresponding full-lengnh 
amino acid sequence; at least 27, 30, 35, 40, 50, 100, 200, 
or 3 00 - contiguous am.ino acids of SEQ ID NO: IS; ac least 16, 
25, 35, 50, 100, 200, or 300 contiguous amino acids of SEQ 

20 ID NO: 22, SEQ ID NO : 2 3 , SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 

NO: 31, or SEQ ID NO: 103, or the corresponding full-length 
amine acid sequence; 6 (preferably 10, more preferably 15, 
mosc preferably 25) or more conciguous ammo acids set forth 
in the amdno acid sequence of SEQ ID NO: 97 or SEQ ID NO: 99, 

25 22 (oreferablv 30, more preferably 35, mosr preferably 45) 

or more contiguous amino acids set forth in che amtino acid 
sequence of SEQ ID NO: 101, or cne corresponding full-length 
amino acid seauence; or at lease 78. 85, 90, 100, 200, or 
3 00 contiguous amino acids of SEQ ID NO: 107, or a funccionai 

30 . derivative cherecf . 

In preferred errb>odiment s , the kinase polypeptide 
ccmorises an amino acid seauence having (a) che am.ino acid 
sequence sec forth in SEQ ID NO : 5 , SEQ ID MO : 6 , SEQ ID NO : 7 , 

2S 
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SEQ ID NO: 13, SEQ ID MO: 14, SEQ ID NO: 15, SSv ID NO: 13, SEC: 
ID NO: 22, SEQ ID NO: 23, SEQ ID MO: 24, SEQ ID NO : 2 9 , SEQ ID 
NO: 31, SEQ ID NO: 97, SEQ ID NO: 99, SEQ ID NO: 101, SEQ ID 
NO: 103, SEQ ID NO: 105, or SEQ ID NO: 107; (b) che amino acid 
5 sequence set forch in SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID 

NO: 7, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO : 1 5 , SEQ ID 
NO: IS, SEQ ID NO: 22, SEQ ID NO : 2 3 , SEQ ID NO: 24, SEQ ID 
NO: 29, SEQ ID NO : 3 1 , SEQ ID NO: 97, SEQ ID NO : 9 9 , SEQ ID 
NO: 10 3, SEQ ID NO: 105', or SEQ ID NO: 107, except that: it 
10 lacks one or more, bur not all, of che following segments of 

ammo acid residues: 1-21, 22-274, or 275-416 of SEQ ID 
N0:5, 1-31, 32-303, 309-439 or 490-516 of SEQ ID NO : 6 , 1-173 
or 179-41.4 of SEQ ID NO : 7 , 1-22, 23 -2S9, 290-526, 527-640, 
641-896, or 897-1239 of SEQ ID NO : 1 3 , 1-255, 256-442, 443- 
15 626 ,. 627-954 , or 955-1297 cf SEQ IDNO:14, 1-255, 255-476, 

477-6S0, 531-983, or 984-1326. of SEQ ID NO:15, 1-13, 14-273, 
274-345, 347-534, or 535-394 of SEQ ID NO:lS, 1-21, 22-277, 
273-427, 423-637, 633-751, or 752-893 of SEQ ID NO:22, 1-65, 
67-215, 216-425-, 425-539, 540-786-, or 787-837 of SEQ ID 
20 NO:23, 1-25, 26-273, 274-422, 423-632, or 633-743 of SEQ ID 

NO:24, 1-51, 52-224, 225-393, 394-658, or 659-681 of SEQ ID 
NO:29, 1-25, 26-231, 2S2-430, 431-640, 641-754, 755-901.-, or 
902-1001 cf SEQ ID NO : 3 1 , 1-10, 11-321, or 322-373 of SEQ ID 
NO:97, 1-57, 5S-369, or 370-413 of SEQ ID NO : 9 9 , 1-52, 53- 
25 173, 174-307, 303-572, or 573-591 of SEQ ID NO:103, 1-24, 

25-239, 290-397, 398-628, 629-663, 669-872, or 373-1227 of 
SEQ ID N0:105, or 1-33, 34-294, 295-337, 338-472, 473-724, 
or 725-958 of SEQ ID NO : 1 0 ; (c) the amine acid sequence sec 
forth in SEQ ID NO : 5 , SEQ ID NO : 5 , SEQ ID NO : 7 , SEQ ID 
30 N0:13, SEQ ID NO : 1 4 , SEQ ID NO : 1 5 , SEQ ID NO : 1 S , SEQ ID 

NC:22, SEQ ID NO : 2 3 , SEQ ID NO : 2 4 , SEQ ID NO : 2 9 , SEQ ID 
NO: 97, SEQ ID NO : 9 9 , SEQ ID NO : 1 0 3 , SEQ ID NO : 1 0 5 , or SEQ ID' 
NO:107 from amino acid residues 1-21, 22-274, or 275-415 of 
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SEQ ID MO:5, 1-31, 32-303, 309-489, or 490-516 of SEQ ID 
NC:6, 1-173 or 179-414 of SEQ ID NO : 7 , 23-289, 290-526, 527- 
640, 641-396, or 897-1239 of SEQ ID NO : 1 3 , 1-255, 256-442, 
443-626, 627-954, or 955-1297 of SEQ ID NO:14, 1-255, 256- 
5 475, 477-630, 631-983, or 984-1326 of SEQ ID NO : 1 5 , 1-13, 

14-273, 274-346, 347-534, or 535-394 of SEQ ID NO:lS, 1-21, 
22-277, 273-427, 423-637, 638-751, or 752-893 of SEQ ID 
NO:22, 1-66, 67-215, 216-425, 426-539, 54G-7S6, or 7S7-So7 
of SSQ ID MO:23, 1-25, 26-273, 274-422, 423-632, or 633-748 

10 of SEQ ID NO:24, 1-51, 52-224, 225-393, 394-558, or 659-631 

of SEQ ID NO:29, 1-25, 25-273, 274-422, 423-532, 633-745, 
747-993, or 994-1093 of SEQ ID NO : 3 1 , 1-lC, 11-321, or 322- 
373 of SSQ ID NO:97, 1-57, 58-369, or 370-413 of SEQ ID 
NO:99, 1-52, 53-173, 174-307, 303-572, or 573-591 of SEQ ID 

15 N0:103, 1-24, 25-2S9, 290-397, 398-628, 529-668, 659-372, or 

873-1227 of SSQ ID NO:105, or 1-33, 34-294, 295-337, 338- 
472, 473-724, or 725-963 of SEQ ID NO:107; or (d) che aniino 
acid sequence set forch in SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID 
NO: 7, SEQ ID NO : 1 3 , SEQ ID NO: 14,. SEQ ID NO : 1 5 , SEQ ID 

20 NO: 13, SEQ ID NO : 2 2 , SEQ ID NO : 2 3 , SEQ ID NO : 2 4 , SEQ ID 

NO:29, SEQ- ID NO : 3 1 , SEQ ID NO : 9 7 , SEQ ID NO : 9 9 , SEQ ID 
NOilOl, SEQ ID NO:103, SEQ ID NO:105, or SEQ ID NO : 1 0 7 , 
except* thac lacks one or more, buc noc all, of the 

domains selected from the group consr sting of a C- terminal 

25 domain, a cacalytic domain, an N-terminal domain, a spacer 

region, a prcline-rich region, a coiled-coil scructure 
region, an insert, and a C-terminal tail. 

The polypepcide can be isolaced from a natural source 
by mechods v/ell-kncv/n in che arc. The natural source may be 

30 mammalian, preferably human, blood; semen, or cissue, and 

the polypeptide may be synthesized using an aucomated 
polypeptide synthesizer. The isolated, enriched, or 
purified kinase polypeptide is preferably: a STLPC2 , STLK3 , 

30 
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STLK4 , 3TLK5, 3TLK5 . or STLK7 pclypepcide; a ZCl, ZZ2 , ZC: , 
or ZC4 polypeptide; a KHS2 polypepn ide ; a 5ULUi or 5ULU3 
poiypepcide; a GEK2 polypepcide; or a PAK4 or PA?C5 
polypeptide . 

5 In some embodiments the invention includes a 

recombinant kinase polypeptide selected from, che group 
consis-ing of STLK2 , STLK3 , STLK4 , STLK5 , STLK6 , 3TLK7 , ZCl , 
ZC2. ZC3, ZC4, ?:KS2, SULUl , 3ULU3 . GEK2 . PAK4 , and PAKE. 3y 
"reccmbinanc Kinase polypeptide" is m.eant a polypeptide 
10 vDroduced by recomibinanr: DNA techniques such thaL ic is 

distinct from, a nauurally occurring polypepcide either in 
ics location (e.g., present in a different cell or tissue 
than found in nature), purity or structure. Generally, such 
a recombinant: polypepcide v/ill be presenc m a cell in an 
15 amiount different from chac normally observed m nacure . 

In a fifth aspect, the invencion features an ancibody 
{e.g., a monoclonal or polyclonal antibody) having specific 
binding affinity to ' a kinase polypepcide or a kinase 
polvpeptide dom.ain or fragment v/here the polypeptide is' 
20 selecced from, the group consiscing of 3TLK2 , STLK3 ,■ STLK-- , 

STLKE , 3TLK6, 5TuK7 , ZCl , ZC2 , ZC3 , ZC4 , KHS2 , SULUl , 5ULU3 , 
GEK2, PAK4, and ?AK5 , By "specific binding affinity" is 
. meanc chat, tne antibody omds cc the targec kinase 

polypeptide with- greater affinity chan ic binds to other 
25 polypeptides under specified conditions. Ancibodies or 

antibody fragm.encs are polypeptides thac concain regions 
that can bind ocher polypepc ides . The term ''specific 
binding affinity" describes an ancibody thac binds co a 
kinase polypeptide v;ith greater affinity chan ic binds co 
30 ocher polypeptides under specified conditions. 

The term, ^^polyclonal" refers to antibodies chat are 
heterogenous populations of ancibody molecules derived from 
Che sera of anim^als immiunized v;ith an ancigen or an 

31 
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10 



15 



20 



25 



antigenxc functional Gerivar.ive thereof. for the prccuction 
of polyclon£il ar.-ibodies, various hose animals rr^ay be 



may be used -o increase che immunological response, 
depending on the host species . 

"[vicnoclonal antibodies" are subscancially homogenous 
pcpulacions of antibodies to a particular antigen. They may 
be obtained by any technique which provides for the 
production of ancibody m.olecuies by continuous cell lines in 
culture. Monoclonal antibodies may be obtained by mechods 
known co those skilled in the arc (Kohler ez al . , Nature 
256:495-497, 1975, and U.S. Patent No. 4,376,110, both of 
which are hereby incorporated by reference herein in their 
entirety including any figures, tables, or drav/ings) . 

The term, "antibody fragm^ent" refers to a portion of an 
antibody, often the hyper variable region and portions of 
the surrounding heavy and light chains, that displays 
specific binding affinity for a particular molecule. A 
hyper variable region is a portion of an antibody that 
physically binds to the polypeptide target. 

Antibodies or antibody fragm.ents having specific 
binding affinity to a kinase polypeptide of the invention 
miay be used in m^ethods for detecting the presence and/or 
amount of kinase polypeptide m a sample by probing the 
sam.ple v/ith the antibody under conditions suitable for 
kinase - ant ibody immunocomplex formation and detecting the 
presence and/or am.ount C'f the antibody conjugated to the 
kinase polypeptide. Diagnostic kits for performing such 
methods may be tonstructed to include antibodies or antibody 
fragments specific for the kinase as well as a conjugate of 
a binding partner of the antibodies or the antibodies 
themselves . 



i mm/un i z e d by i 



niection with the antigen. 



Various ad j uvant s 
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An ancibody or ancibody fragment v;ith specific binding 
affinity zo a kinase polypeptide of che invention can be 
isolated, enriched, or purified from a prokaryocic or 
eukarvc-tic organisnrt. Roucine methods knov/n uc- chose skilled 
5 in the art enable production of antibodies or antibody 

fragments, in boch prokaryotic and eukaryotic organisms. 
Purif icacion , enrichment, and isolation of ancibodies, v;hich 
are polypepcide molecules, are described above. 

Antibodies having specific binding affmicy co a kinase 

10 polyoeptide of the invention may be used m mechods for 

deteccing the presence and/or amount of kinase pclypepcide 
in a sample by contacting the sam.ple v;ith' the antibody under 
condicions such thac an. immunocomplex forms and dececting 
the presence and/or amount of the antibody conjugated to the 

15 kinase polypeptide.. Diagnostic kits for performing such 

m.ethods m.ay be constructed to include a first container 
containing the antibody and a second container having a- 
coniugate of a binding partner of the antibody and a label, 
such as', for exam.ple.,. a' radioisotope.. The diagnostic kit 

20 may also include notification of an FDA approved use and . 

instructions therefor. 

In a sixth aspect, the invention features a hybridoma 
which produces an antibody having specific binding affinicy 
CO a kinase polypeptide or a kinase polypeptide dom.am, 

25 where the polypeptide is selected from the. group consisting 

of STLK2, STLK3, 3TLK4 , STLK5 , STLK5 , STLK7 , ZCl , ZC2 , ZC5 , 
ZC4 , KKS2, SULUl , SULU3 , GEK2 , PAK4 , and ?AK5 , 3y 
"hybridoma'' is meant an immortalized cell line chat is 
caoable of secreting an antibody, for example an antibody to 

30 .a kinase of the invention. In preferred embodimencs, the 

antibody to the kinase comprises a sequence of amiino acids 
that is able to specifically bind a kinase polypeptide of 
che mvencion. 
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In a seventh aspect; the invention features a kinase 

polypepcide binding agent able to bind to a kinase 
polypeptide selected from the group consisting of STLK2 , 
STLK3. 3TLK4, 3TLK6 , STLK7 , STLK5 , ZCl , ZC2 , ZC3 , ZC4 , KESl , 
SULUl, SULU3 , GEK2 , PAK4 , and PAK5 . The binding agenc is 
preferably a purified ancibody that recognizes an epitope 
present on a kinase polypeptide of the mvencion. Other 
binding agenc s include molecules that bind to kinase 
oolypeptides and analogous molecules that bind to a kinase 
polypeptide. Such binding agents may be identified by using 
assays uhat Treasure kinase binding partner activity, such as 
those chac measure PDGFR accivity. 

The invention also features a m.ethcd for screening for 
human cells containing a kinase polypeptide of the invencion 
or an equivalenc sequence. The m.ethod involves identifying 
the novel polypeptide in human cells using techniques that 
are rcucine and standard in the arc, such as those described 
herein for identifying the kinases of the invention [e.g., 
cloning, Southern or Northern, blot . analysis , in situ 
hybridizacicn, ?CR amplif icacion , ecc . ) . 

In an eighth aspect, the invencion features mechods for 
idencifying a substance cnat modulates kinase activity 
comprising the steps of: -(a) contaccmg a K'ln^se polypeptide 
selected from the group consisting of STLK2 , STLK3 , STLK4 , 
3TLK5, STLK6, STLK7 , ZCl . ZC2 , ZC3 , ZC4 , KHS2 , SULUl , SULU3 , 
GEK2 , ?AK4 , and PAK5 v;ith a test substance; (b) measuring, 
the activity of said polypeptide; and (c) determining 
whecher sa.id substance modulates the accivicy of said 
polypepc ide . 

The cerm ''modulates" refers cc che abilicy of a 
comocund to alter the function of a kinase of the invention. 
A modulator preferably activates or inhibits che activity 
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cf a kinase of che invention depending on the concencrac ^on 
of the compound exposed co the kinase. 

The cerm "accivates" refers co increasing the ceiiuiar 
acciviry of the kinase. The cerin inhibic refers co 
5 decreasing the cellular accivicy cf che kinase. Kinase 

accivitv is preferably the inceraccion wich a nacural 
binding parcner. 

The term "modulates" also refers to altering the 
funct:Lon of kinases of the invention by increasing or 
10 decreasing the probability that a complex forms becween the 

kinase and a natural binding parcner. A modulacor 
preferably increases the probability that such a com^plex 
forms be-ween che kinase and che natural binding parcner. 
more oreferably increases or decreases . the prcbabilicy chac 
15 • a complex forms becv/een the kinase and che nacural binding 
partner depending on the concentration of che compound 
exposed to che kinase, and most preferably decreases che 
probabilicy thac a complex forms becween che kinase and che 
natural binding partner. . . .. 

20 The -ermi ^^complex" refers to an assem^bly of ac- lease 

two molecules bound to one another. Signal cransduction 
complexes ofcen contain ac least two protein molecules bound 
CO one anocher. For mscance, a protein tyrosine receptor 
protein kinase, .GR32 , SOS, RAF, and RAS assemble co form a 
25 signal cransduction complex in response co a mitogenic 

ligand. 

The cerm. "natural binding parcner" refers to 
polypeptides, lipids, small molecules, or nucleic acids chat 
bind to kinases in cells. A change in che interaction 
30 becv/een a kinase and a nacural binding partner can manifest 

icself as an increased or decreased probabilicy chat che 
inceraccion formis, or an increased cr decreased 
concentracicn cf kmase/nacural binding parcner complex. 
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The cern^ ''concaccing" as used herein refers cc mixing a 

sclucxon comprising the tesz compound with a liquid raediurn 
baching the cells of che rriBzhods , The solution cornprismg 
che corr\pound may also comprise another component, such as 
diraethyl sulfoxide {DMSO) , which facilitates che uptake of- 
the test compound or compounds into the cells cf zhe 
m:echods . The solucion comprising the tesc comp'Ound may be 
added zo che medium baching che cells by uc ill zing a 
delivery apparacus, such as a pipe t -based device or syringe - 
based device . 

In a ninth aspect, che invention feacures mechods for 
idencifying a substance thac modulaces kinase accivity in a 
ceil compr-ising the steps of: (a) expressing a kinase 
pclypepcide in a cell, v;herein said polypeptide is selected 
from che group consisting of STLK2 , STLK3 , STLK4 , 3TLK5 , 
£TLK£ , STLK7 , ZCl, ZC2 . ZC3 , ZC4 , KH£2 , 3ULU1 , SULu3 , GEK2 , 
PAK4 , and PAK5 ; (b; adding a cest substance to said cell; 
and (c) monitoring a change in cell phenotype or the 
mteraccion between said polypeptide and a nacural binding 
partner . 

The cerm "expressing'' as used herein ' refer s co che 
production of kinases of che invention from a nucleic acid 
veccor containing kinase genes wichin a cell. The nucleic 
acid vector is cransfected into cells using v/ell knov;n 
techniques' in the art as described herein. 

In a tenth aspect, che invention provides mechods for 
treating a disease by administering to a patient in need of 
such creatmenc a substance chac modulaces the accivity of a 
kinase selected from the group consisting of 3TLPC2 , 3TLK3 , 
3TLK4, STLK5, STLK'5 , STLK7 , ZCl , ZC2 , ZC3 , SC4 , KHS2 , 3ULU1 , 
SULU3, GEK2, ?AK4 , and ?AK5 . Preferably, che disease is 
selecced from the group consiscing of im.m^une- relaced 
diseases and disorders, organ cransplancac ion , .miyocardiai 

3u- 



wo 99/53036 



PCT/US99/08I50 



mfarccicn, cardiovascular disease, scroke, renal failure, 
oxidative sc ress - relaced neurodegenerative disorders, and 



disorders include, but are not limited to, rheumatoid 
arthritis, artherosclerosis , and autoimmune disorders. 

In preferred errJoodiment s , the invention provides 
methods for treating or preventing a disease or disorder by 
adrr.inisterinvg to a patient in need of such treatment a 
substance that modulates the activity of a kinase 
polyoectide selected from the group consisting of ZCl, ZC2 , 
ZC3 , ZC4 , p:HS2, PAK4 , and- PAK5 . Preferably, the disease or 
disorder is selected from the group consisting of rheumatoid 
arthritis, artherosclerosis, autoim.m.une disorders, and organ 
transplantation. The invention also features methods of 
treating or preventing a disease or disorder by 
administering to a patient in need of such treatment a 
substance that modulates the activity of a kinase 
oolvoeotide selected from the group consisting of STLKl „ 
STLp:2, STLK3 , . STLK4, STLK5 , 3TLK6 , and. STLK7 . Preferably 
the disease or disorder is selected from, the group 
consisting of im.m.une - related diseases and disorders, 
myocardial infarction, cardiomyopathies, stroke, renal 
failure, and oxidative stress - related neurodegenerative 
disorders. Most preferably, the im.m.une -related diseases and 
disorders are selected from the group consistixng of 
rheumatoid arthritis, chronic inflammatory bov;el disease, 
chronic inf lam.m.a t cry pelvic disease, multiple sclerosis, 
asthmia, osteoarthritis , psoriasis , atherosclerosis, 
rhinitis, autoim.m/uni ty , and organ transplantation. 

The invention also features methods of treating or 
preventing a disease or disorder by admtinistering to a 
oatient m need of such treatment a substance that modulates 
the activity of a kinase polypeptide selected from the group 



cancer . 



Most creferably, the imimune - related diseases and 
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consiscing of ZCl , ZC2 , ZC2 , and ZC4 . Preferaoiy che 
disease is selected from che group consiscing of immune- 
related diseases and disorders, cardiovascular disease, and 
cancer. Mosc preferably, the immune- related diseases and 
disorders are selected from the group consisting of 
rheumatoid arthricis, chronic inf lammatOi~y bov/el disease, 
chronic inf lam.matory pelvic disease, multiple sclerosis, 
asthma, osteoarthricis , psoriasis , atherosclerosis , 
rhinitis, autoimmunity, and organ transplancacion , 

Substances useful for treatm;ent of kinase - related 
disorders or diseases preferably show positive results in 
one or m.ore in vitro assays, for an activity corresponding to 
treatment of the disease or disorder in question (Exam.cles 
of s\:.crL assays are provided in the references m section VI, 
belov;; and in Example 7, herein) . Examples of substances 
that can be screened for favorable activity are provided and 
referenced in section VI, below. The substances that 
m.odulate the activity of the kinases preferably include, but 
are not limited to, antisense oligonucleotides and 
inhibitors of protein kinases, as determined by m,ethods and 
screens referenced in section VI and Example 7, below. 

The term "preventing" refers to decreasing the 
probability that an organism contracts or develops an 
abnormal condition . 

The termi "treating" refers to having a therapeutic 
effect and at least partially alleviating c^r abrogating an 
abnormal condition m the organism. 

The term "therapeutic effect" refers to the inhibition 
or activation factors causing or ' contributing to the 
abnormal conditic-n. A therapeutic effect relieves to some 
extent one or more of the symptoms of the abnormal 
condition. In reference to the treatment of abnormal 
conditions, a therapeutic effect caz-: refer tc^ one or more of 
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-he rol lowing: (.a) 
and/or dif f erenciat ion of cells; (b} inhibidon (i.e., 
slowing or stopping) of cell death; (c) inhibition of 
degeneration; (d) relieving to some extenc one or more of 
the symptoms associated v/ith the abnormal condition; and (e) 
enhancing the function of the affected population cf cells. 

Comocunds demonstrating efficacy against abnormal 
conditions can be identified as described herein. 

The term ''abnormial condition" refers to a function in 



Che cells 02 



-issues of an organism* that deviates f; 



norm.al functions in that organismt. An abnormal conctici^: 
can relate to cell proliferation, cell differentiation, 



15 



20 



2d 



Abncrm^al cell prol i f erat i\^e conditions include cancers 
such as fibrctic and miesangial disorders; abnormal 
angiogenesis and vasculogenesis , wound healing, psoriasis,, 
diabetes m.ellitus, and inf lammiat ion . 

Abnorm.al differentiation conditions include, but are 
not lim.ited to neurodegenerative .disorders, slow wound 
healing rates, and slow tissue grafting healing rates.- 

Abnormal cell survival conditions relate to conditions 
m which programmed cell death (apoptosis) pathways are 
activated or abrogated. A number of protein kinases are 
associated with the apoptosis pathv/ays. -Aberrations in che 
function cf any one of the protein kinases could lead to 
cell immortality or premature cell death. 

The term, ''aberration", in conjunction with the function 
of a kinase in a signal transduction process, refers to a 
kinase that is over- or under-expressed in an organism, 
mrutated such that its catalvcic activity is lower or higher 
than wild-type protein kinase activity, m.utated such that it 
c a. n no 1 o no e r interact with a natural b 1 no 1 ng p a. r t n e r , is no 
lonaer modified bv another protein kinase or protein 
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phcsphatasa, or no longer interacts with a nacural binding 

partner. 

The terni "adminiscering" relates to a method of 
incorporating a compound into cells or tissues of an 
5 organism. The abnorrriai condition can be prevented or 

treated when the cells or tissues of the organism exist 
'A'ithin the organism or outside of the organism. Cells 
eMxsting outside the organism can be maintained or grown in 
cell c^ilcure dishes. For cells harbored within the 

10 organj-sm, many techniques exist in the art to admiinister 

compounds, including (but -not limited to) oral, parenteral, 
dermal, injection, and aerosol applications. For cells 
outside of the organism, multiple techniques exist in the 
art to administer the compounds, including (but not limited 

15 to) cell m.icroini ect ion techniques, t I'ansf ormat ion 

techniques, and carrier techniques. 

The abnormal condition can also be prevented or treated 
by , admini stering a compound to a group of cells having an 
aberration in a signal transduction pathway to an organism. 

20 The effect of adm.inis tering a, compound on organism, function 

can then be monitored. The organism, is preferably a mouse, 
rat, rabbit, guinea pig, or goat, more preferably a monkey 
or ape, and mosc preferably a human. 

In an eleventh aspect, the invention features methods 

25 for detection of a kinase polypeptide in a samiple as a 

diagnostic tool for diseases or disorders, -wherein the 
m.ethod comiprises the steps of: (a) contacting the samiple 
vvith a nucleic acid probe which hybridizes under 
hybridization assay conditions to a nucleic acid target 

30 region* of a kinase polypeptide selected from: the group 

consisting of STLK2 , STLP;3 , STLK4 , 5TLK5 , STLK6 , STLK7 , 2C1 , 
ZC2, ZC3, ZC4, KHS2, SULUl , SuLU3 , GEK2 , ?AK4 , and ?AK5 , 
sa.id probe com;prising the nucleic acid seauence encodinq the 

4i) 
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polypeptide, fragments Thereof, and the conplerrtenc s of the 
sequences and fragments; and (b) detecting che presence or 
amounc of che probe : nargec region hybrid as an . indicacion of 
the disease. 

5 In preferred embodiments of the invention, the disease 

or disorder is selected from the group consisoing of 
rheumatoid arthritis , artheroscleros i s , aucc immune 
disorders, organ transplantation, myocardial infarction, 
cardiomyopathies, stroke, renal failure, oxidative scress- 

10 related neurodegenerative disorders, and cancer. In other 

preferred embodiments, the kinase pclypepcide is selected 
from, the group consiscing of PAK4 and PAKE, or the 
polypepoide is selected from: the group consisting of ZCl, ■ 
ZC2, ZC3 , and ZC4 , and the disease is cancer. 

15 The kinase "target region" is the ' nucleoc ide base 

sequence set forth m SEQ ID NO : 1 , SEQ ID NO : 2 , SEQ ID NO : 3 , 
3SQ ID NO: 9,' SEQ ID NO : 1 0 , SEQ ID NO: 11, SEQ ID NO : 1 7 , SEQ 
ID NO: 19, , SEQ ID NO : 2 G , SEQ ID NO : 2 1 , SEQ ID NO : 2 7 , SEQ ID 
■NO:96, -SEQ ID.NO:98, -SEQ ID NO : 1 C 0 , SEQ ID NO : 1 0 2 , . SEQ ID 

20 NO: 104, or SEQ ID NO: 106, or the corresponding full-length"' 

sequences, a funccional derivative thereof, or a fragm.ent 
chereof to v;hich the nucleic acid probe v/ill specifically 
hybridize. Specific 'hybridization mdicaoes that: in the ■ 
presence of other nucleic acids the probe only hybridizes 

25 detectably with -the kinase of the invention's carget region. 

Putacive carget regions can be identified by miethods well 
knov;n m the art consiscing of alignm.enr and com.parison of 
the most closely relaced sequences m the dacabase . 

In preferred embcdimiencs the nucleic acid probe 

30 hybridizes to a kinase targec region encoding -at least 6, 

12, 75, 90, 105, 120, 150, 200,. 250. 300 or 350 conciguous 
amiino acids of zhe sequence sec forth m SEQ ID NO : 5 , SEQ ID 
NO: 6, SEQ ID NO : 7 , SEQ .ID NO: 13, SEQ ID NO: 14, SEQ ID NO : 1 5 , 

Hi 
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SEQ ID NO: 13, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ 
ID NG:2r, SEQ ID NO : 3 1 , SEQ ID NO:97, SEQ ID N0:9?, SEQ ID 
NO:101, SEQ ID NO:103, SEQ ID NO:105, or SEQ ID NO:i07, or 
the corresponding full-length ammo acid sequence, or a 
functional derivative cherecf . Hybridization conditions 
should be such that: hybridization occurs only v/ich the 
kinase genes in the presence of other nucleic acid 
niolecules. Under stringenc hybridization condicions only 
highly coniplementary nucleic acid sequences hybridize. 
Preferably, such condicions prevent hybridization of nucl-iic 
acids having more chan 1 or 2 mismacches ouc of 20 
contiguous nucleotides. Such conditions are defined supra. 

The diseases for which deteccion of kinase genes in a 
sample could be diagnostic include diseases in which kinase 
nucleic acid (DNA and/or RNA) is amplified in comparison to 
norm-al cells. By "amplification" is mieant increased numJoers 
of kinase DNA or RNA m a cell comipared with normial cells. 
In normal cells, kinases are typically found as single copv 
genes. In selected diseases, the chromosom.al iocacion of 
the kinase genes may be amplified, resulting in multiple 
copies of the gene, or amplif icacion . Gene amplification 
can lead to amplification of kinase RNA, or kinase RNA can 
be amplified m che absence of kinase DNA ampl if icacion . 

"Amplif icacion" as it refers to RNA can be che 
detectable presence of kinase RNA in ce^ls, since in som.e 
ncrmial cells chere is no basal expression of kinase RNA. In 
ocher normial cells, a basal level of expression of kinase 
exists, therefore in chese cases amplification is the 
detection of ac least 1-2 -fold, and preferably more, kinase 
RNA, compared to the basal level . 

The diseases chac could be diagnosed by detection of 
kinase nucleic acid in a sample preferably include cancers. 
The test samples suitable for nucleic acid probing methods 
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of the presenc invention include, for example, cells or 

nucleic acid extraccs of cells, or biological fluids. The 
samples used in the above -de scribed me c hods will vary based 
on the assay format, the detection m.echod and the nature of 
5 the tissues, cells or extracts co be assayed. Methods for 

preparing nucleic acid extracts of cells are v/ell knov/n in 
the art and can be readily adapted in order to obtain a 
sample that is compatible v;ith che m.echod ucilized. 

In a final aspect, the invencion feacures a miethod for 

10 detection of a kinase polypeptide in a samiple as a 

diagnostic tool for a disease or disorder, wherein the 
method comiprises: (a) comtparing a nucleic acid target 
region encoding the kinase polypeptide in a sample, v;here 
,"he kinase polypeptide is selected fromi the group -consiscing 

15 of .STLK2, STLK3 , STLK4 , STLK5 , STLKc , STLK7 , ZCl, ZC2 , ZC3 , 

ZC4., KHS2, SULUl, SULU3 , GEK2 , PAK4 , and PAK5 , or one or 
more fragmtents thereof, v/ith a control nucleic acid target 
region encoding the kinase polypeptide, or one or more 
fragments thereof; and (b) detect ing differences in sequence 

20 . or amount between the target region and the .control target 
region, as an indication of the disease or disorder. 
Preferably, the disease or disorder is selected from the- 
group consisting of im^m/une- re laced diseases and disorders, 
organ transplantation, myocardial infarction, cardiovascular 

25. disease, stroke, renal failure, oxidative stress - related 

neurodegenerative disorders, and- cancer. Immiune - related 
diseases and disorders include, but are not limtited to, 
those discussed previously. 

The termi "comparing" as used herein refers to 

30 identifvina discreoancies between the nucleic acid target 

region isolated fromi. a samiple, and the control nucleic acid 
target region. The discrepancies can be in the nucleotide 
seauences, e.gr. insertions, deletions, or point mutations, 

■4:3 
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or in che amcunu of a given nucleotide sequence. Mechccs tc 
decermine these discrepancies in sequences are weil-kncv/n zc 
one of ordinary skill in the art. The "control" nucleic 
acid cargec region refers co the sequence or arrounc of r.he 
sequence found in normal cells, e.g. cells that are noi: 
diseased as discussed previously. 

The invencicn has been described broadly and 
generically herein. Each of the narrov/er species and 
subgeneric groupings falling within the generic disclosure 
also form part of zhe invencion. This includes the generic 
description of the invencion with a proviso or negative 
limication removing any subject matter from the genus, 
regardless of v;hether or not the excised macerial is 
specifically recited herein. For example, in some instances 
the nucleotide sequence of "he ZC4 kinase polypeptide may 
noc be pare of a preferred embodiment. 

The summ.ary of the invention described above is noc 
limiting and ocher feacures and advantages of the invencion 
v;ill be apparenc from the following detailed description of 
the invention, and from che claims. 

BRIEF DESCRIPTION QF THE FIGURES 

Figure 1 shows a mulciple sequence alignment: of zhe 
amino acid sequences of the STE20-STE20 family kinases. 

Figure 2 shows a m.ultiple sequence alignm^ent of the 
ammo acrd sequences of che STE20-STLK5 famil\' kinases. 

Figures 5 A and 3 3 shov; a multiple sequence alignm.ent of 
rhe ammo acid sequences of STE20-ZC family kinases. 

Figure 4 shcv;s. a pairv;ise sequence alignmenc of STE20- 
KH3 family kinases. 

Figure 5 shews a mulciple sequence alignment of che 
arnica acid sequences ' cf STE20-SULU family kinases. 
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Figure 5 shows a pairwise sequence alignmenc of STE2 0- 
GEK family kinases 

Figure 7 shows a multiple sequence alignment of che 
amino acid sequences of STE20-PAK family kinases. 

Figures 3A, S3, 3C, SD, and 3E show che am.mo acid 
sequences of hum.an STLK2 , hum.an STLK3 , human STLK4 , human 
STLK5 , human ZCl^-hum^an ZC2 , human ZC3 , human ZC4 , human 
KHS2, human SULUl , human SULU3 , m.urine" SULU3 , hum.an GEK2 , 
hum^an ?AK4 , and human ?AK5 . 

Figures 9A., 9B, 9C , 9D, 9E, 9F, 9G , 9H, 91. 9J, 9?:. 9L, 
9M, 9N, 90, and 9? shov; the nucleic acid sequences of hum.an 
STLK2 , human STLK3 , human STLK4 , human STLK5 , human ZCl, 
humian ZC2,_ hum.an- ZC3 , human ZC4 , human KHS2 , humian 3ULU1 , . 
human 3ULU3 , murine SULU2 , human GEK2 , human PAK4 , and human 
PAK5 . 

Figures lOA and lOB show the full-lengoh amino acid 
se.q^s.~z\CBB of hum.an STLrCB (SEQ ID NO: 97), human ?AK5 (SEQ ID 
NO:103;, and human ZC4 (SEQ ID NO:105), as well as the 
parcial .amino acid sequences of human, ful 1 - 1 ength STLK6 (SEQ 
ID NO: 99) and human STLK7 (SEQ ID NO: 101) . ■ 

Figures llA., HE, IIC, and IID show the full-length 
nucleic acid sequences of human STLK5 (SEQ ID NO:96), human 
PArCE (SEQ ID NO:102), and ^rwAmar. ZC4 (SEQ ID NO:104), as well 
as the partial nucleic acid sequences of human STLK6 (SEQ ID 
NO: 95} and human 3TLK7 (SEQ ID NO: 100) . 

Figure 12 shows a m:ultiple sequence alignmenc among 
hum.an SPAK, humian STLK6 , hum.an STLK7 and full-length human 
STLK5 . 

Figure 13 shows a m.ulciple s^ou^no^ alignment am:Ong 
human PAKl , human PAK- and human PAK5 . 

Figures 14A and 143 show a pa.ir-wise seqi-ience alignmienc 
bee ween hum.an ZOl and hum-an ZC4 . 



Figure 15 shews a pair-wise sequence aiignr:ien~ cezv/een 
LCKl and full-length GEK2 . 

DETAILED DESCRIPTION OF THE INVENTION 

5 The prssenc invention relates in part to kinase 

polypeptides, nucleic acids encoding such polypeptides, 
cells concaining such nucleic acids, antibodies to such 
polypeptides, assays utilizing such polypepc ides , and 
methods reiacing to all of the foregoing. The present 
10 invencion is based upon che isolacion and characterization 

of ne*A' kinase polypeptides. The polypeptides and nucleic 
acids may be produced using well-known and standard 
synthesis' 'techniques when given the sequences presented 
herein. 

15 The recent elucidation of che DNA sequence of 

Saccharomyces cere^^esiae has provided the first corrplete 
example of the genecic information contained in a simiple 
eukaryotic organism; . Analysis of this yeast genome revealed 
that ic contains at least 113 protein kinases. These 

20 kinases v;ere further subdivided into several structurally 

related groups. One of these newly defined groups was 
termed che STE20- family to represent its founding m.ember 
STE20, which is a protein kinase involved in the yeast 
pheromone response pathway that initiates a protein kinase 

25 cascade in response to a G-protein mtediated signal. S. 

cerevesiae has two additional m.emJDers of this family, CLA4 , 
and YOL113W (HRA655: . 

Several mammalian homologues have recently been 
identified that belong to the STS2 0 - f ami ly , including SOK-1 

30 (human STE20) , GC-kinase, KHS , H?K1 , NI?:, SLK, GEK, PAKl , 

PAK65, MSTl, and CDC/. Furthermore, the Drosophila and che 
C. elegans genomie efforts nave identified additional protein 
kinases which belong to the STE20-f amiiy , yet have 
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Suruccurally unique ext racacalyc i c domains, including 
ZC504.4 and SULu kinases from C. elegans, and MINAC or 
Drosophila . 

3TE2 0 -related protein kinases have been implicated as 
regulating a variety of cellular responses, including 
response co growth .factors or cytokines, oxidative-, U'v - , or 
irradiation-related stress pathways, inflammacory signals 
[i.e., TNFa) , apcpcccic stimuli (i.e., Fas), T and B cell 
cosciiTiUiation, the control of cycoskeletal archicecture , and 
cellular Transformation. Typically, the STE20-relat:ed 
kinases serve as upscream regulacors of MJ\?K cascades. 
Examples include: HPKl, a protein - serine/ threonine kinase 
(STK) chat. ..possesses a STE2G-like kinase domain chac 
activates a procein kinase pathway leading to the stress - 
accivared protein kinase SAPK/JNK; PAKl , an STK with an 
uoscream CDC42 -binding dom.ain thac interaccs v/ich Rac and 
plays a role m cellular crans format ion chrough che Ras-MAPK 
pathway; and m.urine NIK, which interaccs wich upstreami 
receptor tyrosine kinases and connects v;ich downscream. 
STEll-family kinases. 

The 3TE2 0 - kinases possess a variecy of non-cacalyt ic 
dom.ains chat are believed to interact v/ich upstiream. 
regulators. Examples include prol ine - rich domains for 
inceraccion wich SH3 -cone aining proceins; or specific 
dom.ains for mteraccion v;ith Rac, Rho, etnd Rab sm.all G- 
proteins. These interactions may provide a m.echanismL for 
cross -calk becween distinct biochemical pachv/ays in response 
CO excernal stimuli such as the accivacion of a variety of 
cell surface recepccrs, including cyrosine kinases, cytokine 
recepccrs, TNF recepccr, Fas, T cell recepcors, CD23, or 
CD4C . 
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1. The Nucleic Acids of che Invencicn 

Included v;itihin che scope of this invencion are- che 
functiional equivalents of the herein-described isolated 
nucleic acid molecules. The degeneracy of che genecic code 
5 permits substitution of certain codons by other codons that 

specify the same amino acid and hence v/ould give rise to the 
same protein. The nucleic acid sequence can vary 
substantially since, v/ith the exception of methionine and 
tryptophan, the knov/n amino acids can be coded for by more 

10 than one coacn . .Thus, portions or all of the kinase genes 

of the invention could be synthesized to give nucleic acid 
sequence significantly different fromi that shoivn in 5EQ ID 
NG:1, SEQ ID NO : 2 , SEQ ID NO : 3 , SEQ ID NO : 9 , SSQ ID NO: 10, 
SEQ ID NO: 11, SEQ ID NO: 17, SEQ ID NO : 1 9 , SEQ ID NO : 2 0 , SEQ 

15 ID NO: 21, SEQ ID NO : 2 7 , SEQ ID NO: 96, SEQ ID NO: 93, SEQ ID 

NO:100, SEQ ID NO:102, SEQ ID NO:104, and SEQ ID NO:106. 
The encoded amino acid sequence thereof ivouid, hov/ever, be 
preserved . 

In addition, the nucleic acid sequence may comprise a 
20 nucleotide sequence v;hich results from the addition, 

deletion or substitution of at least one nucleotide to the 
5 ' -end and/or the 3 ' -end of the nucleic acid formula shov/n 
in SEQ ID N0:1, SEQ ID NO : 2 , SEQ ID NO : 3 , SEQ ID NO : 9 , SEQ 
ID NO: 10, SSQ ID NO : 1 1 , SEQ ID NO: 17, SEQ ID NO : 1 9 , SEQ ID 
25 N0:2C, SEQ ID NO:21, SEQ ID NO : 2 7 , SEQ ID NO : 9 6 , SEQ ID 

^NO:9:B, SEQ ID NO: 100, SEQ ID NO: 102, SEQ ID NO: 104, or SEQ 
ID NO: 106, or a derivative thereof. Any nucleotide or 
polynucleotide may be used in this regard, provided that its 
addition, deletion or substitution does not alter the am.ino 
30 acid sequence of SEQ ID NO : 5 , SSQ ID NO : 6 , SEQ ID NO : 7 , SEQ 

ID N0:13, SEQ ID NO:14, SEQ ID NO : 1 5 , SEQ ID NO : 1 S , SEQ ID 
NO:22, SSQ ID NO:23, SEQ^ ID NO : 2 4 , SEQ ID NO : 2 9 , SEQ ID 
NO: 9", SEQ ID NO : 9 9 , SEQ ID NO : 1 0 1 , SEQ ID NO : 1 0 3 , SEQ ID 

MS 
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NO: 105, or SEQ ID M0:i07, which is encoded 'cy che nucleocide 
sequence. For example, the present invention is xnzend^z tz 
include any nucleic acid sequence resulcing from che 
addition of ATG as an initiation codon at the 5 ' -end of che 
inventive nucleic acid sequence or its derivative, or from 
the addition of TTA, TAG or TGA as a tex-niinat ion codon at 
hhe 3 ' -end of the inventive nucleotide sequence or its 
derivative. Moreover, the nucleic acid molecule of the 
present invention may, as necessary, have restriction 
endonuclease recognition sites added to its 5 '-end and/or 
3 ' -end . 

Such functional alterations of a given nucleic acid 
seauence afford an opportunity to promote secretion and/or 
orocessinQ of heterologous proteins encoded by foi~eign 
nucleic acid seauences fused thereto. All variations or the 
nucleotide sequence of the kinase genes of the invention and 
fragments thereof permitted by the genetic code are, 
therefore, included in' this invention. 

Further,, it is possible to delete codons or to 
substitute one or more codons v/i-h codons other than 
degenerate codons to produce a structurally modified 
polypeptide, but one which has substantially the sam.e 
utility or activity as the polypeptide produced by the 
unmodified nucleic acid molecule. As recognized in the art , 
the two polypeptides are functionally equivalent, as are the 
two nucleic acid molecules that give rise to their 
production even though the differences between the nucleic 
acid molecules are not related to the degeneracy of tne 
gene t ic code . 

Mammalian STLPC2 

The full-length human 3TLK2 cDNA (SEQ ID NO : 1) is 3268 
bo long and consists of a 1243 bp open reading fram.e (ORF) 
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flanked by a 181 bp 5' uncranslaced reaion (UTR; 1-131) a^d 
a 1734 bp 3' UTR (1433-3216) chat is folloived by a 52 
nuclsctide polyadenylated region. A polyadenylacion signal 
(AATAAA) is found at positions (3193-3193) . The sequence 
5 flanking the first ATG conforms to the Kozak aonseri3U3 

(Kozak, M., Nucleic Acids Res. 15, 8125-314S (19S7)) for an 
initiating methionine, and is believed to be the 
translat lonal start site for STLK2 , Furthermore, human 
STLK2 and the related SOK-1 and MST3 proteins conserve the 
10 amino acid sequence imtmediately following this presumed 

initiating methionine. 

Several EST fragments span the comoiete STLK2 secuence 
with AA191313 at -he 5' end and W16504 at the 3' end. 

15 Mammialian STLK3 

The partial human STLK3 cDNA (SEQ ID NO : 2 ) is 3030 bp 
long and consists of a 154S bp OR? flanked by a "1476 bp 3' 
UTR (1550-3025) and a 5 nucleotide polyadenylated region. A 
potential polyadenylacion signal (AATAAA) begins at position 

20 3004. Since the coding region is open throughout the 5' 

extenc of this sequence, this is apparently a partial cDNA 
clone lacking the N-terminal start methionine. 

Multiple EST fragments span the complete STLK3 sequence 
with AA273967 at the 5' end and AA628477 and others at the 

25 3 ' end . . 

Mam.malian 3TLK4 

The partial human STLK4 cDNA (SEQ ID NO: 3) is 3S57 bp 
long and consists of a 1242 bp ORF flanked by a 2596 bp 3' 
30 UTR (1244-3339= and an IS nucleotide polyadenylated region. 

A potential polyadenyla t ion signal (AATAAA; is found at 
positions 218 1-3322. Since the coding region is ocen 
throughout the 5' extenc of this sequence, this is 

5c 
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apparently a partial cDNA clone lacking the M- terminal start 
methionine. A near full-length murine STLK4 cDMA is 
represented in the 1773 bp EST AA117438. It extends an 
additional 21 nucleotides 5' of the human STLK4 consensus, 
^ but since its coding region is open throughout the 5' extent 

of the sequence, this is also apparently a partial cDNA 
clone lacking the M~ terminal start methionine. 

Several EST fragments span the complete STLK2 seauence 
with AA297759 at the 5' end and AA100434 and ethers at the 
10 3' end. 

Mam.malian STLK5 

The full-length human STLK5 cDNA (SSQ ID NO: 96) is 2110 
bp long and consists of a 1119 bp ORF flanked by a 229 bp 5' 
UTR and a 762 bp 3' UTR . The sequence flanking the first ATG 
conforms to the Kozak consensus (supra) for an initiating 
methionine, and is believed to be the transla t ional start 
site for STLK5 . Several EST fragments span the complete " 
STLK5 sequence with AA297059. and F07734 at the. 5' end, and 
R46686 and F03423 and others at the 3' end. 

Mamm.alian SThKS 

The full-length human .STLK6 cDNA (SSQ ID MO: 93^' is 
2,001 bp long and consists. of a 1,254 bp ORF flanked by a 75 
bp 5' UTR and a 673 bp 3 ' - UTR . The sequence flanKing the 
first ATG conform.s to the Kozak consensus (supra) for an 
initiating methionine, and is believed to be the 
translat ional start site for STLK6 . 

Mammalian 5TLK7 

The partial human STLK7 cDNA (SEQ ID MO: 100) is 311 bp 
long and consists of a 309 bp ORF. Since the- coding region 
is open throughout both the 5' and 3' extent of this 

6i 
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sequence, chis is apparently a parcial cDMA clone lackmc 
the N- terminal start rriethionine and C-terrp.inal scoo ccdon . 

-Mammal i an ZCl 

The full-lengnh human ZCl cDNA (SEQ ID NO: 9) is 379S bp 
long and consists of a 3717 bp ORF (7-5723) flanked by a 5 
bp 5' UTR and a 75 bp (3724-3798) 3' UTR . No 

poiyadenylacion signal (AATAAA) or pclyaden\*la tec region are 
present in the 3 ' UTR . The sequence flanking the first ATG 
conforms to the Kozak consensus for an initiating 
methionine, and is believed to be the translat ional start 
site for human ZCl. 

Multiple EST fragments {WS1656) match the 3' end of the 
human ZCl gene, but at the time of filing, the inventors 
believe that none exist in GenBank or the EST database that 
match its 5' end. 

Mam^malian ZC2 

The partial human ZC2 cDNA (SEQ ID NO:- 10) is 4 055 bp 
long and consists of a 3S91 bp ORF (1-3391) and a 164 hv 
(3392-4055) 3' UTR. Since the coding region is- open 
throughout the 5' extent of this sequence, this is 
apparently a partial cDNA clone lacking the M- terminal start 
methionine. No polyadenylat ion signal (AATAJy\) or 
polyadenylated region are present in the 3 ' UTR . 

Multiple EST fragm.ents (R51245) m.atch the 3' end of the 
human ZC2 gene, but at the tim.e of filing, the inventors 
believe that none exist in GenBank or the EST database that 
match its 5' end. 

Mamm.alian ZC3 

The partial hum.an ZC3 cDNA (SSQ ID NO:ll- is 4133 bp 
long and consists of a 3973 bp ORF (1-3973; and a 152 bp 
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(3S79-4133) 3'UTR region. Since the coding region is open 

throughout che 5' extent of this sequence, chis is 
apparently a partial cDNA clone lacking the N- terminal scart 
methionine. No pel yadenylat ion signal (AATAAA) or 
polyadenylated region are present in che 3'UTR. 

Multiple EST fragments (R54563) macch the 3 ' end of the 
human ZC3 gene, but at. the time of filing, the inventors 
believe that none exist in GenEank or the EST database that 
miatch its 5' end. 



Mammalian ZC4 

The full-length human ZC4 cDNA (SEQ ID NO:104) is 3,68- 
bp long and v;as originally assembled from X chromosome 
genomdc DMA sequence. 

15 Multiple EST fragm.encs (R9S571) match the 3 ' end of the 

human ZC4 gene, but ac the tim^e of filing, the invencors 
believe that none exist in GenBank or the EST database that 
miatch ics 5' end. ZC4 gene is also contained within the 
human genomic clone Z33S50. 

20 . ... 
Mammalian KHS2 

The full-length human KKS2 cDNA (SEQ ID NO: 17) is 4023 
bp long and consiscs of a 2682 bp QRF (6-26S7) flanked by a 
5 bp (1-5) 5'UTR and a 13 3 6 bp (2688-4023) 3' UTR . A 

25 potential polyadenylat ion signal (AATAAA) is found at 

posicions 4008-4013. Mo polyadenylated region is present in 
the 3'UTR. The sequence flanking the first ATG conform.s to 
the Kozak consensus for an initiacing methionine, and is 
believed to be the ~ ranslat ional scare site for human KHS2 . 

30 Multiple EST fragm.encs match che 5 ' end (AA445022) as 

well as the 3' end (R37b25) of the human KH32 crene . 



b1) 
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M a mma 1 i a n S ULU 1 

The full-length human SULUl cDNA (SEQ ID NO: 19; is 4177 
bp long and consists of a 2694 bp ORF (415-3108^ flanked by 
a 414 bp (1-414) 5'UTR and a 1069 bp (3109-4177) 3' UTR 
5 followed by a 19 nucleotide polydenylated region. A 

potential polyadenylac ion signal (AATAAA) is found at 
positions 4164-4169. The sequence flanking the first: ATG 
conforms to the Kozak consensus for an iniciating 
methionine, and is believed to be the t ranslac lonal szarz 
10 site for human SULUl, 

Multiple EST fragments miatch the 5 ' end (N27153) as well 
as the 3' end (R90 90S) of the human SULUl gene. 

Mammalian (Murine) SULU3 

15 The partial murine SULU3 cDNA (SEQ ID NO: 21) is 2 24 9 bp 

long and consists of a 2244 bp ORF (6-2249) flanked by a 5 
bp (1-5) 5'UTR. The sequence flanking the first ATG 
conforms to the Kozak consensus for an inic lacing 
methionine, and is believed to be the t ranslat ional start 

20 site for murine SULU3 . The 3' end of the murine SULU3 cDNA 

shares 90% DNA sequence identity over 1620 nucleotides v/ith 
human SULU3 , suggesting zhaz these tv;o genes are functional 
orthologues . 

One EST fragment (AA.44 5022) matches the 3' end of the 
25 partial miurine 3ULU3 gene, but at the time of filing, the 

inventors believe that none exist in GenBank or the EST 
database that m.atch its 5' end. 

Mamma 1 i an ( Human ) SULU3 
30 The partial human SULU3 cDNA (SEQ ID NO:20) is 3324 bp 

long and consists of a 2358 bp OP.? (2 -2359) flanked by a 
1465 bp (2360-3824) 3 ' UTR followed by a 19 nucleotide 
polydenylated region. A potential polyadenylat ion signal 

6^ 
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(AATAAA; is found at: positions 2602-2507. Since che 
region is open throughout: the 5' er^tenn of this sequence, 
this is apparently a partial cDNA clone lacking che N- 
terminal stare methionine. The 5' end of the human SULU3 
cDNA shares 90% DNA sequence identity over 1620 nucleotides 
v/ich nrturine SULU3 , suggesting that these two genes are 
functional orthologues . 

Multiple EST fragments (R02283) match the 3 'end of the 
human SULU3 gene, but at the time of filing, the .inventors 
believe that none exist in GenEank or the EST database that 
match its 5' end. 



Mammal i an GEK2 

The full-length human GEK2 cDNA (SEQ ID NO: 106) is 2952 

15 bp long and consists of a 2737 bp ORE (59-2795) flanked by a 

58 bp (1-58) 5'UTR. The sequence flanking the first ATG 
conforms to the Kozak consensus for an initiating 
methionine, and is believed to be the translat ional start 
site for human GEK2 . 

20 Multiple EST fragments (AA465671) match the- 5 'end, but 

at the time cf filing, the inventors believe that only one 
(AA380492) mtatches the 3' end of the human GEK2 gene. 

Mammalian PAK4 

25 The full-length human PAK4 cDNA (SEQ ID NO: 27) is 3604 

bp long and consists of a 2043 bp ORE (143-2185) flanked by 
a 142 bp (1-142) 5'UTR and a 1419 3' UTR followed by a 22 
nucleotide poly:-ienylat ed region. A potential 

polyadenylat ion signal (AATTAAJ^.) is found at positions 3 582 - 
30 35S3. The sequence flanking the first ATG conform:s to the 

Kozak consensus for an initiating methionine, and is 
believed to be the trans lat ional start site for human ?AK4 . 



6f) 
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Multiple EST fragrr.ents {AA535751) luacch the 3 ' end of 

the huraan PAK4 gene, but at the cime of filing, the 
inven-ors believe thac none exist in GenBank or the EST 
database that match its 5' end. 



Mammalian PAK5 

The full-length human ?AK5 cDNA (SEQ ID MO: 102) 'is 2S0 
bp long and consists of a 17 73 bp ORE flanked by a 201 bp 5 
UTR and a S33 bp 3' UTR . The sequence flanking che firsc 
ATG conforms to the Kozak consensus (supra) for an 
initiating rr.echicnine , and is believed to be the 
trans lat ional start site for PAK5 . 

Multiple EST fragm.ents {AA4423b7) match the 3 ' end of 
the human PAK5 gene, but at zhe time of filing, the 
inventors believe that none exist in GenBank or the EST 
database that match its 5' end. 

I I . Nucleic Acid Probes, Methods, and Kits for Detection 
of STE2 0 -Related Kinases. 

A nucleic acid probe of the present invention may be 
used to probe an appropriate chromosom^al or cDNA library by 
usual hybridization methods to obtain other nucleic acid 
molecules of the present invention. A chromosomal DNA or 
cDNA library may be prepared from^ appropriate cells 
"according to recognized methods in the art (of. "Molecular 
Cloning: A Laboratory Manual", second edition, Cold Spring 
Harbor Laboratory, Sambrook, Eritsch, 8c Maniatis, eds . , 
1939) . 

In the alternative, chemical synthesis can be carried 
out in order to obtain nucleic acid probes ha.vi.ng nucleotid 
sequences which correspond to N-term.inal and C-te.rminal 
portions of the amino acid sequence of the polypeptide of 
interest. The synthesized nucleic acid probes may be used 

6(j> 
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as primers in a polymerase chain reaccion (PCR) carried out 
in accordance wich recognized PCR techniques, essentially 
according to PCR Protocols, "A Guide to Methods and 
Applications", Academic Press, Michael, at al . , eds . , 1990, 
utilizing the appropriate chromosomal or cDNA library to 
obtain the fragment: of the present invention. 

one skilled in the art can readily design such probes 
based on the sequence disclosed herein using methods of 
computer alignm.ent and ' sequence analysis known m the art 
f'Mclecuiar Cloning: A Laboratory Manual " , 19S9, supra). 
The hybridization probes' of the present invention can be 
labeled by scandard labeling techniques such as with a 
radiolabel, enzyme label,- fluoresce..- .ao-i, 
label, chemiluminescence, and the like. Afcer 
hybridization, the probes may be visualized using known 
methods . 

^^A^ r--r^HoQ or the present invention 
The nucleic acio prooes u.ic ^^j-w^ 

1 ■n^-2\ r^r-^b*=i<=; such orobes being 
include RNA, as well as UNA prooes, ^u^-.. 

generated using techniques Known m un-= a. o . --^ -u 
acid orobe m.av be immobilized on a solid support. Examples 
■ =ucr solid suoporr_s include, -but are not lim.ited to,^ 

plastics such as polycarbonate, complex carbohydrates such 
as a-aarcse and sepharose, and acrylic resins, such as 
polvicrvlamide and latex beads. Techniques for coupling 
nucleic acid probes to such solid supports are well known m 
the art. 

The test samoles suitable for nucleic acid probing 
methods of the present invention include, for example, cells 
or nucleic acid extracts of cells, or biological fluids. 
The samples used m che above -described mechods will vary 

f^™-,- rh^ d'-t^ccion mechod and the 
basea on tne assay torma^-, tlcl^ u^u-. 

-^c,s..^s ^-i--s cr extracts to be assayed, 
nature or tne '_--ssu ^ 

^v^i-accs of cells are 

Methods for preparing nucj-ex_ 

6n 
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v/ell kncv;n in the art and can be readily adap-ed in order to 
obtain a sarr^ple v/hich is compatible with the "^echcd 
ut il i zed . 

One method of detecting the presence of nucleic acids 
5 of the invention in a sample comprises (a) contacting said 

sarrtple with the above-described nucleic acid probe under 
-conditions such that hybridization occurs, and (b) detecting 
the . presence of said probe bound to said nucleic acid 
molecule. One skilled in the art v/ould select the nucleic 
10 acid probe according co techniques knov/n in the arc as 

described above. Samples to be tested include but should 
not be lirr.ited to RNA samples of human tissue. 

A kit- for deuecting the presence of nucleic acids of 
the invention m a sample comprises at least one container 
15 means having disposed therein the above - described nucleic 

acid probe. The kit m.ay further comprise other containers 
comprising one or more of the following: wash reagents and 
reagenus capable of detecting the presence of bound nucleic 
acid probe. Examples of detection reagents include, but are 
20 net limited to radiolabelled probes, enzymatic labeled 

probes (horseradish peroxidase, alkaline phosphatase) , and 
affinity labeled probes (biotm, avidin, or sceptavidin) . 

In detail, a comparcmental ized kit includes any kit in 
which reagents are contained m separate containers. Such 
25 containers include small glass containers, plastic 

concainers or strips of plastic or paper. Such containers 
allow the efficient transfer of reagents from one 
com.partment to another compartmene such that the samples and 
i-eaoents are not cross-contaminated and tne agents or 
30 solutions of each concainer can be added in a quancitacive 

fashion from one comparum.ent to another. Such containers 
will include a container which will accept the test sample, 
a container which contains the probe or primers used in the 
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assay, containers 



which contain wash reagents isucn a= 



phosphate buffered saline, Tris-buf f ers , ana tne ^iKe=, anc 
containers which contain the reagents used to detect the 
hybridized probe, bound antibody, amplified product, or the 
like. One skilled in the art will readily recognize that 
the nucleic acid probes described in the present invention 
can readily be incorporated into one of the established kit 
.formats which are well known in the art. 

TTi . mTA Cor.st^-nnt.q Comr^^i.qina a STT^20 -Re lated Nucleic 

ar-^ri Molec'jTe and C^T! s Con t^-^nina These Constructs. 
The present invention also relates to a recombinant DNA 
molecule comprising, 5' t.o 3 ' , a promoter effective to 
initiate transcription in a host cell and the- above- 
described nucleic acid molecules. In addition, the present 
invention relates to a recomJDinant DNA molecule comprising a 



vector a 



and an above -described nucleic acid m.olecule . The 



■present invention also relates to a nucleic acid molecule 
comprising a transcriptional region functional in a cell, a 
20 sequence complem.entary to an RNA sequence encoding an amino 

acid sequence corresponding to the above-described 
polypeptide, and a transcriptional termination region 
functional in said cell. The above-described molecules m.ay 
be isolated and,/or purified DNA molecules. 
25 The present invention also relates to a cell or 

organism that contains an above-described nucleic acid 
molecule and thereby is capable of expressing a polypeptide 
The polypeptide may be purified from cells which have been 
altered to express the polypeptide. A cell is said to be 
30 "altered to express a desired polypeptide" wnen the cell, 

through genetic manipulation, is made to prcauce a protein 
which it normally does net produce or which tne ceil 
normally produces at lower levels. One skilled m the art 
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can readily adapc procedures for introducing and expressing 
either genomic, cDNA, or synthetic sequences into eicher 
eukaryocic or prokaryocic cells. 

A nucleic acid molecule, such as DMA, is said cc be 
5 "capable of expressing" a polypeptide if it ccnrains 

nucleotide sequences v/hich concain transcriptional and 
translational regulatory information and such sequences are 
"operablv linked" to nucleocide sequences which encode the 
i^olvpeouide . A.n operable linkage is a linkage in which the 
10 regulacory DNA sequences and the DNA sequence soughc to be 

expressed are connected in such a way as co permit gene 
sequence expression. The precise nacure of t:he regulatory 
reaions needed for gene sequence expression may vary from, 
organism to organism, but shall in general include a 
15 promoter region v;hich, in prokaryotes, contains both the 

promoter (which directs the initiation of RNA transcription) 
as v;eli as the DNA sequences which, when transcribed into 
RNA, w:l11 signal synthesis iniciation. Such regions will 
normally include those 5 ' -non-coding sequences involved with 
20 iniciation of transcription and translation, such as the 

TATA box,' capping sequence, CAAT sequence, and the like. 

If desired, the non-coding region 3' no the sequence 
encoding a kinase of the invention m.ay be obtained by the 
above-described mierhods . This region miay be retained for 
25 its transcriptional termination regulatory sequences, such 

as termination and polyadenylat ion . Thus, by retaining the 
3 '-region naturally contiguous to the DNA sequence encoding 
a kinase of the invention, the transcriptional termination 
signals may be provided. Where the transcriptional 
30 termination signals are not satisfactorily functional m the 

exoression host cell, then a 3' region functional in the 
host cell m.ay be substituted. 

uo 
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Twc DNA sequences (such as a prorr.ocer region sequence 

and a seauence encoding a kinase of the inven-ion) are said 
to be operably linked if the nature of che linkage becv/een 
the two DNA sequences does noc (1) result in the 

5 introduction of a frame- shir t mucation, (2) interfere with 

Che ability of che promoter region sequence co direct the 
transcription of a gene sequence encoding a kinase of the 
invention, or (3) interfere with the ability of che gene 
sequence of a kinase of the invention co be transcribed by 

10 the promoter region sequence. Thus, a promoter region would 

be operably linked co a DNA sequence if che promoter were 
caoable of effeccing transcription of that DNA sequence. 
Thus, CO .express a gene encoding a kinase of che invention, 



transcriptional and translat icnal signals recognized by 



an 



15 appropriace hose are necessary. 

The oresent invention encom.passes the expression of a 
gene encoding a kinase of the invencion (or a functional 
derivacive thereof) in eicher prokaryotic or eukaryotic 
cells. Prokaryocic hosts are, generally, very efficient and 

20 convenient for the production of recombinanc proteins and 

are, therefore, one type of preferred expression system for 
kinases of the invencion, Prokaryotes m.osc frequently are 
represenced by various strains of E. coll. However, other 
microbial strains m.ay also be used, including ocher 

25 bacterial strains. 

In prokaryotic syscems , plasmdd veccors that contain 
replication sites and control sequences derived from a 
species compatible with the host may be used. Examples of 
suitable plasmid vectors may include pBR322, pUCllS, pUC119 

30 and the like; suitable phage or bacteriophage veccors may 

include ygtlO, ygtll and the like; and suitable virus vectors 
may include pMZ^.M-nec, pKRC and the like. Preferably, the 

U>1 
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selected vector of zhe present invencion has the capacity 
replicate in the selecced hose cell. 

Recognized prokaryotic hosts include bacteria such as 
E, coll, Bacillus, Streptomyces , Pseudowonas , Salmonella, 

5 Serx-atia, and the like. However, under such conditions, the 

oolypepcide ivill ncc be glycosylated. The prokaryotic host 
musi: be compatible with the replicon and control sequences 
in the expression plasmid. 

To express a kinase of the invencion (or a functional 

10 derivative chereof} in a prokaryotic cell, it is necessary 

to ooerably link the sequence encoding the kinase of the 
invention to a functional prokaryotic promoter. Such 
promoters-may be either constitutive or, more preferably, 
regulacable {i.e., inducible or derepressible) . Examples of 

15 constitutive promoters include the int promiocer of 



bacteriophage a, the bla promoter of the (3- lactamase gene 
sequence of pBP.322, and the cat promoter of the 
chloram.ohenicol acetyl transferase gene sequence of pPR32 5, 
and the like. Examples of inducible prokaryotic promoters 

20 include the m.ajor right and left promoters of bacteriophage 

X (P. and Pp.) , the trp, recA, AacZ, AacI , and grai promoters 
of E. coli, the a-amylase (Ulmanen et al., J. Bacterid. 
162:176-182, 19S5) and the c- 2 8 - spec i f ic promoters of S. 
sujbtilis (Oilman et al . , Gene Sequence 32:11-20, 1984), the 

25 promoters of the bacteriophages of Bacillus (Gryczan, In: 

The Molecular Biology of the Bacilli, Academic Press, Inc., 
NY, 19S2), and Strepvomy^ces promoters (Ward et al . , Mol . 
Gen. Genet. 203:458-478, 1986). Prokaryotic promoters are 
reviev/ed bv Glick (Ind. Microbiot . 1:277-282, 1987), 

30 Cenatiempo (Biochimie 68:505-516, 1985), and Gottesm.an (Ann, 

Rev. Genet.. 13:415-442, 1984). 

Propel" exoression in a prokaryotic cell also requires 
the presence of a ribosomie -binding site upstream, of the gene 
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seauence-encoding sequence . Such ribosome -binding sites are 
disclosed, for exaT.ple, by Gold et al . (Ann. Rev. Microbiol. 
25:365-404, 1981). The selection of control sequences, 
expression vectors, cransf orrrtat ion methods, and the like, 
5 are deoendent on the type of host cell used to express the 

gene. As used herein, "cell", "cell line", and "cell 
culture" may be used interchangeably and all such 
designations include progeny. Thus, che words 
"cransforman-s" or "transformed cells" include the prim.ary 
10 subject cell and cultures derived therefrom, without regard 

to the nurrjoer of transfers. It is also understood that all 
progeny may not be precisely identical in DK'A content, due 
to deliberate or inadvertent m.utations. However, as 
defined, m.utant progeny have the same functionality as that 
15 of the originally transformed cell. 

Host cells which may be used in the expression systems 
of the oresent invention are not strictly, lim.ited, provided 
that they are suitable for use m the expression of the 
kinase polypeptide of interest. Suitable hosts may often 
20 include eukaryotic cells. Preferred eukaryctic hosts 

include, for example, yeast, fungi, insect cells, mammalian 
cells either In vi\'^o, or in tissue culture. Mammalian cells 
which may be useful as hosts include HeLa cells, cells of 
fibroblast origin such as VERO or CHO-Kl, or cells of 
25 lymphoid origin and their derivatives. Preferred mamimalian 

host cells include SP2/0 and J558L,- as well as neuroblastoma 
cell lines such as IMR 332, which may provide better 
capacities for correct pos t - t ranslat lonal processing. 

In addition, plant cells are also available as hosts, 
30 and control sequences compatible with plant cells are 

available, such as the cauliflower m.osaic virus 353 and 19£, 
and nopal ine synthase promoter and pol yadeny lat icn signal 
sequences. Another preferred host is an insect cei_, tor 

6)^ 
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example che Drosophila larvae. Using insecc cells as hoszs, 

che Drosophila alcchci dehydrogenase promoter can be used 
(Rubin, Science 2 4 0:1453-1459, 193S). Alternatively, 
baculovirus veccors can be engineered cc express large 
amounts of kinases of che invention in insect cells (Jasny, 
Science 233:1653, 1937; Miller et ai . , In: Genetic 
Engineering, Vcl . 3, Plenum, Setlow et al . , eds , , pp. 277- 
297, 1986) . 

Any of a series of yeast: expression systems can be 
utilized v^hich incorporate promoter and Termination elem.ents 
from, the actively expressed sequences coding for glycolycic 
enzymes thac are produced in large quanticies v;hen yeasc are 
grown in medium.s rich in glucose. Knov/n glycolytic gene 
sequences can also provide very efficient transcriptional 
control signals. Yeasc provides subscantial advancages in 
that it can also carry out post - t ranslat ionai m.odif icat ions . 

A numJoer of recombinant: DNA strategies exist utilizing 
strong promoter sequences and high copy numb-er plasmids 
which, can be utilized for production of the desired proneins 
in yeasc . Yeast recognizes leader sequences on cloned 
mamim.alian genes and secretes peptides bearing leader 
sequences (i.e., pre -peptides ) . Several possible vector 
systems are available for the expression of kinases of the 
invention in a miamimialian host. 

A wide variety of transcriptional and zranslar ional 
regulatory sequences m.ay be employed, depending upon the 
nature of the host . The cranscript ional and translazional 
regulatory signals may be derived fromi viral sources, such 
as adenovirus , bovine papillomia virus, cycomegalovirus , 
simian virus, or the like, where the regulacory signals are 
associated with a particular gene sBOUBnc^ which has a high 
level of expression. Alternatively, promoters from: 
mammalian expression products, such as ace in, collagen, 
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iTiycsin, and che like, may be employed. Transcriptional 
initiation regulatory signals may be selected which allov.- 
for repression or activation, so that expression of the gene 
sequences can be modulated. Of interest are regulatory 
5 signals v;hich are temperature-sensitive so that by varying 

the temperature, expression can be repressed or initiated, 
or are subject to chemiical (such as metabolite; regulation. 

Expression of kinases of the invention in eukaryotic 
hosts requires the use of eukaryotic regulatory regions. 

10 Such regions will, in general, include a promoter region 

sufficient to direct. the initiation of RNA synthesis. 
Preferred eukaryot ic . promiOters include, for example, the 
tJromoter of the mouse metal lothionein I gene sequence (Hamer 
ec al . , J: Mol . Appl . Gen. 1:273 -253, 1982); the TK promioter 

15 of Herpes virus (McKnight, Cell 31:355-365, 1S32); the 3V40 

early , promoter (Benoist et al . , Nature (London) 290:304-31, 
19S1); and the yeast gal4 gene sequence promoter (Johnston 
et ai . , Proc. Natl. Acad. Sci . (USA) 79:6971-6975, 1982;- 
Silver et al . , Proc. Natl. Acad. Sci. (USA) 31:5951-5955, 

20 1984) . 

Translation of eukaryotic mRNA. is initiated at the j 
codon which encodes the first methionine. For this reason, 
it is preferable to ensure that the linkage between a 
eukaryotic promoter and a DNA sequence which encodes a 
25 kinase of the invention (or a functional- derivative thereof) 

does not contain any intervening ccdons v/hich are capable of 
encoding a methionine (i.e., AUG) . The presence of such 
codons results either in the formation of a fusion protein 
(if the AUG codon is in the same reading frame as the kinase 
30 of the invention coding sequence) or a fram^e- shift mutation 

(if the AUG codon is not in' the same reading fram^e as the 
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A nucleic acid molecule encoding a kinase of zhe 
invention and an cperably linked promoter may be incroduced 
into a recipient prokaryotic or eukaryc-iic cell either as a 
nonreplicating DNA or RNA molecule, which may either be a 
5 linear molecule or, more preferably, a closed covalent 

circular molecule. Since such molecules are incapable of 
autonomous replicacion, the expression of the gene may occur 
through the transient expression of the introduced seauence. 
A.lternatively , permanent expression may occur through the 
10 integration of the introduced DMA sequence into the host 

chromosome . 

A vector miay be employed which is capable of 
integrating the desired gene sequences into the host cell 
chromosome. Cells which have stably integrated the 
introduced DNA into their chromosomes can be selected by 
aj.so introducing one or m.ore markers which allow for 
selection of host cells which contain the expression vector. 

The m:arker may provide for prototrophy to an auxotrophic 
host, biocide resistance, e.g.,. antibiotics, or heavy 
m.etals, such as copper, or the like. The selectable marker 
gene sequence can either be directly linked to the DNA gene 
sequences to be expressed, or introduced into the sam.e cell 
by CO- transf ect ion . Additional elements may also be needed 
for optimal synthesis of mRNA. . These elements m.ay include 
25 splice signals, as well as transcription promoters, 

enhancers, and term.ination signals. cDNA expression vectors 
incorporating such elemients include those described by 
Okayamia (Mol . Cell. Biol. 3:280-, 1985). 

The introduced nucleic acid molecule can be 
30 incorporated ' into a plasmid or viral vector capable of 

autonomous replication in the recipient host. Any of a wide 
variety of vectors may be employed for this purpose. 
Factors of importance in selecting a particular plasm.id or 
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viral vecLor include: the ease with which recipient cells 

that contain the veccor may be recognized and selected frcrr. 
chose recipienc cells v;hich do noc contain the vector; the 
number of copies of the vector which are desired in a 
particular host; and whether iz is desirable to be able to 
"shuttle" the vector betv;een host cells of different 
species . 

Preferred prokaryocic vectors include plasmids such as 
those capable of replication in E. coli (such as, for 
example, pBR322, ColEl, pSClOl, pACYC 1S4 , nVX ; "Molecular 
Cloning: A Laboratory Manual", 1969, supra) . Bacillus 
plasmids include pC194, pC221, pT127, and che like 
(Gryczan . In : The Molecular Biology of the Bacilli, Academic 
Press, NY, pp. 307-329, 1982). Suitable Screptcruyoes 
plasmids include plJlOl (Kendall et al . , J. Bacterid . . 
169:4177-4183, 1987), and s t reptomryces bacceriophages such 
as (j)C31 (Chauer en ai . , In: Sixch International Symposium on 
Act inom.yce tales Biology, A.kademiiai Kaido, Budapest, Hungary, 
pp. 45-54, 1986). Pseudomonas plasmids are reviewed by John 
et al. (Rev. Infect. Dis . 8:693-704, 1985), and Izaki (Jpn'. 
J, Bacterid. 33:729-742, 1973). 

Preferred eukaryotic plasmids include, for example, 
B?V, vaccinia, SV40, 2-mLicron circle, and che like; or their 
derivatives. Such plasmids are well known in the- art 
(Bctstein et al . , Miami Wntr, Symp . 19:265-274, 1982; 
Broach, In: "The Molecular Biclogy of the Yeast 
Saccharom.yces : Life Cycle and Inheritance", Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY, p. 445-470, 1981; 
Broach, Cell 28:203-204, 1982; Bcllon et ai . , J. Clin. 
Hem^atd . Oncol. 10:39-48, 1930; Maniacis, In: Cell Biology: 
A Comprehensive Treatise, Vol. 3, Gene Sequence Expression, 
Academic Press, MY, pp. 563-605, 1930). 

G1 



wo 99/53036 PCT/US99/08150 

Once Che vector or nucleic acid molecule containing the 
construcc(s) has been prepared for expression, the DNA 
construct (s) may be introduced into an appropriate hose cell 
by any of a variety of suitable means, i.e., t ransf orrPiac ion , 
5 transf eccion, conjugation, protoplast fusion, 

eleccroporat ion , particle gun technology, calcium phosohace- 
precipitat ion, direct microinjeccion, and the like. Afcer 
the introduccion of the vector, recipient cells are grown in 
a selective medium, v.'hich seleccs for the grov/th of vector- 

10 containing cells. Expression of the cloned geneCs) resulcs 

in the production of a kinase of the invention, or fragments 
thereof. This can take place in the transformed cells as 
such, or following the induction of chese ceils to 
differentiate (for example, by administration of 

15 bromodeoxyuracil to neuroblastoma cells or the like) . A 

variety of incubation conditions can be used to form the 
peptide of nhe present invention. The most preferred 
conditions are those v/hich mim.ic physiological conditions. 

20 rv. The Proceins of the Invention 

A variety of methodologies Known in the art can be 
utilised to obtain the polypeptides of the present 
invention. The polypepnides may be purified from^ tissues ox- 
cells that naturally produce the polypeptides. 

25 Alternatively, the above -described isolated nucleic acid 

fragments could be used to express the kinases of the 
invention in any organism. The samples of the present 
invention include cells, protein extracts or merrJorane 
extracts of cells, or biological fluids. The samples will 

30 vary based on the assay format, the detection m.ethcd, and 

the nature of the tissues, cells or extracts used as the 
sample . 
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Any eukaryocic organism can be used as a source for the 

polypeptides of zhe invention, as long as the source 
organism naturally contains such polypepcides . As used 
herein, "source organism" refers zo the original organism 
from which the amino acid sequence of the subunic is 
derived, regardless of the organismi the subunit is expressed 
in and ultimately isolated from;. 

One skilled in the arn can readily follov; knov/n methods 
for isolating proteins in order to obcain zhe pclypepcides 
free of natural concamiinant s . These include, but are not 
limited to: size-exclusion chromatography, HPLC, ion- 
exchange chromatography, and im.muno- affinity chrom.atography . 

Mammialian STLK2 

Analysis of the deduced amino acid sequence predicts 
STLK2 to be an intracellular serine/ threonine kinase, 
lacking both a signal sequence and transm^emibrane domain.* 
STLK2 contains a 21 amino acid N-term.inal domain, a 253 
amino acid catalytic domiain with all the motifs 
characteristic of a serine/ threonine kinase, follov/ed by a 
142 amino acid C-term^inal domain. 

STLK2 is m.ost closely related to hum.an STE2 0 - subf ami ly 
kinases, MST3 (G3 : AF024636 } and SOK-1 (GB:X99325) and a C. 
elegans kinase yk34bll :5 (GB:U53153) sharing 72 . 7% , . 6S . 7% , 
and 65.3% am.inc acid identity, respectively. 

The 21 amino acid N- terminal domain of humian STLK2 is 
71.4% identical to the M-terminus of MST3 (G5 :AF024.636) . 
Human STLK2 lacks a glycine residue at position 2, and is 
therefore unlikely to undergo myristylat ion . A Smith- 
Watermtan search of the nonredundant protein database does 
not reveal any significant homiologies that miight suggest a 
potential function for this domain. 
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The 253 amino acid catalytic domain of human STLK2 is 
most: related to human SOK-1 (X99325), MST3 (GE : AF024 63 6 ; , C. 
elegans yk32bll . 5 {GB:TJ53153), and STLK3 {SEQ ID NO : 5 ) 
sharing 38.9%, 8 7.4%, 7S.3%, and 4 9% amino identity 
respectively, placing ic in the STLK- subf ami ly of STE20- 
related kinases. The STLK2 kinase domain displayed lesser 
hom.ology to other STE2 0 - related kinases including; 55,9% to 
human MST2 (G3:U26424), 49.2% to human GCK (GB:U07349>, 
49.2% CO human KKSl (GB:U77129), and 44.2% to human HPKl 
(GB:U66464) . The activation loop of human 3TLF:2 cacalycic 
domain is identical to chac of human SOK-1 and MST3 
including zhe presence of four potential threonine 
phosphorylacion sites that could serve an aucoregulacorv 
role on kinase activity. 

The 142 am.inc acid C- terminal dom.ain of human STLK2 is 
most related to hum.an SOK-1 (X99325), MST3 (GB : AF02463c ) , 
and C. elegans yk32bll.5 {GB:U53153), sharing 39.9%, 39.9%, 
and 33.3% amino acid identity, respect ivelv . This C- 
terminal dom.am shares some significant amino acid 
similarity to the C-terminal domiains of the related human 
STLK3 (SEQ. ID NO : 6 ) and STLK4 (SEQ ID NO : 7 ) . 

The C-terminus of the related human SOK-1 (GB:X99325) 
kinase has been shown co be inhibitory to the catalvcic 
activity of this kinase (Pcmbo, C.M., Bcnventre, J.V., 
Holnar, A., Kyriakis, J. and Force, T. EMBO J. 15, 4537-4546 
(1996)). Based on the sequence identity between the C- 
termmi of hum:an SOK-1 (GB:X99325) and human STLK2 (39.2%), 
rhe C-rerminus of human STLK2 may also function as an 
mhibiuory domain for its kinase. 

M a mmia 1 1 a n S TL K 3 

The 3030 bp hum.an STLK3 nucleotide sequence of che 
partial cDNA clone encodes a polypepcide of 516 amino acids 

to 
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(SEQ ID NO: 6) with a predicted molecular mass of 56,784 
dalcons . Analysis of. che deduced ainino acid sequence 
predicts 3TLK3 co be an intracellular serine/ threonine 
kinase, lacking both a signal sequence and transmembrane 
domain, however the cDNA clone lacks an initialing ATG , so 
the full extenc of it N-cermius is not known. STLK3 
contains a 3 1 amino acid M- terminal domain, a 277 amino acid 
catalytic dom.ain with all the m.otifs charac uei'ist ic of a 
serine/ threonine kinase, follovved by a ISl am.ino acid C- 
terminal domiain containing a 25 amtino acid insert and a 27 
amino acid tail relative to the sequence of human STLK2 . 

STLK3 is m.ost closely related to human STE2 0 - subfamily 
kinases, STLK4 (SEQ ID. NO : 7 ) , MST3 (GB : AFG24 6 3 6 ) , SOK- 1 

(GB:X99325) and STLK2 (SEQ ID NO : 5 ) sharing 71.1%, 37.6%, 
33.1%,, and 38.4% amino acid identity respectively. 

The 31 amino acid N- terminal domain of hum:an STLK3 
lacked any significant: amiino acid sequence hom-ologies using 
a Smith-Waterm.an search of the nonredundant protein 
database, other than sequence similarity to prol ine -alanine 
repeats . 

The 277 am.ino acid catalytic domiain of humian STLK3 is 
most related to human STLK4 (SEQ ID NO : 7 ) , SOK-1 
(G3:X99325) , MST3 ( GB : AFC2 4 6 3 5 ) , and STLK2 (SEQ ID NO : 5 ) 
sharing 88.2%, 49.2%, 49%, and 49% am:ino acid identity, 
respectively. It also shares strong homiology to other STKs'- 
from lower organisms including 51.7% to A. thali^na (GB: 
AC002343) , 43.1% to A. thaliana (GB: Z97336), 42.1% to A. 
chaliana (GB: U96613), and 43.3% to C. elegrans (GB: U53153). 

The activation loop of the hum.an STLK3 catalytic domain 
conserves three potential threonine pho3ohcr\'la t ion sites 
with other members of the STLK-subf am.ily of STE2 0 - related 
kinases (humian STE2 0, .MST3 , STLK2 , STLK4) that could serve 
an autoregulatory role on kinase activity. 
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The ISl a-inc acid C-cerminai domain of human. STLK3 
shares 55.5% amino acid identity to human STLK4 (SEQ ID 
NC:7), and is 100% identical to a partial human cDNA DCHT 
(GB:AFG17635) . The C-uerminal domain of human STLK3 
contains a 26 amino acid inserc relative to hum.an STE20. A 
similar (87.5% amino acid identity) 26 amine acid mser- is 
also present in hum*an STLK4 . 

The 2 7 amino acid C- terminal tail of human STLK3 shares 
77.8% amino acid identity to human STLK4 , bun is absent from 
other STLK-family members. This high degree of homology 
between che C-cail of two STLK-family m.embers sugcfescs thev 
m.ay be involved in an as yet unidentified procein-orocein 
interact ion . 

The weak seausnce homology between the C- termini of 
15 human STLK3 and STE20, suggests iz may also function as an 

inhibitory domain for its kinase. 

Mam.malian STLK4 

. The 3 857 bp hum^an STLK4 nucleotide sequence of the 
partial cDNA clone encodes a polypeptide of 4 14 amino acids 
(SEQ ID- NO: 7) with a predicted molecular mass of 45,451 
daltons. Analysis of rhe deduced am.ino acid sequence 
predicns STLK4 co be an intracellular serine/ chreonine 
kinase, lacking both a signal sequence and transmembrane 
25 domain, however the cDNA clone lacks e.n mitiacing ATG, so 

the full extenc of it M-terminus is noc knov.^n . The partial 
STLK4 protein sequence contains a 178 ammo acid catalytic 
dom.ain corresponding to the C-termiinal motifs VI -XI of a 
serine/threonine kinase, follov;ed by a 236 amino acid C- 
term.inal domain containing two inserts of 25 and 41 amino 
acids each, relative to the sequence of hum^an 3TLK2 . 

3TLK4 is most closely related to human £TE2 0 - subf ami 1 v 
kinases, STLK3 (SEQ ID. NO o), MST3 (GB : AF02 4 £ 3 6 ^ , STLK2 
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(SEQ ID NO:5), and SOK-1 (GB:X99325) sharing 71.0%, 46.8%, 
43.9%, and 37.7% amine acid identiry, respectively. 

The 17S amino acid cacalytic domain of human STLK4 is 
most related to human STLK3 (SEQ ID NO. 7), SCK-1 
5 , (GB;X95325), MST3 (GB : AFC2 4 6 3 6 ) , STLK2 (SEQ ID NO : 5 ) , and 
MSTl (GB:U18297), sharing 88.2%/ 54.2%, 54.0%, 53.7 and 
45.7% amino acid identity, respectively. It also shares 
strong homology to other STKs from lower organisms including 
56.9% to A. thaliana (GB: AC002343)., 52.5% to C. elegans' 

10 (GB: U53153) , 46.2% to A, thaliana (GE: Z97336) and 45.7% to 

A, thaliana (GB: U96613) . The activation loop of the human 
STLK4 catalytic domain conserves zhree potential threonine 
phosphorylation sites v/ith other members of the STLK- 
subfamily of STE2 0 -related kinases (human STE20, MST3 , STLK2 

15 and STLKS ) that could serve an autoregulatcry role on kinase 

activity. 

The 236 amino acid C-terminal domain of hum.an STLK4 
shares 58.1% ammo acid identity to boch hum.an STLK3 (SEQ ID 
NO:6) and to a partial human cDNA, DCHT (GB : AF017635 ) . The 
20 • C-termdnal domain of. human STLK4 ccncains a 25 amino acid - 
insert relative to human SOK-1 and shares 87.5% amino acid 
identity to an insert present in human STLK3 . 

The weak sequence hcmiology between the C- termini of 
human STLK4 and STE2 0, suggests it m.ay also function as an 
25 inhibitory domiam for irs. kinase. - • 

Mammtalian STLK5 

The full-length 2110 bp hum:an STLK5 cDNA encodes a 
polypeptide of 373 amino acids (SEQ ID NO: 97) wich a 
30 predicted molecular m-ass of 41,700 daltons. Analysis of the 

deduced amino acid sequence predicts STLK5 to be an 
intracellular STE20-subf amily kinase, lacking both a signal 
sequence and cransmiemjDrane domain. 5TLK5 contains a 10 

13 
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aTTiino acid N-cerminal domain, a 311 amino acid cacaiyric 
domain v;i~h all the mccifs characteristic of a 
serine/rhreonine kinase, and a 52 amino acid C-cerminal 
domain . 

STLK5 is most closely related to the human STE2 0- 
subfamily kinases STLK5 (SEQ ID No. 99) and S?AK (AF099939) , 
sharing 51% and 33% amine acid idencity, respectively, over 
ics full extent. It also shares significant homology zo 
database entries from A.rabldopsis chaliana (GB : AC002343 ; and 
C.Blegans (GB : AL023843 , GB:AL023S43) . 

The 10 amino acid N-cerminal domain of human STLK5 does 
not reveal any significant homologies to the protein 
database , 

The 311 am.ino acid catalytic domain of humian STLK5 
shares 51% and 34 % identicy to STLK6 and SPAK, 
respectively. The cataly-ic domain of STLK5 contains a 45 
am.ino acid insert between kinase subdom.ains X and XI 
relacive to human 3TE20 . Multiple human EST f ragm.ents as 
v/ell as a murine EST (GB : AA575647 } contain this insert 
providing evidence that this region is an integral part of 



The 52 amino acid G-cerminal taal of human STLK5 shares 
41.3% am.ino acid idencity co human 50K-1 (GB:X99325). The 
weak sequence hom-ology between the C- termini of human STLK5 
and STE20, suggests it may also function as an inhibicory 
domiain for its kinase . 

Mam.malian STLK6 

The 2,001 bp human 3TLK6 nucleotide sequence of the 
com.plete cDNA encodes a polypeptide of 4 IS amino acids (SEQ 
ID NO: 99) with a predicted m.olecular mass of 47,025 daltons. 

Analysis of the deduced amino acid sequence predicts STLK5 
to be an intracellular STE2 0 - subf amii ly kinase, lacking both 
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a signal sequence and t ransmerrJorane domain. STLKc contains 

a 57 amino acid M-cerminal domain, a 312 amino acid 
catalytic domain with all the mocifs characteristic of a 
serine/ threonine kinase, follov/ed by a 49 amino acid C- 
5 terminal domain. 

STLKc is most closely related zo human STE2 0 - subf ami 1 ;/ 
kinases STLK5 (SEQ ID NO:97), STLK7 (SSQ ID NO:101), and 
SPAK (AP0999o9) , sharing 50%, 35%, and 30% amino acid 
idencity over ics full extent. It also shares significanc 
10 homology to database entries from; Arabidopsis chaliana 

(GB : AC002343 ) and C . elegans (G3:U53153). 

The 57 am.mo acid N- terminal domain of hum.an STLK6 does 
not reveal any significant homologies in zhe protein 
database. 

15 The 312 amdno acid catalytic domain of human STLK6 

shares 51 and 3 0 % identity to human STLK5 and SPAK, 
respectively. 

The 49 'amino acid C-terminal tail of human STLK6 shares 
low amino acid sequence identity (29%) with STLK5 and SPAK. 

20 : - . 

Mamim.alian STLK7 

The 311 bp humtan STLK7 nucleotide sequence of the 
partial cDMA encodes a polypeptide of 103 amino acids (SEQ 
ID NO: 101) . Analysis of the deduced am.ino acid sequence 

25. predicts STLK7 co be an mrernal fragment of an 

intracellular STE20-family kinase. This sequence lacks che 
Ni- and C-terminal portions of STLK7 and contains only the M- 
term.inal 103 am^ino acids of the predicted catalytic domain, 

30 Human STLK7 is mos" closely related to human 3TE2C- 

subfamily kinases SPAK {AFC99989); STLK5 (SSQ ID iNlO:97), and 
STLK6 (SEQ ID NO : 9 9 ) , sharing 36%, 33%, and 35% amino acid 
identity v;ichin this region of the kinase domain. It also 

13 
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shares significant hcmolcgy to database entries from 
Arabidopsis thaliana ;GB : AC002343 ) and Drosophila 
melanogasver (GB : AF006640 ) 

5 Mamrr^alian ZCl 

The 3 7 9S bp human ZCl nucleotide sequence encodes a 
polypeptide of 1239 amino acids (3EQ ID NO: 13) with a 
predicted molecular m.ass of 142,140 daltons. Analysis of 
che deduced amino acid sequence predicts ZCl zo be an 
10 intracellular serine/threonine kinase, lacking both a s":ignal 

sequence and transmembrane domain. The full-length ZCl 
procem contains a 22 amino acid N- terminus, a 267 ammo 
acid catalvcic dom^ain with all the m^otifs characteristic of 
a serine/threonine kinase, a 237 amino acid region predicted 
15 to form a coiled-coil structure, a 114 am.inc acid proline- 

rich region, a 2 56 amiino acid spacer region, followed by a 
343 amino acid C-term.inal dom.ain containing a potential 
Rab/Rho- binding region . 

ZCl is most closely related to the hum.an 3TE20- 
20 subfamily .kinases ZC2 (SEQ ID N0:14), ZC3 (SEQ ID NO:15), 

and ZC4 (SEQ ID NC:16), sharing 61.7%, 60.9%, and 43.8% 
amino acid identity, respectively. ZCl also shares 45.5% 
amino acid identity to a C. eiegans kinase encoded bv the 
cosmid ZC504.4 {G3:Z50029). ZCl exhibits 50.0% amino acid 
25 homology to m.urine NIK (GB: US 8 98 4) , suggesting ic may be the 

human orthologue of this STK. 

The 22 amino acid N- terminal dom^ain of hum.an ZCl is 
58.8% identical to the C. elegans kinase encoded by the. 
cosmid ■ZC504 .4 {G3:Z50029;, and 100% identical to murine I^TIK 
30 (GB: U889S4; . Human ZCl lacks a glycine residue at position 

2, and is therefore unlikely to undergo m.yriscy lac ion . A 
Smith-Wacerman search of che nonredundant protein database 
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does noc reveal any significant: homologies tha- mighz 

suggesc a potential funccion for this domain. 

The 267 ammo acid catalytic domain of human ZCl is 
most related to human STE2 0 - subf amiily kinases, ZC3 (SEQ ID 
5 NO: IE), ZC2 (SEQ ID NO: 14), KHS2 (SSQ ID NO: 18), SOK- 1 

(G3:X99325), GCK (GB:U07349), and GEK2 (SEQ ID NO:107), and 
CO the C. elegans kinase encoded by the cosmid ZC504.4 
(G3:Z50029) sharing 90.6%, 90.2%, 50.6%, 47,4%, 45.4%, 42.5% 
and 32.6% amino acid idencity, respectively. The ZCl kinase 
10 domain shares 98.1% identity to murine NIK (GE:U88984). ZCl 

contains the potential "TPY" regulaccry phosphorylation sice 
in izs activation loop. This "TPY" m.otif is conserved in 
other STS20 - related' kinases , including ZC2 , ZC3 , ZC4 , GEK2 , 
KHS2, SULUl, SULU3 , ?AK4 . and FAK5 . 

Im-ediacely C- terminal co the kinase domain of human 
ZCl is a 237. amino acid region predicted to form a coiled- 
coil structure based on the Lupas algorichmi {Lupas, A. Meth. 
Enzymol . 266, 513-525 (1996)). This region of ZCl is most 
related to human STE20 -subf amily kinases, ZC3 (SEQ ID 
20 NO: 15), ZC2 (SEQ ID NO: 14), and GEK2 (SEQ ID NO : 1 0 7 ) , as 

well as to human PITSLRE (GB:U04S24) sharing 65.5%, 65.4%, 
25.3%, and 2 9.0% amLino acid identity, respectively. The. ZCl 
coiled-coil domain also shares 90.6% ammo acid hom.ology to 
murine NIK. .The C.eiegans homologue ZC504,4 shares 32.2% 
25 sequence identity over this region. 

Within the predicted coiled-coil domain of hum^an ZCl, 
and the related ZC3 , is a region predicted to form a leucine 
zipper (Lreu-X6-Leu-X6-Leu-X6-Leu-X20-Leu-X6-Leu) . The fact 
that this leucine repeat exists within a predicted coiled- 
30 coil structure suggests that the leucine zipper may have a 

high probability of serving as a dimerization interface 
(Hirst, J.D. ec ai Protein Engineering ■ 9 657-662 (1996)) 

11 
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mediating a pocencial inner- or incra -mciecuiar dirrieri za" ion 
of human ZCl . 

The 114 amino acid proline-rich region of human ZCl is 
most related co hum.an STE2 0 - subfamily kinases, ZC2 {SEQ ID 
5 NO: 14) and ZC3 (SEQ ID NO: 15), sharing 3 5. S%, and 24.9%, 

respectively. The ZCl proline-rich domain shares 36.4% ammo 
acid homology to murine NIK (G3:US8 984) . Three potential 
^'Pxx?'' SH3 dom.ain-binding motifs (I, II and III) are found 
v/ichin Che proline-rich region of hum.an ZCl, Morif I is 

10 conserved in human ZCl and C. elegans ZC504 . 4 (GB:Z50029). 

Motif II is conserved in ZCl, ZC2 , ZC3 , ZC4 and C. elegans 
ZC504.4. Mocif III is conserved in ZCl, ZC2 , ZC3 and ZC4 . • 
Mccifs II and III of mrarine NIK have been shown ro bind the 
SK3 mccif of the adaptor molecule Nek (Su, Y-C. et al, EM30 

15 J. 16, 127S-1290 (1997)), From this evidence , hum.an ZCl may 

have che potential to bind to Nek or other SH3 or WV-: domiain- 
containing proteins and participate in grov/ch faccor- induced 
signal ing pathv;ays . 

Tne 2 56 am.ino acid spacer region of hum^an ZCl is mosr 

20 related co human STE2 0 - subf ami ly kinases, ZC2 (SEQ . ID NO:14) 

and ZC3 (SEQ ID NO:lE), as well as to human PITSLRE 
(GB:U04S24), sharing 59.9%, 33.1%, 29.6%, and 25.4% ammo 
acid identity, respecc ively . It also shares 59.9% amino 
acid homology to murine NIK. The C. elegans homioloQue 

25 ZC504.4 has only limdced sequence similarity in this spacer 

region . 

The 343 am^ino acid C- terminal of human ZCl is most 
related uo human STE20 -subf amily kinases, ZC3 (SEQ ID 
NO: 15), ZC2 (SEQ ID NC:14), and ZC4 -(SEQ ID NO : 1 5 ) , sharing 
30 89.2%, 83.9%, and 42.3%, am;ino acid identi^:y^ respecc ivel v . 

The ZCl C-termdnal dom.a:Ln also shares 93.3% ammo acid 
identify to m/urine NIK. The C. elegans homologue ZC5G4.4 
also shares 68.7% amino acid identity v.'ith che C-cail of 

1^ 
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human ZCl . A lower, yet significant, horaclogy is also 

evident zo human KHS2 (SEQ ID NO: 13), GCK (GE:U07345!, and 
murine citron (GB:U07349) wich 26.6%, 23.1% and 36,2% ammo 
acid idenci-y, respectively. GCK is a STE20-family kinase 
5 whose C-terTiinal domain has been shown to bind the small G- 

pro-ein Rab8 (Ren, M. ec al , , Proc . Natl. Acad. Sci . S3, 
5151-5155 (1996)) . Citron is a non-kinase Rho-binding 
protein (Madaule, P. et al . , FE3S Lett. 377, 243-238 
(1995) ) . . ■ . . 

10 The sequence similarity of che C-cerm^nal region of ZC 

to proteins that have potential Rab- or Rho-brndmg domains 
suggests that ZCl may signal through a small G-protein- 
dependanc _ pathv/ay . . 

15 Mammalian ZC2 

The 4C5 5 bp hum.an ZC2 nucleotide sequence of the 
partial cDNA encodes a polypeptide of 1297 am.ino acids (SEQ 
ID NO: 14) with a predicced molecular mass of 147,785 
daltons. Analysis of the deduced amine acid sequence 

20 predicts ZC2 to be an intracellular serine/ threonine kinase 

lacking both a signal sequence and transmembrane domain,, 
however the cDNA clone lacks an initiacing ATG , so the full 
excenu of ic N-termiinus is not knov/n . The N-.cerminally " 
truncated ZC2 protein ■ contains a 255 amino acid catalytic 

25 domain with all the motifs characteristic of a 

serine/ threonine kinase, a 187 amino acid region predicted 
to form a coiled-ccil structure, a 184 amino acid proline- 
rich region, a 32 3 amdno acid spacer region, followed by a 
343 amino acid . C - terminal dom.ain containing a pocennial 

30 Rab/Rho-binding region. 

ZC2 is most closely related to the hum.an 3TE2 0- 
subfamily kinases ZC3 (SEQ ID NO: 15), ZCl (SEQ ID NO: 13), 
and ZC4 (SEQ ID NO : 1 6 ) , sharing 8 8,3%, 61.7%, and 4 1.9% 

1^ 
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ZC504 . 4 (GB : Z50C29) . 

The 2 55 amino acid catalytic domain of huirian ZC2 is 
most reiaced cc human 3TE20-subf amily kinases, ZCl (SEQ ID 
NO:13), ZC3 iSEQ ID NO : 1 5 ) , ' SOK- 1 (GE:X99325), ?CH32 (SEQ ID 
N0:1S), MSTl (GB:U1S297), and GCK (GB:U07349), and zo che C. 
eiegrans kinase encoded by the ccsmid ZC504.4 (GB:Z5002 9) 
sharing 90.2%, 89.8%, 49.0%, 4S.6%, 47.9%, 45.0 and 75.7% 
amino acid idencity, respectively. ZC2 contains the 
pocencial ^^tPY" regulacory phosphorylation site in its 
accivacicn loop. This "tPY" motif is conserved in ocher 
^TE2 0- reiaced kinases, including ZCl, ZC3 , ZC4 , GEK2 , KHS2 , 
3ULU1, SULU3 , PAK4 and PAK5 . 

Immediacely C- terminal co the kinase domain of human 
ZC2 is a 137 amino acid region predicced to form, a cciled- 
coil structure based on the Lupas algorithm (supra) . This 
region of ZC2 is most related to hum.an STE20 -subfamily 
kinases, ZCl (SEQ ID NO:13), ZC3 (SEQ ID NC:15), and GEK2 
(SEQ ID NG:107), as well as to hum.an PITSLRE (GB:U04S24), 
sharing 55'.. S%, 51.5%, 29.7% and 29.5% amino acid idencity, 
respeccively . The C. elegrans homologue ZC504.4 shares 30.8% 
sequence idencity over this region. Hum^an ZC2 lacks tne 
potential leucine zipper found in ZCl as a consequence of a 
29 am.ino acid deletion i-elative to ZCl and ZC3 . 

The 1S4 am.ino acid proline- rich region of human ZC2 is 
most reiaced to human STE20-subf amaly kinases, ZC3 (SEQ ID 
N0:15) and ZCl (SEQ ID NO:13), sharing 35.9% and 23. 5%, amino 
acid identicy, respeccively. Significanc homology is also 
evident to che m.u r i ne WW dcma i n - b i nd i ng o r o t e i n WE ? 7 
(GB:U92455), and to the human SH3 dom.ain-bmding procein 
3BP-1 {GB:X37671), with 27.7% and 25.3% amino acid identicv, 
respect ively . 
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2C2 contains two of the potential "Pxx?'' SH3 domain- 
binding motifs (II and III) found within the proline-rich 
region of human ZCl . Motif II is conserved in ZCl , ZC3 , ZC4 
and C. elegans ZC504,4, and Motif III is conserved in ZCl, 
5 ZC3 and ZC4 , Motifs II and III of murine NIK have been 

shcv/n to bind the SH3 mocif of the adaptor molecule Nek. 
From this evidence, hum.an ZCl may have the potential to bind 
to Nek or other SH3 or WW domain- containing procems, and to 
parcicipate in growth faccor- induced signaling pathways. 

10 The 328 amino acid spacer region of human ZC2is most 

relaced to human STE20-3ubf amily kinases ZCl (SEQ ID NO: 13) 
and ZC3 (3SQ ID N0:15), and to murine NIP: (GB:US3934), 
sharing 31.6%, 26.9% and 25.9% am.inc acid identity/ 
respectively. The C. siecraiis hom.clogue ZC504.4 has only 

15 limited sequence similarity in this spacer region. 

The 343 amino acid C-term.inal of human ZC2 is most 
related to human STE2 0 -subfamily kinases ZCl (SEQ ID NO:13), 
ZC3 (SEQ ID NO:15) and ZC4 (SEQ ID N0:16), and to murine NIK 
(GB :U88984 ), sharing 83.9%, 88.3%, "41.9%., and 3S.0%, am.ino 

20 acid identity, respectively. The C. elegans hom.ologue, 

ZC504.4, also shares 67.2% amino acid identity with the.C- 
tail of human ZC2 . A lower, yet significant, homology is 
also evident to human GCK (GB:UC7349), m.urine citron 
(GB:U07349), and the S. cere\risiae R0.M2 protein (GB:U19103), 

25 a Rhol GDP/GT? exchange faccor, with 22.3%, 22.2% and 21.9% 

amino acid identity,' respectively. 

The sequence similarity of the C-terminal region of ZC2 
to proteins that have potential Rab- or Rho-binding domains 
suggests that ZC2 , like ZCl, miay also signal through a sm.all 

30 G-protein-dependant pathway. 



^1 
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Mamma 1 i an 7.r--^ 

The 4123 bp human ZC3 nucieocide sequence of the 
partial cDNA encodes a polypeptide of 1526 amino acids (SEQ 
ID NO: 15) wich a predicted molecular mass of 149,906 
daltons. Analysis of the deduced ammo acid sequence 
prediccs ZC3 co be an intracellular serine/threonine kinase, 
lacking both a signal sequence and transmenibrane domain, 
however the cDNA clone lacks an initiating atg, so the full 
extent of it N-termius is not known. The N-cerminally 
truncaced ZC3 protein contains a 2 55 amino acid catalytic 
dom.ain v/ich all the motifs characteriscic of a 
3erine,''threonine kinase: a 221 amino acid region predicted 
to torm a cciled-coil scructure, a 204 amino acid proline- 
rich region, and a 3 03 amino acid spacer region followed by 
a 343 amino acid C-cerminal domain containing a potential 
Rab/Rho-binding region. 

ZC3 IS most closely related to the human STE20- 
subfamily kinases ZCl (SEQ ID NO:13), ZC2 {SEQ ID MO : 14 ) , 
and ZC4 (SSQ ID NO:16), sharing 62.0%, 61.0%, and 42.5% 
amino acid identity, respeccively and shares 46.7% amino ■ 
acid identity to a C. eJegrans kinase encoded by the cosmid 
ZC504 . 4 (G3 : Z5CC29) . 

The 255 amino acid catalytic domain of human ZC3 is 
most related to hum.an STE20-subf amily kinases, ZCl (SEQ ID 
NO:13), ZC2 (SEQ IDMO:14), SOK-1 (G3:X99325), KHS2 (SEQ ID 
NO:iS), GCK (GB:U07349), SULUl (SSQ IDNO:22), andGEK2 (SEQ 
ID NO:107), and to che C. elegans kinase encoded bv the 
cosm.id ZC504.4 (GB:Z50029) sharing 90.6%, 89.3%, 49.0%, 
48.3%, 45.0%, 43.1%, 42.3% and 76.7% amino acid identity, 
30 respectively. ZCl concains che potential "TPY" regulatory 

phosphorylation site in its activation loop. Tr.is "tfY" 
motif is conserved in other STE20-related kinases, 
including ZCl, ZC2 , GEK2 , KKS2 , SULUl, SULU3 , PAK4 and ?AK5 . 



20 
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Imrr^ediacely C-cerminal co the kinase dorr:ain of human 
ZC3 is a 221 aT:ino acid region predicted to for- a cciled- 
coii structure based on the Lupas algorithm (supra) . This 
.region of ZC3 is mosc homologous to human ■STE2 0 - subf ami 1 v 
kinases, ZCl {SEQ ID N0:13). ZC2 (SEQ ID N0:14;, and GEK2 
(SEQ ID NO:107), sharing 66.9%, 61.5%, and 27.5% identity, 
as v;ell as to rat PLC-beta (GB:A45493) and human PITSLRE 
(GB:H54024) sharing 29.6% and 25.9% amino acid identity, 
respectively. The C elegans homologue ZC504.4 shares 26. S% 
sequence identify over this region. 

Within -he predicted cciled-coil domain of human 2C3 , 
and the relaced ZCl, is a region predicted to formi a leucine 
zipper (Leu-X6-Leu-XD-Leu-Xo-Leu-X20-Leu-X5-Leu) . The fact 
that this leucine repeac exiscs within a predicted coiled- 
coil structure suggests nhac the leucine zipper m.ay have a 
high probability of serving as a dimerization interface 
(Hirst, J.D. at al Procem Engineering 9 657-562 (1996)) 
miediacing a" potential inter- or intra-molecular dimerization 
of humian ZC3 . 

The 2 04 amino acid proline -rich region of human ZC3 is 
most related to human 3TE2 0 - subf ami ly kinases, ZCl (SEQ -ID 
NO: 13) and ZC2 (SEQ ID NO: 14), sharing 6 6.9% and 61.5% amino 
acid identity, respectively. 

ZC3 contains two of the potential ^^PxxP" SH3' dom.ain- 
binding mocifs {II and III) found within the prol ine-- rich 
region of human ZCl. Mocif II is conserved in ZCl, ZC2 , ZC4 
and C. elegans ZC504.4; Motif III is conserved in ZCl, ZC2 
and ZC4 . Motifs II and III of miurine NIK have been shown to 
bind the SH3 motif of the adaptor molecule Nek. From this 
evidence; humian ZC3 m.ay have che pocential to bind co Nek or 
other SH3 or WW domain- containing proteins and particioate 
in growth factor- induced signaling pathways. 
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The 303 amino acid acid spacer region of human ZC3 is 
most reiaced to human STE20 -subfamily kinases, ZCi (SEQ ID 
NO: 13) and ZC2 (SEQ ID NO: 14) sharing 30.1%, and 27.1% amino 
acid identity, respeccively . The C. eiegrans homologue 
5 ZC504.4 lacks nearly the entire spacer region of ZC3 . 

The 34 3 amino acid C- terminal of human ZC3 is most 
related to hum.an STE2 0 - subfamily kinases, ZCI (SEQ ID 
NO: 13). ZC2 (SEQ ID NO: 14) and ZC4 (SEQ ID NO: 16), sharing 
39.2%, 83.9%, and 42.5%, amino acid identity, respectively. 
10 The C. elegans homologue ZC504.4 also shares 67.2% amino 

acid . identity with the C-tail of human ZC3 . A lov/er, yec 
significant, hom.ology is also evident to human GCK 
(G3:U07349), as well as to the non-kinases m.urine citron 
{G3:U07349) and the 5. cerevisiae R0M2 procein (GB:U1?103), 
15 a Rhol GDP/GTP exchange factor, wich 21.6%, 32.4% and 22.9% 

amino acid identity, respecuively . 

The sequence similarity of the C-terminal region of ZC3 
to proteins that have potential Rab- or Rho-binding domains 
suggests that ZC3 , like ZCI and ZC2 , may signal through a 
20 small G-protein-dependant pachway. 

Mam-alian ZC4 

The 3,634 bp human ZC4 nucleotide sequence of the 
comolece cDNA encodes a polypeptide of 1,22 7 amino acids 
25 (SEQ ID NO: 105) with a predicted molecular mass of 133,205 

Daltons. -Analysis of the deduced amino acid sequence 
predicts ZC4 to be an intracellular STE2 0 - subf ami ly kinase, 
lacking both a signal sequence and a t ransmemJorane domain. 
The full-length ZC4 protein contains a 25 am.inc acid N- 
30 cermmus, a 265 am.mo acid catalytic dom.am with ail the 

motifs characterisuic of a serine /threonine kinase, a ICS 
amino acid region predicted to form a coiled-coil structure, 
:31 am.ino acid prcline-rich region, a 40 ar 
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predicced to form a coiled-coil structure spacer region, a 

204 amine acid spacer region (domain B) , follov/ed by a 355 
a^tinc acid C- Lerniinal domain containing a potential Rab/Rho- 
binding region (domain C) . 
5 ZC4 is most closely related- to human ZCl' (SEQ ID NO: 13, 

also known as human HGK, human KIAA06S7, murine NIK, human 
AC0G5O35, human NIK, and C. elegans MIG-15) , ZC2 (SEQ ID 
N0:14, similar to partial sequence human KIAA0551), and ZC3 
(SEQ ID NO: 15) . An assembled genomic fragment in zhe 

10 database (ZS3850) is identical to ZC4 , except for 

inappropriate identiif icacion of the excn boundaries. (Abo 
ec ai . {1998} EMBO J. 17 : 652 7 - 654 0 . ) 

The 2 5 amino acid N-cerminal domain of hum.an ZC4 shares 
weak homology co human ZCl in its C-terminal extent, but 

15 otherwise does not reveal any significanc homologies to the 

protein database . 

The 2 65 amino acid catalytic domain of human ZC4 is 
most related to humtan ZCl (SEQ ID NO:13), ZC3 (SEQ ID 
NO: 15), and ZC2 (SEQ ID NO: 14), sharing 63%, 64% and 62% 

2(3 am.ino acid identity, respectively. 

Im.mediately C-term.inal to nhe kinase dom.ain of human 
ZC4 is a 108 am.ino acid region predicted co form a coiled- 
coil structure based on the Lupas algorithm (supra) . This 
region is most related to human ZCl (SEQ ID NO:13),.ZC3 (SEQ 

25 . ID NO:15;, and-ZC2 (SEQ ID NO:14), sharing 29%, 25% and 20% 

amino acid idennity, respectively. 

The 231 amiino acid proline-rich region of human ZC4 
dC'es not reveal any significant homtclogies no the protein 
database. This region of ZC4 contains cwc "Pxx?" motifs 

30 that could potentially bind to proteins concainmg SH3 or WW 

domains and allow ZC4 to participate in grov;th factor 
activated signaling pathways. In addicion, wichin the pro- 
rich domain of human ZC4 is a reoion ored:LCi:ed to form a 



S5 
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leucine zipper ( Leu -X6 - Leu -X6 -Leu-X6 -Leu- X2 0 - Leu -X6 - Leu } 
which may serve as a dimerizahion interface. The ZC STE20 
subfamily kinases (ZCl, ZC2 and ZC3) have similarly located 
^^?xxP' motifs and potential Leu zippers. 
5 immediately C-terminal zo the proline-rich region of 

human ZC4 is a 40 amino acid region also predicred to form a 
coiled-coil structure based on the Lupas algorithm. This 
region of human ZC4 does not reveal any significant 
homologies to the protein database. 

^0 The 204 am.ino acid acidic- and serine- rich dom.ain "B" 

of ZC4 does noz reveal any significanc homologies to the 
protein database . 

The 3 55 amino acid C-terminal of hum.an ZC4 is Ttost 
related to human ZCl (SEQ ID NO:13), ZC3 (SEQ ID NO:15}, and 

15 ZC2 (SEQID NO: 14), sharing 43%, 42% and 42% amino acid 

identity , respectively . 

The sequence simdlarity of nhe C-term.inal region of ZC4 
to proteins that have potential Rab- or Rho-binding domains 
suggests that ZC4 , like ocher ZC-subfamily STE20 kinases, 

20 may signal through a small G-protein-dependant pathv/ay. 

Mam.malian KHS2 

The 4 023 bp human KHS2 nucleotide sequence encodes a 
polypeptide of 394 amiinc acids (SEQ ID NO: 18) v/ich a 

25 predicted molecular m.ass of 101,327 dalrons . Analysis of- 

the deduced amino acid sequence px'edicts KHS2 to be an 
intracellular serine/ threonine kinase, lacking both a signal 
sequence and transmem.brane domiain. The full-length KHS2 
protein contains a 13 amino acid N- terminus, a 260 amino 

30 acid catalytic domain v;ith all the motifs characteristic of 

a serine / threonine kinase, a 73 am.ino acid spacer region, a 
ISS proline-rich region, followed by a 360 amiino acid C- 
terminal domtain conT:ai.ning a pocential Rab/Rho -binding site. 
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KHS2 is mosc closely related co che human STE20- 
subfa-^ily kinases KKSl (GE : Ul 77 12 S ) , GCK (GB:UC7349), and 
HPKl (GB:U07349), sharing 65.5%, 51.9%, and 44.9% amino aci 
identity, respeccively and shares 38.5% amine acid identicy 
to a C. elegans STK {G3:U55363) . 

The 13 amino acid N- terminal domain of hum.an 1-0132 does 
not reveal any significant: homologies that might suggesc a 
pccential function for this domain when examined by a Smith 
Waterman alignment to the nonredundant protein database. 
Human KHS2 lacks a glycine residue*^ at position 2,- and is 
therefore unlikely to undergo my^ristylat ion . 

The 2 60 am^ino acid catalytic dom.ain of humian KHS2 is 
most related to humian STE20-subf amily kinases KHSl 
(G3 : U177129) , GCK ( G3 : UO 7 3 4 9 ), , K?K1 (G3:U66464), SOK- 1 
(GB:X99325}, MST 1 ( G3 : Ul S 2 9 7 ) , ZCl {SEQ ID NO:13), and to 
the C, elegans kinase (G3:U55363), sharing 35.4%, 75.1%, 
67.7%, 51.4%, 48.1%, 49.8% and 72.0% am.rno acid identity, 
respectively. KHS2 contains the potential ^^TPY" regulatory 
phosphorylation site in its activation loop... This "TPY" 
motif is conserved in other 3TE2 0 - related kinases, 
including. ZCl , ZC2 , ZC3 , ZC4 , GEK2 , SULUl , SULU3 , PAK4 and 
PAK5 . 

The 73 amino acid acid spacer region of human KHS2 is 
most related to humtan 3TE2 0 - subf am^i ly kinases, KKSl 
(G3:U177129)-, HPKl (G3:U66454) and GCK (GB:U07349), sharing 
60.3%, 43.5% and 44.0%, amino acid identity, respectively. 

The 18 8 ammo acid proline -rich region of human KJ^S2 i 
most related to human STE2 0 - subf am.i ly kinases, HPKl 
(GB:U66464), GCK (GB:UQ7349) and KHSl ( GE : Ul 7 7 1 2 9 ) , sharing 
33.3%, 31.9% and 31.4%, am.ino acid identity, respectively. 

Tv7o potential ''Pxx?" SH3 domia in- binding motifs (I and 
II) are found v;ithin the proline-rich region of human KHS2 . 
Motif I is conserved with humtan KHSl and HPKl; motif II is 
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conserved wich GCK and ?CHS2 . A 192 amino acid region of 
human HP?:i ccncaining motif II has been sho^vn co bind co che 
C- terminal SH3 monif of the adaptor molecule Grb2 (Anafi, M 
et al, J. Biol. Chem. J. 272, 278C4-27811 (1997)). Hum.an 
5 KHS2 may bind SH3 or WW dom.ain- contiaining proteins throuch 

this proline-rich region. 

The 3 60 amino acid C-cerminal of human KHS2 is most 
related to KH31 (G3 :U177129) , GCK (GB:U07349) and,H?Kl 
(G3:Uc6464), and to the C. elegans kinase {GB:U55363), 
10 sharing 74.9%, 54 .-3%, 42.9%, and 31.0%, amino acid idencity, 

- respectively. GCK is a STE20-family kinase whose C-terminal 
domain has been shovm to bind the small G-protein RabS (Ren, 
M. ec ai . , Proc . Natl. Acad. Sci. 93, 5151-5155 (1996)). 

15 Mammalian SULUl 

The 4196 bp human SULUl nucleotide seauence encodes a 
polypeptide of 398 amino acids (SEQ ID NO:22) with a 
predicted miolecular mass of 105,402 daltons. Analysis of 
the deduced amino acid sequence predicts SULUl co be an 

20 intracellular serine/ threonine kinase, lacking both a signal 

sequence and transm.emJDrane domain. The full -length SULUl 
protein contains a 21 amino acid N-cerm-inus, a 2 56 amino 
acid catalytic domiain v/ith all che motifs characteristic of 
a serine/ threonine kinase, a 150 amino acid spacer region, a 

25 210 amdno acid region predicted co form; a coiled-coil 

structure, a 114 amino acid spacer region and a 147 amino 
acid C-terminal domain predicted to form a coiled-coil 
structure . 

SULUl is most closely related to the STE2 0 - subf amii Iv 
30 kinases murine SULU3 (SEQ ID NO: 24), hum;an SULU3 (SEQ ID 

WO:23), and to the C. elegans kinase SULU {GB:U11280), 
sharing 63.9%, 72.2% and 33.2% amino acid identity, 
respect ively . 

Hi 
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The 21 amino acid M-terminal domain of human SULUl is 
mosc related to murine SULU3 (SEQ ID NO: 24) and co the C. 
elegans kinase SULu (GB:U11280), sharing 86.3% and 52.3% 
amino" acid idenrity. Human SULUl lacks a glycine residue at 
posicion 2, and is therefore unlikely cc undergo 
myris toylat ion . A Smith- Waterman search of the nonredundant 
protein database does not reveal any significanc homiolcgies 
that n^.ight suggesc a potential function for- this domain. 

The 25b amino acid catalytic domain of hum.an SULUl is 
most related to miurine SULU3 (SEQ ID NO:24); and to human 
SOK-1 (GB : X95525) , STLK2 (SEQ IDN0:5), MSTl (GB:U18297), 
PAKl (GE:U24152), ZC2 (SEQ ID N0:14), and KKS2 (SEQ ID 
NO: 18) sharing S6.3%, 48.1%, 46.9%, 45.2%, 43.3%, 43.1% and 
42.0% amino acid identity, respectively. The C. elegrans 
SULU 5TK (G3:U11230) shares 62.3% sequence identity over 
this region. SULUl contains the potential "TPY" regulatory 
phosphorylation sice in its activation loop. This ^^t?Y" 
motif is conserved in other STE2 0 - related kinases, 
including ZCl, ZC2 , ZC3 , ZC4 , GEK2 , KHS2 , SULU3 , PAK4 and 
PAK5 . 

The 150 am:ino acid spacer region of human SULUl is most 
related co human SULU3 (SEQ ID NO: 23) and to the C. elegans 
kinase (GB : Ul 12 8C ) , sharing 53.5% and 10.4% amino acid 
ident icy, respect i vely . 

Immediately C-term:inal to the spacer region of human 
SULUl is a 210 amino acid region predicted to form a coiled- 
coil structure based on the Lupas algorithm. This region of 
SULUl is mosc related to SULU3 (SEQ ID NO:23), the C. 
eJegrans SULU kinase (G3 : U112 SO ) , GEK 2 (SEQ ID NO:107) and 
2C1 (SEQ ID N0:13), sharing 6 3 . 6 % , 2 5 . S % , 2 3 . 2 % . and 22.3% 
amino acid identity, respectively. 

The 114 amino acid spacer region humian SULUl is m^ost 
related to human SULU3 (SEQ ID NO:24) with 73.7% amino acid 
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sequence identity. A lower, yet significant, ncinology is 
also evident to murine PITSLRE (GE:U04S24) and DLK 
(GB:A5531S), human ZCl (SEQ ID'NOilS) and GEK 2 (SEQ ID 
NO:107), as well as to the C. eiegajis SULU STK (G3:U11230), 
sharing 39.7%, 25.4%, 29.5%, 23.6% and 37.6% amino acid 
identity, respectively. 

Immediately C- terminal to the second spacer region of 
hum.an SULUl is a 147 amino acid region predicted to form a 
coiled-ccil structure based on the Lupas algorithm. This 
region of SULUl is most related to human SULU3 (SEQ ID 
NO:24), ZCl (SEQ ID N0:13) and GEK 2 (SEQ ID NC:1C7), as 
well as to the C. elegans SULU STK (G3:U112S0), sharing 
/j.3-'s, 2S.4*, 25.1% and 39.5%, amino acid identity, 
re spe c t i ve 1 v . 



Mam.malian (human) SUI.TT3 

The 3824 bp partial cDNA human 3ULU3 nucleotide 
sequence encodes a polypeptide of 7 86 amino acids (SEQ ID 
NO:23) with a predicted molecular m.ass of 92,037 daltons. 
Analysis of che deduced amino acid sequence predicts SULU3 
to be an intracellular serane/threonine kinase lacking a 
transmembrane dom^azn. The N- terminally truncated hum^an 
SULU3 protein contains a 66 am.ino acid partial catalytic 
dom,ain followed by a 149 amino acid spacer region, a 210 
am.ino acid region predicted to form a coiled-coil structure, 
a second spacer region of 114 am.ino acids, a 24 7 amino acid 
C-termmal region predicted to form a second coiled-coil 
structure and a 100 amino acid C-term.inal tail. 

Human SULU3 is m.ost closely related murine SULU3 (SEQ 
ID NC:24), hum.an SULUl (SEQ ID NO:22), and to the C. elegans 
SULU kinase (GB:U11280), sharing 66.3%, 63.9% and 32.9% 
amino acid identity, respectively. The high sequence 
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homology between murine and human SULU3 suggescs ::hac these 
tivc proneins are orchologs of each other*. 

The 66 amino acid partial catalytic domain of human 
SULU3 is most related to murine SULU3 (SEQ ID NO:24), and to 
5 the human 3TE20 subfamily kinases ZCl (SZQ ID NO:13), STE20' 

(G3:XS9325), KHS 1 (GB : Ul 7 7 1 2 9 ) and GEK 2 (SEQ ID NO: 107;, as 
well as CO che C. elegans SULU kinase (G3:U11230), sharing 
83.3%, 47.0%, 45.5%, 43. 5%, 41. 8% and 5 5.6% am.ino acid 
i dene icy , respectively . 
10 The 14 9 am.ino acid spacer region of hum^an 3ULU3 is m.ost" 

relaced to m.urine SULU3 (SEQ ID NO:24), human STS20 
. (GB:X99325) , MSTl (GB:U18297), and CO the C. eiegans SULU 
kinase v.GB:U11280) sharing 98.7%, 21.9% and 21.3% amino 
acid identity, respectively. 
15 Immediacely C-terminal to che first spacer region of 

human SULU3 is a 210 amino acid region predicced co form a 
coiled-coil structure based on the Lupas algorithm. This 
region of SULU3 is mosc related to murine SULU3 (SEQ ID 
. • • ■ NO:24; , and cc human SULUl (SEQ ID NO:22), ZCl (SEQ ID 
20 NO: 13) and GEK 2 (SEQ ID NG:107), as well as -co the C. 

elegans SULU kinase (GB:U11280), sharing 99. 5%, 68.6%,, 
27.4% and 22.5% amino acid identity, respectively. 

The 114 amino acid second spacer region of human S.ULU3 
is miost related to m.urine SULU3 (SEQ ID MO: 24), and co human 
25 SULUl (SEQ ID NO:22) GEK 2 (SEQ ID NO:107), and ZCl (SEQ ID 

NO:13), as well as to the C. elegrans SULU kinase 
(G3:U11280), sharing 9 9 . 1% , 7 3 . 7 % , 24. 6%, 24.1% and 41.2% 
am.mc acid identity, respectively, 

Immediacely C-cermiinal co the second spacer region of 
30 human SULU3 a 247 ammo acid region predicced co form a 

coiled-coil scruccure based on che Lupas algorithmi (supra) . ■ 
This region of SULU3 is most related co hum.an SULUl (SEQ ID 
NO: 22) and ZCl (SEQ ID MO: 13 J as v;ell as to rac PKIM- 
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{GB:D2518C) murine pl60 ROCKl (GB:U5S512;, and the C . 
elegans SuLU kinase (GB:U1128C), sharing 73.7%, 26.7%, 24.0% 
and 21.0% amino acid identity, respectively. 

The IOC amino acid C-tail of human SULU3 is most 
5 related to a human prion procein (GB : L3 3 993 ) , with 45.0% 

amino acid identity. 

Mammalian t murine) 3ULU3 

The 224 9 bp murine, parrial cDNA SULU3 nucleotide 

10 sequence encodes a polypeptide of 743 am.ino acids (SEQ ID 

NO: 24.1 with a predicced molecular mass of 87,520 daltons. 
Analysis of the deduced amino acid sequence pred:LCCS SULU3 
to be an intracellular serine/ threonine kinase, lacking both 
a signal sequence and t ransmiembrane domain. The partial 

15 murine 3ULU3 protein contains a 2 5 amino acid N- terminus, a . 

243 amino acid catalytic domain with all the motifs 
characteristic of a serine/ threonine kinase, a 149 amino 
acid spacer' region, a 210 amino acid region predicted co 
form a coiled-coil structure, and a 116 amino acid spacer 

20 region. 

Murine SUIju3 is most closely relared to human SULU3 
(SEQ ID NG:23) and SULUl (SEQ ID NO:22), as well as co rhe 
C. elegrans SULU kinase (GB:U112 80), sharing 97,0%, 72.3% 
a.nd 38.4% am.ino acid identity, respectively. The high 

25 sequence hom.ology betv/een miurine and human SULU3 suggests 

chat these two proteins are orthologs . 

The 25 amino acid N-terminal domain of m.urine SULU3 is 
most related to human SUUUl (SEQ ID NO: 22) and to the C. 
elegans SULU kinase (GB:U112B0), sharing 70.0% and 44.4% 

30 amino acid iden-ity, respectively. 

[purine SULU3 lacks a glycine residue ac position 2, and 
is therefore unlikely to undergo myris toylat ion . A Smich- 
'Waterman search of the nonredundant protein database does 
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noz reveal any significant homologies than might suggest a 

pctencial funccion for this domain. 

The 248 amino acid cacalycic dom.ain of murine 3ULU3 is 
mosc relaced zo human SULUl (SEQ ID iSrC:22}, STE2C 
5 (GB:X99325) , ZCl (SEQ ID N0:13), and KKSl (GB:U77129), as 

well as to the C, elegans SULU kinase (GB:U11280), sharing 
S6.7%, 46.6%, 43.3-%, 59.4% am.ino acid identity, 
respectively. Murine SULU3 contains the potential "TPY" 
regulatory phosphorylation site in its activation loop. This 
10 nrppyi! rnotif is conserved in other STE2C-related kinases, 

including ZC2 , ZC3 , ZC4 , GSK2 , KH32 , SULUl, SULU3 , ?AK4 and 
?AK5 , 

The ,14 9 amino acid spacer of murine SULU3 is most 
related to human 3ULU3 (SEQ ID NO:23), SULUl (SEQ ID NO:22), 

15 and STE20 (GB:X99325), as well as to the C . elegans SULU 

(G3:U11230) and the S. cerGx^isiae 5TE20 (GB:L04655) kinases, 
sharing 9S.7%, 53.4%, 21.9%, 59.4% and 21.9% amino acid 
identity, respectively, 

Im.mediately C- terminal to the spacer region of murine - 

20 SULU3 is a 210 am.ino acid region predicted to form, a coiled- 

coil structure based on the Lupas algorithm. This region of 
murine SULU3 is most related to human SULU3 (SEQ ID NO:23), 
ZCl (SEQ ID N0:13), and GEK 2 (SEQ ID NO:107), as well as to 
the C. elegans SULU kinase (GE:U112S0), sharing 99.5%, 

25 27.4%, 22.5% and 29.2% am.ino acid identity, respectively." 

The 116 amino acid C-terminal spacer region of miurine 
SULU3 is most related to hum.an SULU3 (SEQ ID K!0:23), GEK 2 
(SEQ ID MO:107), and ZCl (SEQ ID MQ.-ib), well as to the C. 
elegans SULU kinase (G3:U112S0), sharing 93.3%, 24.6%, 24.1% 

30 and 40.5% am.ino acid identitv, respectively. 
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Marr.nr.ai ian f murine /hiirr.an ) SULU-1 

The 224 9 bp murine SULUS and the 3 824 bo iiuman SULU5 
cDNAs contain a 1620 nucleotide overlap (54i amino acids) 
with 90% and 98% DMA and amino acid sequence idencicv, 
respecciveiy . Owing co che high degree of sequence identic-,,- 
m tills expended overlap, we propose that chese are 
functional orthologues of a single gene. The combined 
murine /human 44 92 bp SUUJ3 sequence encodes a polypepcide of 
1001 amino acids (3EQ ID NO : 3 1 ) with a predicted m.olecular 
mass of 116,069 daltons. Analysis of the deduced amino acid 
sequence predicts SULU3 co be an intracellular 
serine/threcnine kinase, lacking both a signal sequence and 
transmembrane dom.ain. SULU3 contains a 25 amino acid N- 
cerminus, a 248 amino acid catalytic domain with all the 
motifs characteristic of a serine/threonine kinase, a 14 9- 
am.ino acid spacer region, a 210 amino acid region predicted 
to form a coiled-coil structure and a second spacer region 
or. 114 amino acids, a 247 amino acid C- terminal region 
predicted to fcrm^ a second coiled-coil structure and a 100 
20 amino acid C-cerminal tail. The m.urine SULU3 clone lacks 

the region from the second C-terminal coiled-coil cc the C- 
termnnus, v/hereas the hum.an clone lacks the N-cerm.inal 
domain, and all but 66 am.ino acids of the 243 amiino acid 
kinase domain. 

SULU3 is most closely related to SULUl (SEQ ID NO: 22) 
and the C. eiegans SULU kinase (GB:U112 80) sharing 72.3% and 
38.4% amino acid identity, respectively. 

The 25 amino acid N- terminal domain of SULu3 is most 
related to human SULUI (SSQID NO: 22) and co the C. elegans 
SULU kinase (GE:U11280) , sharing 70.0% and 44.4% amino acid 
identity, respec- i vely . 3ULU3 lacks a glycine residue at 
position 2, and is therefore unlikely to undergo 
myristylation. A Smi th- Waterm^an search of the nonredundant 
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protein database does not reveal any significan:: homolooies 

that might: suggest a potential function for this domazn. 

The 24 3 amino acid catalytic domain of SULU3 is most 
rela-ed ro human SULUl (SEQ ID NO:22), SOK-1 (GB:X99325), 
ZCl (SEQ ID NO:13), KHSl (GB:U77129) and the C. elegans SULU 
kinase (G3:U112S0), sharing 86.7%, 46.6%, 43.3%, 42.0% and 
59.4% amino acid idencity, respectively. SULU3 contains zhe 
potential ''T?Y" regulatory phosphorylation sice in ics 
activation loop. This "tpY" motif is conserved in other 
STE20 -related kinases, including ZC2 , ZC3 , ZC4 , GEK2 , KKS2, 
SULUl , ?AK4 and PAK5 . 

The 14 9 am.ino acid spacer of SULU3 is most related co 
SULUl (SEQ .ID NC:22) and SOK-1 (GE:X99325), and to zhe C. 
elegans SULU (GE:U11280), and S, cerevisi^ie 3TE20 
(GB:L04655) kinases, sharing 53,4%, 21.9%, 59.4% and 21.9,% 
amino acid identity, respectively. 

Immediately C-terminal to the spacer region of SULU3 is 
a 210 amino acid region predicted to form, a coiled-coil 
structure, based on the Lupas . algorithm. . This region is micst 
related zo ZCl (SEQ ID NO :13V, GEK 2 (SEQ ID MO: 107), and* 
the C. elegans SULU kinase (GB:U11280), sharing 27.4% 22,5% 
and 29.2% amino acid identity, respectively. 

The 114 amino acid spacer region of SULU3 is mosc 
related to human SULUl (SEQ ID NO:22), GEK 2 (SEQ ID 
NO:107), ZCl (SEQ ID NO : 1 3 ) , and to the C. elegans SULU 
kinase (GB:U11280), sharing 73.7%, 24.6%, 24.1% and 41.2% 
amino acid identity, respectively. 

Immediately C-terminal to the second spacer region of 
SULU3 is a 247 amino acid region predicted to form, a coiled- 
coil structure based on the Lupas algorithm. This region of 
3ULU3 IS mos- related to hum^an SULUl (SEQ ID NO: 22) and ZCl 
(SEQ ID NO:13), as.wel.. as to rat PKN (GE:D26130), murine 
pl60 ROCKl (GB:U53512: and zhe 0. elegans SULU kinase 
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(GB:nii2eC), sharing 73.7%, 2^.7%, 24.0%, 21.0% and 37.6% 
amino acid idenccLcy, respectively. 

The 100 amino acid C-cail of SULU3 is mosc related to c 
human prion protein {G3:L38993) with 45.0% ammo acid 
5 identity. 

Mammalian GEK2 

The 2 92 6 bp human GEK2 nucleotide sequence of the 
complete cDMA encodes a polypeptide of 96S amino acids (SEQ 

NO: 107) v;ich a predicted molecular m.ass of 112,120 
dalcons. Analysis of the deduced am.ino acid sequence 
predic-s GEPC2 co be an intracellular serine/ threonine 
kinase, lacking both a signal sequence and t ransmembirane 
aomain. The complece GEK2 protein contains a 3 3 amino acid 
15 N- terminus , a 251 amdno acid catalytic dom.ain wich all che 

motifs characceriscic of a serine/ threonine kinase, a 43 
amino acid spacer region, a 135 amino acid proline-rich 
region, a 252 amino acid region predicted co form a coiled- 
coil structure followed by a 244 amino acid region also 

20 predicted to form a cciled-coil structure. 

GHK2 is mosr closely related to rat ATl-46 (GE:U33472) 
(a partial cDNA chac extends from the middle of ::hs first 
potential coiled-ccil domain of GEK2 to the C-term.mus) 
m.urine LCPC (GB:D8972S) , Xenopus iaevis polo-like kinase 1 

25 (GB : AF100165) , and hum.an SLK (GB : A3002804 ) , sharing 91.3%, 

88.5%, 65.0%, and 44.7% amino acid identity, respectively. 
The high sequence homology betv/een human GEK2 , murine LCfC 
and ra- ATl-4 6 suggests that human GSK2 is a highly related 
protein co the rodent: form.s, or al cernac i vely , ics 

30 orthologue. Recently, ' a full-length version of GEK2 was 

reported (STKIO or human LOK AB01571S) . The 968 amino acid 
sequence is 99% idencical to GEK2 (SEQ ID N0:107). 



BNSDCCID: <WO 9953036A2_I_> 



wo 99/53036 



PCT/US99/08I50 



10 



15 



20 



25 



The 33 amine acid M-cerrriinal dorriain cf human GEK2 is 
most: related co murine LQK fGE:DS9723) and to human SLK 



Human GEK2 lacks a glycine residue at position 2, and 
is therefore unlikely co undergo myristylac ion . A Srrath- 
Wacerman search of the nonredundant protein database does 
not reveal any significant homologies than mighc suggest a 
potential function for this domain. 

The 261 amino acid catalytic domain of human GEK2 is 
most related to murine LOK (GB:D39728), rat ATl-46 
(GB:D39723) and human SLK (G3 : AB002804 ) as well as to a C. 
eisgans kinase (GB:Z8146C), sharing 97.7%, 90.8%, 54,5% and 
55.9% amino acid identity, respectively. GEK2 conta2.ns the 
potential "TPY" regulatory phosphorylation site m its 
activation loop. This. "TPY" motif is conserved in other 
STE2 0- related k:Lnases, including ZC2 , ZC3 , ZC4 , GEK2 , KHS2, 
SULUl , SULU3 , PAK4 and ?AK5 . 

The 4 3 amdno acid spacer region of human GEK2 is most 
related to murine LGK (GB:D3972S) and to human SLK, sharing 
83.7% and 77.6% amino acid identity^ respectively. 

The 135 amino acid proline-rich region of human GEK2 is 
m.ost related to murine LOK (GB:D8972S) with 66.2% amino acid 
identity, respectively. Within the proline-rich region of 
human GEK2 is a potential ^^PxxP" SH3-binding domain 
conserved with murine LOK. 

Im.mediately C-terminal to the proline-rich region of 
human GEK2 is a 252 amino acid region predicted to fcrmi a 
coiled-coil structure based on the Lupas algorithm.. This 
region of hum.an GEK2 is most related to rat ATl-46 
(G5:D89728), murine LOK (GB:D89723) and human SLK 
(GB : A3002804 - , and ZG2 (SEQ ID NO:14), sharing 90.8%, 86.9%, 
42.2%, and 29.7% am.ino acid identity, respectively. 



(GB:ABC02804) , sharing 100% and 54.5% 



ammo acic laentity, 



respect ivei v . 
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Immediacely C-terminal co the predicted coiied-coil 
s-ruccure of human GEP:2 is a second potential coiled-coil 
scruccure of 244 amino acids predicted based on the Lupas 
algorithm. This region of human GEK2 is most related no ra 
ATl-46 (G3:D3972S) and m.urine LOK (G3:D89723) as well as to 
human SLK (GB : ABC 02 3 04 ) and ZCl (SEQ ID i^O:13), sharinq 
91.8%, 92.6%, 70.4% and 26.7% amino acid identity, 
respectively. The C. elegans kinase (GB:ZS14 6C} shares 
31.5^ amino acid sequence identity over this reaicn. 

f V j a m m a 1 i a — ^ ^-r -ii 

The 3 6 04 bp human ?AK4 nucleotide sequence encodes a 
polypeptide of 6S1 amino acids (SEQ ID NO: 29) with a 
predicted miolecular Ttass of 74,875 daltons. Analysis of the 
deduced a^mdno acid sequence predicts ?AK4 to be an 
intracellular serine/ threonine kinase, lacking boch a signal 
sequence and transmembrane domain. The full-lenQth PAK4 
protein concains a 51 amiino acid N- terminus predicted to 
concain a rac-binding motif, a 173 amino acid insert 
relative to the knov/n mammalian ?AK proteins, a 169 amino 
acid spacer region, a 265 amino acid catalytic domain with 
all the mtotifs characteristic of a serine/ threonine kinase 
and a 23 amino acid C~termiinal tail. 

PAK4 is miost closely related to hum.an ?AK5 (SEQ ID 
NO:30),.PAK1 (GB: U24152), and PAK65 ( GB : U2 5 9 7 5 ) , as well as 
to a C. eiegrans kinase (GB: Z74029), sharing 76.8%, 49.5%, 
49.3%, and 34.6% amino acid identity, respec tivelv" . 

The 51 amino acid terminal domain of humtan ?AK4 is 
most related to human PA.Kl (GB:U24152), and PAK65 
lG5:U25975), as v/ell as to a Celegans kinase (GB: Z74029), 
snaring 50.0%, 50.0% and 4 9.0% amino acid identity, 
respect ively . 
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The 11 amino acid region az positions 13-23 of human 
PAK4 fics the consensus for a Cdc4 2 /Rac - binding rrotif 
{SXPX4 -oHXXH) (Burbelo, ?.D., Dreschel, D. and Hall, A. J. 
Bio. Chem. 270, 29071-29074 (1995)). 

The 173 arr.ino acid insert of human PAK4 , relative co 
zhe knov;n mammalian PAK proteins, is most related to a C. 
elegans kinase (GB: Z74029) with 39.0% amino acid identity. 

A Smi"h- Waterman search of the nonredundant protein 
database does nou reveal any significant homologies that 
might suggest a pocential function for chis region. 

The 169 amino acid spacer of human PAK4 does noc reveal 
any significant homologies that might suggest a pocential 
function for this region.. 

The equivalent spacer region in PA.Kl binds to che 
guanine nucleotide exchange factor ?IX (Manser, E. et al 
(199S) Molecular Cell, 1, 133-192) . Since PAK4 differs 
substantially from PA.Kl over this region, zhe spacer domain 
of ?A,K4 may differ in its guanine nucleotide exchange factor 
binding . specif icity, . relative to PAKl . , . 

The 265 amino acid catalytic domain of human PAK4 is ^ 
most related co hum.an PAK5 (SEQ ID NO:30), PAKl (GB:U24T52), 
GCK (GB:U07349), 30K-1 (GB:X99325), and SLK (GB : ABO 0 2 S 04 ) , 
as well as co che .C* elegans (GB: Z74C29), and S, cere^islae 
STE20-related kinases (GB:L04655), sharing 95.9%, 51.7%, 
41.3%, 39. S%, 37.4%, 60.2% and 47.9% amdno acid identity, 
respectively. PAK4 contains the potential ^^tpY" regulatory 
phosphorylac ion site in its activation loop. This "TPY" 
motif is conserved in other STE2 0 - related kinases ; ' including 
ZGl, ZC2, ZC3, ZC4 , GEK2 , KHS2 , SULUl , SULU3 and PAK5 . 

The 2 3 ammo acid G-tail of human PA.K4 ccntiains a 
sequence that is homiologous to a G-protein beta subunit 
binding site (Leeuv;, T. et al . Nature, 391, 191-195 (199S)). 
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PAK4 has, therefore , the potential to be activaced bv bcrr 
Coc42- as v/eli as G-protein-dependanc pachv;ays . 

Ma mma 1 i an ?AK5 

5 The 2,306 bp human PAK5 nucleotide sequence of the 

complete cDMA encodes a polypeptide of 5 91 amino acids (SEQ 
ID NC:1G3) with a predicted molecular mass of 64,071 
Daltons. Analysis of the deduced amino acid sequence 
predicrs PAK5 to be an intracellular STE20 -subfamily kinase, 
0 lacking boch a signal sequence and ::ransm.embrane domain. 

The full-lengrh PAPCS protein contains a 52 amino acid N- 
cerminus predicced co concain a p21 (small G-protein) 
binding domain (PDB or CRIB) , a 121 amino acid insert 
relative co the knov;n mam.malian PAK proteins, a 134 amino 
spacer region, a 265 ammo acid catalytic dom.ain v;it:h all 
the motifs characteristic of a serine/ threonine kinase and a 
19 amino acid C-termiinal tail. 

?A?:5 is mosc closely related to Kum.an PAK4 (SEQ ID 
NC:29), Drosophlla melanGgaster PAK (also known as "mushroom, 
bodies tiny") (AJ01157S), C45311.1b from C. elegans 
(Z74029) , and human PAK3 (Q13177) sharing 43% (327/674 aa), 
50% (330/651 aa) , 43% (234/435 aa excluding gap) , and 47% 
(190/4 05 aa excluding gap) amino acid identity, 
respectively. Recently, a full lengch version of PAK5 was- 
reported (PAK4 AF005045) whose 591 amino acid sequence is 
identical uo PAK5 (SEQ ID N0:103). (Abo, et al . (1998) EMBO 
J. 17 : 6527-6540) . 

Tne 5 2 amino acid N- terminal domain of hum.an PAK5' is 
most related zo human PAK4 (SEQ ID NC:29), Drcsophila 
melancgascsr PAK (AJ01I573), C45Ell.lo from C. sleaans 
(Z74029), and humian PAK3 (Q13177), sharing 65%, 57%, 54%, 
and 53% amino acid identicy, respeccively . 

iDD 
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The 11 amino acid region at positions 12-22 of hunnan 
PAr:5 (FIG. IS) firs the consensus for a small G-protein 
binding domain (PDB or CRIB) ' ( SXPX4 - 6HXXH) (Burbelo, P.D., 
Dreschel, D. and Hall, A. J. Bio. Chem. 270, 29071-29074 
(1995), hereby incorporaced by reference herein in its 
entirety including any figures, tables, or drawings.). 

The 121 amino acid insert of human ?AK5 shares 43% 
amino acid identity wich a similar domain from PAK4 (SEQ ID 
NO: 29), but that is absenc from, other knov/n PAKs . 

The equivalent spacer region in PAKl binds co rhe 
guanine nucleotide Byich^nge factor PIX -Manser, E. st al 
(199g) Molecular Cell, 1, 183-192 hereby incorporaced by 
reference herein in its entirety including any drawings, 
figures, or tables.). Since PAKS differs substantially from 
PAKl over this region, the spacer domiain of PAK5 may differ 
in its guanine nucleotide exchange factor binding 
specificity, relative to PAKl. 

The 134 amino acid col lagen- 1 ike region of human PAr'.B 
shares 34% amino acid identity to pro-a I type collagen from 
several species ' and is not present in ocher known PAKs. 

The 265 amino acid catalytic dom:ain of humian PAKS is 
most related to human PAK4 (SEQ ID NO : 2 9 ) , Drosophlla 
melanogaster PAK {AJ011578;, C45Bll.lb from. C. elegans 
(Z74029) , and human PAK3 (Q13177), sharing 73%, 30%, 61%, 
and 5S% amino acid identity, respectively. PAKS also 
contains the potential "TPY" regulatory phosphorylation site 
m its ac-ivation loop. This "TPY" motif is conserved in 
other STE20-related kinases, including ZCl, ZC2 , ZC3 , ZC4 , 
GEK2 , KKS2, SULUl , SULU3 and PAK4 . 

The 19 amino a.cid C-tail shares SC% am.ino acid i dent lev 
to a PAK- like homo'logue identified from: genomiic DNA 
(AL031552) , Furcherm.ore , this C-terminal region of hum.an 
PAKS contains a sequence that is homologous to a G-protein 
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beta subunit binding sice (Leeuw, T. et ai . Nacure, 3 91, 
1S1-1S5 {1993) hereby incorporaced by reference herein in 
its encirecy including any figures, tables, or drawings) . 
?AK5 has, therefore, che potential to be activated by both. 
5 Cdc42 as well as G-protein-dependant pathways. 

— ■ An-xoodies. Hvbridomas, Method? of Use and Kir =: fc-^- 
Detectisn of STR?n-Re 1 ^^r^d Kinp.f:;p.q 

The present: invention relates to an antibody having 
10 binding affinity to a kinase of the invention. The 

polypeptide may have the amino acid sequence set forth in 
SEQ ID MO: 5, SEQ ID NO : 6 , SEQ ID NO : 7 , 3EQ ID MO : 1 3 , SEQ ID 
NO: 14, SSQ ID NO : 1 5 , SEQ ID NO: 13, SEQ ID NO : 2 2 , SEQ ID 
NO: 23, SSQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID NO: 97, SEQ ID 
NO: 99, SEQ ID NO: 101, SEQ ID NO: 103, SEQ ID NO: 105, or SEQ 
ID NO: 107, or a functional derivative thereof, or ac least 9 
contiguous amino acids thereof (preferably, at least 20, 30, 
35, or 40 or more contiguous amino acids thereof). 

The present invention also relates to an antibody 



20 having specific bindin 



■a 



^Ifinity to a kinase of che 



invention. Such an antibody may be isolaced by comparing 
Its binding affmicy co a kinase of the invention wich ics 
binding affinity to other polypeptides. Those which bind 
selectively to a kinase of the invention would be chosen for 
use in methods requiring a discmcticn between a kinase of 
the invention and other polypeptides. Such m.ethods could 
include, but should not be limiced to, the analysis of 
al-ered kinase expression m tissue concaming other 
polypeptides . 

The STS20-.Relaced kinases of the presenc invencion can 
oe used in a variety of procedures and methods, such as for 
the generacicn of ancibodies, for use in idencifyi.ng 
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BNSDOCtD: <WO 9953036A2 J_> 



wo 99/53036 PCT/US99/08 1 50 

pharmaceutical corr.posi c ions , and for studying DNA/prctecn 

inceraction. 

The kinases of che present invention can be used to 
produce antibodies or hybridomas , One skilled in the art 
5 v;ill recognize that if an antibody is desired, such a 

peptide could be generated as described herein and used as 
an imrriunogen. The antibodies of the present invention 
include monoclonal and polyclonal antibodies, as well 
fragments of these antibodies, and humanized f orm.s . 

10 Humanized forms of the antibodies of the presenc invention 

may be generated using one of the procedures known in the 
arc such as chimer i cat ion or CDR grafting. 

The present invention also relates to a hybridcma which 
produces che above -described monoclonal antibody, or binding 

15 fragment thereof. A hybridcma is an immortralized cell line 

which is capable of secreting a specific mionoclonal 
an c i body . 

In general', techniques for preparing monoclonal 
antibodies and hybridomas are well knov;n in the art 

20 (Camvpbell,. "Monoclonal 'Antibody Technology: Laboratory 

Techniques in Biochemistry and Molecular Biology, " Elsevier 
Science Publishers, Amscerdam, The Netherlands, 1984; St. 
.Groch et al . , J. Immunol. Methods 35:1-21, 1980).. Any 
animal (mouse, rabbit, and the like) which is known to 

25 produce antibodies can be imtmunized with the selected 

polypeptide. Mechods for im.mranizat ion are well known in the 
art. Such methods include subcutaneous or int raper i tonea.1 
injeccion of the polypepcide. One skilled in the art will 
recognize thac the amcunc of polypeptide used for 

30 immunizacion will vary based on the anim^al w\^ich is 

immunized, the antigenicicy of che polypepcide and the sice' 
of ini ect ion . 
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The polypeptide may be modified or administered in an 
adjuvant in order to increase the pepcide ancigenicicy . 
Methods of increasing the antigenicity of a polypeotide are 
well known in the art. Such procedures include coupling the 
5- antigen v/ich a heterologous protein (such as globulin or p- 

galacc csidase ) or through the inclusion of an adiuvanc 
during immunization . 

For monoclonal antibodies, spleen cells from the 
immunized animals are removed, fused wich myelom;a cells, 

10 such as S?2/0-Agl4 myeloma cells, and allov/ed to become 

monoclonal anribcdy producing hybridom.a cells. Anv one of a 
numub^er of mechods well knov;n in the art can be used zo 
identif y ■ -che hybridoma cell which produces an ancibody with 
the desired characteriscics . These include screening the 

15 hybridomias with an ELISA assay, western blot analysis, or 

radioimmunoassay (Lutz et al . , Exp. Cell Res. 175:109-124, 
19SS). Hybridomias secreting the desired ancibodies are 
cloned and the class and subclass are determined using 
procedures knov/n in the art (Campbell, "Monoclonal Antibody 

20 Technology: Laboratory Techniques in Biochemdst ry and 

Molecular • Biology " , supra, 19S4). 

For polyclonal antibodies , ancibody- containing ant i sera 
is isclaced fromi the imimunized animial and is screened for 
the presence of antibodies winh the desired soecificitv 

25 using one of the above - described procedures. The above- 

described antibodies m.ay be detectably labeled. Antibodies 
can be detectably labeled through the- use of radioisotopes, 
affinity labels (such as biotin, avidin, and the like) , 
enzym.atic labels (such as horse radish perc^xidase, alkaline 

^0 phosphatase, and che like) fluorescent labels (such as FITC 

or rhcdamine, and the like), param^agne t ic accmts, and che 
like. Procedures for accomplishing such labeling are well- 
knov.-n in the art, for example, see Scemberger et al . , J. 

lo4 
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Hiscochem. Cytochem. 18:315, 1970; Bayer et al . . Mech. 
Enzym. 62:308-, 1975; Engval acal.. Immunol. 105:129-, 
1S72; Coding, J. Immunol._Mech. 13:215-, 1976. The labeled 
■antibodies of the present invention can be used for in 
5 vitrc, in vivo, and in sicu assays to identify cells or 

tissues which express a specific pepcide. 

The above -described antibodies may also be immobilized 
on a solid support. Exam.ples of such solid supports include 
plastics such as polycarbonate, complex carbohyaraces sucn 
10 as agarose and sepharose, acrylic resins and such as 

■ polyacrylamide and latex beads. Techniques for coupling 
anribodies to such solid supports are well known in the art 
(Weir er-ai., "Handbook of Experimental Imm.unology" 4ch Ed., 
Blackwell Scientific Publications, Oxford, England, Chapter 
15 10, 1936; Jacoby et al . , Meth. Enzym.. 34, Academic Press, 

N.Y., 1974). The immobilized antibodies of the presenc 
invention can be used for in vitro, in vivo, and in situ 
assays as well as in immunochrom.otography . 

Furtherm.ore, one skilled in the art can readily adapt 
20 currenrly available procedures, as well as the cechniques, 

m.ethods and kits disclosed herein with regard to antibodies, 
CO generace peptides capable of binding co a specific . 
pepcide sequence in order to generate rationally designed 
antipeptide peptides (Hurby ec aJ . , "Application of 
25 Synthetic Peptides: Antisense Peptides", In Synthetic 

Pepcides, A User's Guide, W.H. Freeman, NY, pp. 23 9-307, 
1992; Kaspczak et al . , Biochem.iscry 2 8:9230-9233, 19B9). 

.J\nti -pepcide pepcides can be generaced by replacing tn' 
basic amino acid residues found m che pepcide sequences of 
30 the kinases of the invention with acidic residues, while 

maintaining hydrophobic and uncharged polar groups. For 
example, lysine, arginine, and/or histidine residues are 
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reoiaced v/ith aspartic acid or glutarr.ic acid and glutamic 
acid residues are replaced by lysine, arginine or hiscicine. 

The ores en c invencion also encompasses a rrtechod of 
detecting a STE2 0 - related kinase polypepcide in a sarrsple, 
5 comprising: (a) contacting the sample v/ith an above- 

described ancibody, under conditions such chat 
immunocomplexes form, and (b) detecting the presence of said 
antibody bound to the polypeptide. In decail, the mechods 
comorise incubating a rest sample wich one or more of the 

10 antibodies of the present invention and assaying ivhecher the 

antibody binds co che test samiple. Altered l-^vels of a 
kinase of zhe invencion in a sample as compared to normal 
' 1 eve 1 s m.ay i nd icate disease. 

Condicions for incubating an anribody with a test 

15 sample vary. Incubation conditions depend on the format 

em:T3loved in che assav, the detection miethods employed, and 
the cype and nature of the antibody used m che assay. One 
skilled in the art will recognize that any one of the 
com:mionly available immunological assay form.ats (such as 

20 radioimim.unoassays , enzym^e - 1 inked immiunoscrbent assays, 

diffusion based Ouchterlony, or rocket immiunof luorescent 
assays) can readily be adapted to employ the ancibodies of 
the oresent invencion. Examples of such assays can be found 
m Chard ("An Introduccicn to Radicim.m.unoassay and R.elated 

25 Technioues" Elsevier Science Publishers, A^miscerdam^, The 

Necherlands, .19B6), Bullock et al . ("Techniques xn 
Im.m.unocytochem.iscry , " Academ.ic Press, Orlando, FL Vol. 1, 
1932; Vol. 2, 19S3; Vol. 3, 1985), Tijssen ("Pract2.ce and 
Theory of Enzymie Im.m.unoassays : Laboracory Techniques in 

30 Biochemistry and Molecular Biology, " Elsevier Science 

Publishers, Am.scerdam, Tl.e Necherlands, 1985) . 

The im.miunological assay cest samples of che present 
invencion include cells, crotein or memi-rane extraccs of 

lOli) 
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cells, or biolcgical fluids such as blood, serum, piasrr.a, or 
urine. The cesc sa~?lss used in nhe above-described -ethod 
will vary based on che assay format, nature of the detection 
method and the tissues, cells or extracts used as the sample 
5 to be assayed. Methods for preparing pro-ein extracts or 

membrane extracts of cells are well knov.-n -n che art and can 
be readily be adapted m order to obtain a sample which is 
testable with the system utilised.. 

A kit contains all the necessary reagents to carry out 
10 the previously described m.ethods of detection. The kit m^y 

comprise: (i) a first container means containing an above- 
described antibody, and {ii; second container m.eans 
containing a conjugate comprising a binding partner of the 
antibodv and a label. In another preferred embodiment, the 
15 kit further comprises one or more other containers 

comprising one or more of the following: wash reagents and 
reagents capable of detecting the presence of bound 
antibodies. ■ 

Examples of detection reagents include, but are not 
20 lim.ited to, labeled secondary antibodies, or in the 

alternative, if the prim,ary antibody is labeled, the 
chrom.ophoric, enzym.atic, or antioody binding reagents which 
are capable of reacting with the labeled antibody. The 
• com.partm.entalized kit m^ay be as described above for nucleic 
25 acid probe kits. One skilled in the art will readily 

recognize that the ' ant ibodies described in the present 
invention can readily be incorporated into one of the 
established kit formats which are well knov.'n in the art. 
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VI . Isolation of Ccnriocunds Which Interc^ct V-7ich S TE2 0- 
H.elated Kinases 

The ore sent invention also relates to a Tie c nod or 
detect: ing a compound capable of binding to a STE2 0 - related 
kinase of the invention comprising incubating the compound 
v;ith a kinase of the invencxon and defecting the presence of 
the comoound bound co zhe kinase. The compound may be 
oresenc v/ithin a complex mixture, for exam;ple, s.erum, bcoy 
fluid, or cell extracts. 

The ore sent invention also relates to a me e hoc or 
dececting an agonist or antagonist of kinase accivicy or 
kinase binding partner accivity comiprisrng rncuoacmg cells 
thar produce a kinase of the invention in che presence ot a 
compound and detecting changes in che level of kinase 
activicy or kinase binding partner activity. The compounds 
thus identified would produce a change in activity 
indicative of the presence of the compound. The compound 
miav be present wichin a complex mixture, for example, serum:, 
bodv fluid, or cell . extract s . Once the compound is 
identified ic can be isolacsd using techniques well knov/n in 
the aru. 

The ore sent invencion also encompasses a m^ecnoa or 
aaonizing { s t im.ulating) or antagonizing kinase associated 
activitv m a mam.mal comprising administering co said mam.mal 
an agonist or antagonist to a kinase of the invention in an 
amount sufficient to effect said agonism or antagonism.- A 
method of treating diseases in a m.ammial with an agonist or 
anragonisc of STE2 0 - related kinase activity comprising 
admiinistering the agonist or ancagonisr to a mam.m.al m an 
amcunr sufficient zo agonize or antagonize 3TE2 0 - related 
kinase associaced functions is also encompassed in the 
oresenr aoolication . 
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in an effort to discover novel treatments for diseases, 
biomedical researchers and chemists have designed, 
synthesized, and tested molecules that inhibit the function 
of protein kinases. Some small organic molecules form a 
S class of com.pounds chat modulate the function of protein 

kinases. Examples of m.olecules that have been reported to 
inhibit the function of protein kinases include, but are not 
limited to, bis monocyclic, bicyclic or heterocyclic aryi 
compounds (PCT WO 92/20642, published November 26, 1992 by 
10 Maguire at al.), vmylene -azaindole derivatives (PCT WO 

94/14808, published July 7, 1994 by Ballmari et al.), 1- 
cvclooropyl - 4 -pyr idyl -qumolones (U.S. Patent No .. 
.5^,3,3'^ 952-; , snyryl compounds (U.S. Parent No. 5,217,999), 
scv^vl-substicuted pyridyl compounds (U.S. Patent No. 
IS 5,io2,606), certain quinazoline derivatives (EP Application 

NO. 0 566 266' Al), seleoindoles and selenides (PCT WO 
94/03427, published Feoruary 17, 1994 by Denny er al.), 
tricvc-.ic oolvhvdroxylic compounds (PCT WO 92/21660, 
published December 10, 1992 by Dow), and benzylphosphonic 

. - 'P— 91/1 5495, published October 17, 1991 

20- acid compounds \P^i. v\'-' i^i/ -s'^t^- / f 

by Dow et al) . 

= - -. = T- r7-^ verse cell memibranes and are 
Gompcunas tha^ v-an tia^eri^ti 

• ■ ^resistanc co acid hydrolysis are potentially advantageous as 
therapeutics as chey can becom.e highly bioavailable after 

r-. = r-ient= Hovvever, many ol 
25 being administerea oraxxy uo ^■a^.<=..^^. 

■these protein kinase inhibitors . only wec.K-,. .n-.x^_c -n_ 

^^^^A^ k-nases In addition, m.any inhibi 
function 01 protci-i K^nasei= . 

c ^i-^-.-, iHn3=;.= =; and will cause miultiple side- 
varietv of protein r^massfe aii'--^ 

effects as therapeutics for diseases. 
30' Some indolinone compounds, hov/ever, form classes or 

acid resistant , and -rembrane permeable organic m.clecules . 
-95/22976 {published August 1, 1596 by Ballmari et al . ) 
descrioes hydrosoluble indolinone compounds that harbor 
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cetraiin, naphthalene, quinolme, and indole s'ubst xcuents 



lUS 



ed ro the oxmdole rxng . These b^cyclxc substituents are 



U.S. 



in cum substituted with polar tr.oieties including 
hvdroxvlated aikyi , phosphate, and echer moieties 
Patent Application Serial Nos . 08/702,232, filed Augusc 23, 
1996, entitled "Indolinone Combinatorial Libraries and 
p.iar.d Products and Methods for the Treatment of Disease- 
by Tang ec al . (Lyon Lyon Docket No. 221/137) ^and 
08/435,323, filed June 7, 1995, entitled Benzyl laene- ^i- 
indcdlne Compounds for the Treatment of Disease" by Tang et 
ai: (Lvon Lyon Docket No. 223/29-3) and International 
P^-,..e'p"biicacion WO 96/22976, published August. 1, 1996 by 
Ballinari et al . , all of- which are incorporated herein by 
reference in their entirety, including any drawings, 
describe indolmone chemical libraries of indolinone 
corr.counds harboring other bicyclic moieties as well as 
monocvclic moieties fused zo the oxindole ring. 
Ap-l^cat^ons 08/702,232, filed August 23, 1396, enritlea 
-indo-.^none Coml3inatorial Libraries and Related Products and 
Methods for the Treatm.ent of Disease" by Tang et al. (Lyon . 
Lvon Docket No. 221/187), 08/485,323, filed June 7, 1995, 
enr^tled Benzvl idene- Z - Indol me Compounds for the Treatment 
of Disease" bv Tang ec al . (Lyon s. ^yon u.^^r..^'- - 
and WO 96/22976, published August 1, 1996 by Ballmari et . 
.5 ^-a.-r methods of indolmone synthesis, methods of 

..=tino the biological activity of indolinone compounds m 
cells, and inhibition patterns of indolmone derivatives. 

ether examples of substances capable of modulating 
kinase activity include, but are not limited to, 

- , s r-'s , and auinolines. 

10 tvrohostins, oumazoi _n-==, , .^.u^-u^-----'-- 

The'auinazolmes, tyrphostms, quinolmes, and qumoxolmes 
re-erred to above include well known compounds such as tnos. 
irscnbed m the literature. For exam.ple, representative 

10 
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oublications describing qu.nazol.nes include Barker ec al . , 
EPO Pub^icacion No. 0 520 722 Al ; Jones ec al . , U.S. Facenc 
NO .,447,603; Kabbe ec al . , U.S. Patent No. 4,757,072; Kaul 
and Vougioukas, U.S. Patent No. 5, 316,553; Kreighbaurr, ana 
Corr.er, U.S. Patent No. 4,343,940; Pegg and Wardleworth, EPO 
ou-ication NO. 0 562 734 Al ; Barker et al . , Prnc . of A:t^. 
..... f..- r.nc^- Rese arch 32:327 (1991); Bertino, 
r^.r..^-,- P.search 3:293-304 (1979); Bert.no, J.R., Cancer 
P^..arch 5(2 part l):2S3-304 (1979); Curtxn ec al . , 3r_J. 
ria:^ 53:361-358 (1966); Fernandes et al . , C^nr^.- Research 
.3.1 1 1 7-1123 (1983); Ferns et al . -T Ora . Chen.. 44(2) :173- 
17S; Frv et al . , ^c.ence 2 6 5:1093-1095 (1994); Jackman et 
a^., Cancer_Research 51:5579-5586 (1981); Jones et al. J. 
M^^. Che:.. 29(6) : 1114-1118; Lee and Skxno, p - nchemx scrv 
26 (23) : 7355-7362 (1987); Lemus et al . , -T 0>-a . Cnen. 
54:3511-3518 (1989); Ley and Seng, S ynthesis 1975:415-522 
(1975); r4axwell et ai . , Mann^M c Re-o^pnr-P Medicine 
17:189-196 (1991); Mmi et al . , Canr-^- Research 45:325-330 
(1985); Phillips and Castle, J H^terocvcM.c Chem. 
■ 17(19) :1489-1596 (1980); Reece et al . , r^nr-^r Research 
47(11) :2996-2999 (1977); Soulier et al . , ^ = T^pr^unol ■ and 

Tn.^.r.other. 23:A65 (1936); Sikora et al . , r.n^.^ Letters 
2 2:289-295 (1984)\- Sikora et al . , Analytical Biocnem. 
172:344-355 (1988); all of which are incorporated herein by 
r-eference in their entirety, including any drawings. 

Quinoxaline is described in Kaul and Vougioukas, U.S. 
Patent No. 5,316,552, incorporated herein by reference m 
ics entirety, including any drawings. 

Quinolmes are described m Dolle ec al . , M-n . Chen-. 

27:2627-2629 (1994;; MaGuire, ,T Med. Chen:. 37:2129-2131 

, . T M^d --^"^e^ 36:425-432 (1992); and 

{1994;; Burke et a^. . , «J_: — ' ■-'-^ — •- 

o,.,.>. =r .1 ^--^ ^'^""^ Letters 2:1771-1774 
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(19-52), ail of which are incorporaced by reference in cheir 
entirecy, including any drav/ings . 

Tvrphostins are described in Allen et al . , Clin. Sxo . 
Tmrnunol . 91:141-156 (1993); Anafi eC al . , Blood 82:12:3524- 
3529 (1993); Baker et al . , -T. Cell Sci. 102:543-555 (1992); 
Eilder et al . , Amer. Phv ^iol. Sec, pp. 63 63 - 5143 : C72 i -C73 0 
(1991.1; Brunton et al . , Proceedi nc.g of Am.er . Assoc. Cancer 
T?sch ■ 33:553 (1992); Bryckaerc et al . , Experimental Cell 
Research 199:255-261 (1992); Dong et al . , J. Leukocvte 
Biology 53:53-60 (1953); Dong et al . , J. Immunol . 
151 (5) :2717-2724 (1953); Gazit et al . , J. Med. Chem. 
32:2344-2352 (1939); Gazit et al . , " J. .vied. Chem. 36:3556- 
3564 (1995/; Kaur ec ar-.r -P^n.-er Druas 5:213-222 

(1594); Kaur et al . , King et al . , P^ochem. J. 275:413-418 
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VX T . 3:Loloaical s lani r 2-canc^ , — .".pp- iv-^' — 

p.=.l^v-anr:g of Mr-.y^l ST E2Q -Relar-^n Kinases 

unp.^n ■qTLK2, .qTT.K3. ST T.K^ . sTL.-rCS , .^TLK6 , and STLK7 

STLK2, STLK4, STLK5 , STLK6 and STLK7 belong to an 

5 expanding family of mcraceilular STKs that have varying 

degrees of sequence homology to SOK-1, a kinase implicaced 

in' oxidative stress agents (Pombo, CM et al , EMBO J. (17) 

4537-4545, 1996). Our data shows that STLK2 is expressed 

highly m hemacopoietic cells. Therefore, STLK2 may 

10 participate in the oxidative response pathway during 

^ inflammation. In addition, 3TLK2 could also be a possible 

component in the signaling pathways leading to T cell 

activ.= tion. Hiah levels of STLK2 in several tumor cell 

- ^ -n ^~y^->^- t-h^\- =5TLK2 mipht be involved in 
lines couici also i..ipj.v cnac sli-iz^.^ . m. ■-).-'- 

15 cumorigenesis . 

STLK2 is most -closely related to two human STE20- 
subfamily kinases: MST3 and SOK-1. MST3 is a 52,000 
daluons cytoplasmic kinase that is ubiquitously expressed 

■ with ics highest levels of expression found in heart, 

20 skeletal muscle and pancreas. The serine/ threonine kinase 

acciviry' of MST3 is activated by phosphorylation-. Unlike 
30K-i, MST3 prefers Mn" over Mg^' and can use both GT? ana 
ATP as phosphate donors. M3T3 may undergo dimerization. No 

■ agonists have yet been identified that activate MST3 . The 
25 downs-.ream signaling m.echanism of this kinase is unknown 

(Schinkmann, KandBlenis, J. (1997) J. Biol. Chem. 272, 
2S695-28703) . 

SOK-1 is a 50,000 daltons -cytoplasmic Kinase expressed 
predominantly in testis, large mcestme, brain and scomach 
30 and cc a lesser exrenc in hearc and lung. 30K-1 is also 

. ^-^^ c^nc'-r B-cell line (RAMOS) and in 

a rn.acure B ceil line (Hcu bu.-'^an..' . -l--- ^- 

kinase activicy or ouh.-x acc-.c., . 

Iii> 



3NSDOCID: <WO 9953036A2 J_> 



wo 99/53036 PCT/LS99/08I50 

The C- terminus of SOK-1 has been shown to be inhibi-cry cc- 
the catalycic accivicy of chis kinase. The only agonists 
knov/n to activate SOK-1 are oxidant agents, like H.O^ and 
menadione, a quinone that is a potent intracellular 
generator of. reactive oxygen species (Pombo, CM. ec al . 
EM30 J. 15, 4537-4546) . SOK-1 is also activated by chemica: 
anoxia through the generation of reactive oxygen species anc 
release of calcium into the cytoplasm from, intracellular 
stores. SOK-1, therefore, may play an important role in 
ischemia, the cause of myocardial infarction, stroke and 
acute renal failure (Pcmbo, CM. st al. J. Biol. Chem. 272, 
29372-29379 (1997)). The activity of SOK-1 in the response 
to oxidant- stress is inversely correlated with the activity 
of the stress-activated protein kinases (SAPKs) : elevated 
SOK-1 activity correlates with absent SAPK activity and 
vice-versa. SOK-1 does not activate any of the four MAP 
kinase pathways, SAPKs, p33, ERK-1 or MEK-5/ERK-5 (Pombo, 
C.M. et al. EMBO J. 15, 4537-4546). The downstream 
signaling mechanism of this kinase rem.ains unknown. 

STLK2 is expressed in a wide variety of immune cell 
types and tissues including thym^us; dendrocytes, mast cells 
monocytes, B cells (prim.ary, Jurkat , RPMI , SR) , T cells 
(CDS/CD4-f, TKl, TH2, GEM, M0LT4 ) and megakaryocytes (K562), 
whereas STLK3 is restricted to thymus and STLK4 is 
predominately expressed in thymius, T cells (CD4/CD8^, THl , 
CEM) and 3 cells (Jurkat, RPMI). Consequently, these STKs 
miioht particioate in the oxidative response pathway during 
inf lam.mat ion, reperfusion injury (stroke, surgery, shock) , 
TNFa-mediated signaling, insulin desensitizat ion, 
atherogenesis , vascular injury, T or B cell cost imiuiat ion , 
or alternatively, participate in other ?^LAPK- related signal 
transduction processes. 

in 
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STLK5 is more discantiy related co nhis 3TE20-subraniily 



mav 



including SOK- 1 and STLK2 , STLK3 and STLK4 . STLKS 
therefore mediate a signaling pathway that is distinct from 
the oxidative stress response pathway. 

The high degree of sequence homology in the C- termini 
of SOK-1. STLK2, STLK3, STLK4 , STLK5 , and STLK6 raises 
possibility that these novel STKs , like SOK-1, may be 
subject to autoinhibition through a conserved C- terminal 

motif . 



:he 



Knn-)an ZC , ZC2 ■ ^.C-^ and ZC4 

ZCl is a good candidate for any disease in which 
tvrosine- kinase, cytokine., or heterotrimer ic G-protein 
coupled recepcors have been implicated. The m.ouse nomologue 
15 binds to NCK, and is recruited to activated PDGF (Su et 

al.,EMBO 16: 1279-1290, 1997). The Drosophila homolog has 
b--n shown uo bind tc TRJ^-F2 , implicating it in TNF-a 
sionaiino (Liu et al . , (1999) Curr. Biol. o.,oi-lC4. 1999)). 
While ZCl does not contain the exact NCK- and TRAF2 -binding 
20 domains, it is likely to bind to related proteins. 

Of the ZC subfamily of STE20 -related protein kinases, 
ZCl has very broad over-expression in many tumor types, 
suooesting- that it may be involved in cellular growth, 
crl^sformacion, or tumor progression. A truncated form of 
ZCl containing only the C-term.inal putative MEKKl -binding 
domain was found to reduce the number of foci generated by 
H-Ras-V12 in Rat Incestmal Epithelial cells (RIE-1). These 
data indicate that ZCl m.ay play a role m the ability for 
these cells to overcom^e contact inhibition and anchorage- 

30 dependent growth. . 

The ZCl homolog, Misshapen (msn) in Drosophila 
melanogaster was cloned as a result of complem.ent ing a 
rr.utacion in a developmencal pathway required for dorsal 

\\6 
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closure, a process involving changes in cell shape and 
positricn in the embryo (Treisman ec al . Gene 186 119-125, 
1997) . A D. melanogaster homolog of che JNK1/JNK2 kinases 
from mammals was shown to function dov/nstream of rnsn in che 
5 dorsal -closure signaling pathway (3u et al . Genes Dev. 

12 : 2371-23S0, 1998) . 

While ZCl could be involved in multiple aspects of 
tumtorigenesis , by analogy with Drosophila , the role of 
misshapen in dorsal closure suggests a critical role in che 

10 regulation of che cycoskeleton for the processes of cell 

attachment, cell movement and perhaps magr-ation. 

The association of the ZCl fam.ily mtemibers rnsn and NIK 
with TRAE2 miay indicate a role for this kinase in cell 
survival and/or in apoptosis. The ZCl fam.ily contains a 

15 highly conserved dom.ain that in the mouse homiolog, NIK, has 

been shown to bind to MEKKl (Mitogen-act ivated/ 
Extracellular-regulated Kinase Kinase 1) (Su et al , , (1997) 
EMBO 16(6): 1279-90). MEKKl is involved in cell survival 
and/or apoptosis in several systems (Schlesinger et al . , 

20 Front. Biosci . 3 : D1131 -6 , 1998). Depending on the context, 

MEKKl appears to be upstream of either the ERKl/MAPK or the 
JNK/SAPK pathway [Schlesinger et al . , (1998 Prone. Biosci. 
3:DllSl-6;. Three homiologues of ZCl: m.urine NIK (NCK- 
interacting kinase) (Su et al . EMBO 16:1279-90, 1997), 

25 Drosophila msn (Liu et al.Curr. Biol. 9:101-104, 1999) and 

human HGK (HPK/GCK- like kinase) (Yao et al . , J. Biol. Chem. 
274:2118-25,. 1999) have all been shown to activate the JNK 
pathv/ay when over-expressed in 293T cells. 

ZCl shares a high degree of homology with these other 

30 family memb^ers in both the kinase dcmiain and the "MEKK"- 

binding domains, yet it differs in the mnervening region, 
which contains several putative binding domtains for upstream 
Signaling adapter molecules (e.g. NCK, TPJ\F2) . Unlike the 
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Other family rr.embers, ZCl does not appear co accivace zhe 
JNK pachv/ay in 293T cells as seen by ics ability to induce 
expression of either a JUN or AT?2 -driven lucif erase gene. 
Upon CO- transf ection into these cells wich KA-cagged JNK, 
modest accivacion of JNK was detected. ZCl also modestly 
activated co- rransf ected ERKl . Both the ERK and the JNK 
activacicn were very slight compared with the positive 
controls in the assay (activated forms of MSKl and KZKKl , 
respectively) . In both cases, activation required the full- 
length kinase. While the kinase domain alone is up to 5x 
more active in autophosphorylat ion and in phosphorylation of 
MEP, it does not lead . to act ivation of these potential 
dovmscream kinases. Based on the strong sequence homology 
of ZCl with the other family members, it is very likely chat 
ZCl v;ill be important for either JNK or SR?: activation once 
the proper ■ context is found. 

ZCl profoundly inhibits ERKl kinase expression in co- 
transf ection assays. This effect is dependent on ZCl kinase 
activity, occurring with the wild-t>iDe and the kinase domain 
20 alone, but not with the kinase-dead mutant even chough all 

three forms of ZCl are expressed at similar levels. This 
may suggest a role for this kinase m transcriptional or 
post - transcriptional regulation . 

ZCl may be an important comiponent in the signaling 
25 pathways mediated by the co- stimulatory receptor - CD2 8 in T 

cells and/or by the pro- inf lam.matory cytokine TNFa, since 
co-cransf ection of the wild-type ZCl activated the RE/A?- 
lucif erase and NFkB- lucif erase reporter genes. While our 
data showed that ZCl strongly activates NFkB m T-cells, no 
activation of NFkB driven luciferase was detectable in NIH 
3T3 cells. A recent oaoer (J. 3iol . Chemi. 2 74:2118-25; 
1995.) has shown that a hum.an ZCl splicing isoform, HGK, is 
involved in the TNFa- signal ing pathv/ays . 

Ill 



30 
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Given the imporcance of T cell acciva-ion in 
autoirri.muniuy and transpiancat ion, as v.-ell as che ke-- role 
chat TNFa plays in xni lammarory diseases, ic is possible 
that ZCl could be a therapeutic target for im.n^;unoiogical 
diseases which include buc are not lirrdced co : rheumatoid 
arthritus, chronic inflammacory bowel diseases (ie Crohn's 
disease), chronic inflammatory pelvic disease, mulciole 
sclerosis, asthma, osteoarthritis, psoriasis, 

atherosclerosis, rhinicis, and autoimmunity as well as orca 

rransplanranon and cardiovascular diseases. 

ZCl appears to be che human orthologue of murine MIK 

and possibly an orthologue of a C. eieaans STE2 0 - subf am^r ly 

kinase encoded by the ZC504.4 cosmid. 

Murine NIK is a 140,000 dalcons kinase chat is most 

highly expressed in brain and heart. NIK interacts with th^ 

SH3 domains of the adapcor miolecule Nek through its proline 

rich regions found in che C-termrnal excra-cacalyt ic region 
The specific regions that miediace chis interaction are two 

Pxx? motifs that are nearly uniformly conserved between NIK, 

ZCl, 2, 3 and .the C. eiegans STE20 ZC504.4 kinase. In 
addition, NIK binds MEKKl through its 719 amino acid C- 

cermmal (Su, Y~C. et al , {1997; EMBO J. 16, 1279-1290). 
MEKKl is a m.emJcrane-associated kinase responsible for 
activating MKK4 (also known as SEKl) , which in cum 
activaces SA?K (Yan, M et al . (1994} Nature, 372, 798-300). 

NIK m:ay function as a kinase chac links growth factor 
activaced pathw^ays and the stress - response pathway mediaced 
by SAPKs. According to this hypothesis, activation of 
growcn factor recepccrs leads to receptor cyrosine 
phosphorylation, Nek binding to tne phospnory 1 acec cyrosmes 
via ics 5H2 domain, NIK red^scr ibucicn to a membrane 
compartm.enc via binding za che SH3 dom-am of Nek, and 
J uxtaposicion to che membrane-associaced MEKKl. The NIK- 

lis 
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MEKP'l inceraccion would, m this fashion, Cum on the 5Ai^?- 
pathway in response to growch factor st in:^.ulat ion (Su, Y-C. 
et al. (1997) EMBO J. IS, 1279-1290). 

Given the high homology between ZCl, ZC2 , ZC3 , and ZC4 
5 STKs and NIK, it is conceivable that these kinases may each 

function to connect growth factor- and stress -activated 
signaling pathways. The heterogeneity that the ZC kinases 
exhibit within their putative SH3-binding dcraain could 
provide signaling specificity m terms of the nature of the 
10 adaptor molecule that they bind. The high level of sequence 

conservation in the C-termini of the ZCl, ZC2 and ZC3 
strongly suggests that these human kinases, like murine NIK, 
also may bind to MEKKl and activate SAPKs . The ZC kinases 
also display strong homology at their C-termini to protein 
15 domains chat bind sm.all QTPase proteins such as Rab , Rho and 

Rac. For example, the C-term.ini of ZCl is 36.2% identical 
to citron,, a murine Rho-binding protein, and 23.1% identical 
to the rab-bmding region of GC kinase. This suggests that, 

--. ■ • — "i oc; oTi? 1 'i GT^ase oroteins m.av 

• in aaaiticn uo aoapuor moi^^uie*, sm^^_ v_i.-ci.-_ y 

20 also mediate membrane association and activation of tne ZC 

kinases. The presence of a potential coiled-coil region 
located imm.ediately C-terminal to the catalytic region 
strongly suggests that zhe ZC kinases m.ay also be subject to 
regulation via homo or heterodimerizat ion events. 

25 The C. slegans STE20 ZC504 . 4 kinase is the product of 

the mig-15 gene. The product of this gene has been 
imolicated in several developm.ental processes such as 
epidermal developm.ent , Q neuroblast m.igrations and m.uscle 
arm targeting in the developing worm (Zhu, X. and Hedgecock 

- = --1-^=. 1-1 If:) Th- hic;h level of 
30 E. (1997; Worm Breeder = oa^ct-s i- , / c ,- . -i-_ ---^^ 

sequence conservation between the ZC kinases and the ZC504 . 4 

j-^^ ^ ^2 eaan s kinase ill rria k e 
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crganisrTi cc study, through episuatic analysis, the signaling 
properties cf the human ZC kinases. 

Hu-an KHS2 

5 KHSl (kinase homologous to 5PS1/STS20) is a 100,000 

dalcon cytoplasmic STK zhac is expressed ubiqui uousiy . KHSl 
has been implicated in the mechanism of SAPK accivation in 
response cc inflammatory cytokines such as TNFa as well as 
to ulcraviolighc light, vmich also uses zhe TNF signaling 

10 pachv;ay. TNFa binding co its recepcors (TNFRl and TNFR2 ) 

resulcs in che sequential asscciacion v/ith the recepcor C- 
tail of multiple signaling molecules including TNFRl - 
associated- death dom^ain procein (TRADD) , Fas -associated 
death domiain protein (FADD or MORTl) , TNFR- associated factor 

15 . 2 (TFw2lF2) , and the STK RIP (receptor interacting procein) . 
The TRADD-T?J^.F2 interaction is mediated by a conserved 
region presenc at the C-cerminus of TRAP 2 , the TFJ\F domain. 

Activation of the NFkB and SAPK pathways is mediated by che 
ring finger mocif present: at the N- terminus of TRA.F2 (Curr. 

20 Opinion in Cell. Biol. (1997) 9:247-251). FCKSl is activated 

by TNFa stimulation in a TRA.F2 -dependant manner and 
inhibition of KHSl blocks TNFa- induced SAPK aczivacion but 
not NFkB accivation. The mechanism by which TRA.F2 accivaces 
KHSl is not known. Cocransf ecc ion of TRAF2 - and KH31- 

25 expressing constructs in 293T cells failed to reveal a 

direcc association between these two molecules;. KHSl 
accivaces che SA.PK pathway by a direcc associacion with che 
constitut ively active kinase MEKKl . MEKKl subsequently 
5. c c 1 vac e s S EKl , v;h i c h in t urn a c t i va t e s S A? ?C . Ne i t he r c he 

30 MAPK nor che p3£ kinase pathways are accivaced by I-CHSl (Shi, 

C-S and Kehrl'. J.K. (1997) J. Biol. Chem. 272 / 32102- 
32107) . In addition to ics. catalytic domain, dov;nscream 
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signaling of llHSl requires ics conserved C-cermmus (Diener, 
K. et al (1997) ?rcc . Natl. Acad. Sci . 94, 9687-9692). 

GCK (ger'iTiinal cencer kinase) is a const iuuciveiy active 
S7,0C0 dalcon STK chat is broadly expressed. GCK may 
participace in B-cell differentiation since its expression 
is localized to the germinal center within lymphoid 
follicles. GCK activates the SAPK pathway in response to 
TNFa via activation of 3EK1 . The upstream activators of GCK 
in response to cytokines as well as the immediate downstream 
cargec of this kinase are unknown. The C-cerminus of GCF: is 
sufficient to accivace SEKl (Pombo, CM. et al (1995) 
Nature. 377, 750-754). 

The murine orthologue of GCK, rabSip (rabS-interaccing 
protein), is a 97,000 dalton protein that fractionates with 
both the soluble cytoplasmic fraction as vvell as with a 
salt-sensitive fraction associated with the basolateral 
membrane of the trans -Golgi region in polarized MDCK 
epithelial cells. The C-term.inus of rab8ip binds to rab8 , a 
.small GTP-binding protein required for vesicular transport 
from the Golgi apparatus (Ren, M. et al, (1996) Proc . Natl. 
Acad. Sci. 93, 5151-5155). In addition to inducing the 
transcriptional activation of cytokines like IL2 via SAPK, 
GCK may also promote the rab- dependent release of secretory 
proteins in response to TN?r/ (Buccione, R. et al (1995i Mol . 
Bio, Cell 5, 291) . 

HFKl (hematopoietic protein kinase) is' a 
const itutively active 90,000 dalton STK restricted to 
hematopoietic cells. H??:i activates the SAPK pathway by 
directly binding to and activating MEK?:i (Hu, M. ec al 
(1995) Genes and Dev. 10:2251-2264) as v.-ell as the 
ubiquitously expressed mixed- 1 ineage kinase MLK-3 (Kiefer, 
F. et al (1996) EM30 J. 15:7013-7025). This function of 
HPKl requires, in contrast to GCK, both its kinase dom.ain as 
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well as its C-terminus. The upscrearr. activators of HPKl 
rerriain unkncv/n . HPKl also plays a key rcle as a mediator cf 
transforming growth factor-(3 (TGFp) signaling. HPKl 
accivates the TGFb-act ivated kinase (TAK) , which in turn 
5 stimulaces the SAPK pathway by phcsphorylat ing SEKl (Wang W. 

et al (1997) J. Biol. Cherri. 272:22771-22775). 

KHS2 is expressed in thymus, dendrccytes and monocvres. 
KHS2 could have a ccmplemencary function to chat of KHSi as 
a rrtediator of SAPK activation in the cellular response to 
10 inflammatory cytokines. KHS2 could have the pocential to 

interact directly v/ith TRAF2 since a STK with the predicced 
molecular weight of KHS2 (approximately 101,000 daltons) is 
found in . the TNFR-TPAF2 complex upon TNFa stimtulation 
(VanArsdale, T. and V7are, CF. (1994) J. Im^munol . 153, 3043- 
15 3 050) . The presence of a putative binding dom:ain for Rab or 

a.Rab-like molecule ai: the C-terminus of KHS2 indicates that 
KHS2, in addition to having a potiential role in the TRAF2 - 
dependant TNFa cytokine response, could also mediate 
signaling events that utilize small GTPase proteins. 
20 Alternatively, the binding of a small GTPase procein t:o che 

C-terminus of KHS2 may be required for its potential TRAF2 - 
dependant signaling to a downstreami kinase such as MSKKl . 

Human GEK2 . SULUl and SULU3 

A recent report (Y-W Qian et al . , Science 282:1701- 
1704,1998) described xPlkkl as the activator of Plxl (the 
Xenopus Polo kinase) . In Xenopus oocytes, the STK Plkkl can 
phosphorylate and activate Plxl STK (the mam.malian Polo 
kinase or PLK) . A dominant - negac ive (kinase -dead) form, of 
30 xPlkkl prevents Plxl activacion and delays germinal vesicle 

breakdown. Yet another unidentified kinase is probably 
responsible for xPlkkl activation during micosis. 
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The homology through che entire length of the xPlkkl 
protein with GEK2 suggests that GEK2 "ight represenc the 
human homologue for xPlkkl . Based on this, GEK2 might be 
upstream of PLK in mammalian cells. In addizicn, based on 
the phage display screen results using the SULUl coiled- 
ccil2 domain as bait, SULUl might also interac:: in vivo vjith 
GEK2 and therefore regulate GEK2 (and/or 3LK through zhe 
coiled-coil domain) activation leading to PLK accivation and 
mitosis. 

If such a cascade of events is required for mitosis in 
mammalian cells, interruption of this signaling cascade ar 
any point might block micosis and could be beneficial for 
cancer nreaumenc . 

A recently cloned STE2 0 - subf ami ly kinase, rac TAOl , is 
most likely the rodent orthologue of humian SULU3 
(Hutchinson, M, et al . J. Biol. Chem 273:28625-28632, 1998). 
TAOl activates MSK3 , 4 and 6 in vitro, while m cransfected 
cells it associaces and act ivates only MEK3 , resulting in 
phosphorylation and aczivation of p38. Thes.e results 
implicate TACl (SULU3) in the regulation of the p3S 
containing stress-responsive MAP kinase pathway. 

Human SULUl is v;eakly expressed in hem.ai::opoiet ic 
sources whereas SULU3 is found in B-cells and THl - res t ricted 
T cells. These mammalian SULU STKs display strong homology 
to the C. elegrans SULU kinase. The role that this kinas.e 
plays in nematode development is unknown. The strong 
sequence homology between the catalytic domain of mammalian 
SULU kinases and other 3TE2 0 - subf amiiy kinases such as SOK-1 
(human STE2 0) and KHS2 suggests that the mammalian kinases 
may participate in the stress-response pathway. The 
potential coiled-coil domains found ac the C-terminus of the 
SULU kinases may play a role in che regulation of this 



kinase . 
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Murine LOK ( lyiriphocyte-orienced kinase) is a 
const iuucively activated STK cf approximaceiy 13 0,000 
daltons that is predominantly expressed in spleen, thymus 
and bone marrov/ (Kuram.ochi, S, et al (1997) J. Biol. Chem. 
272: 22679-22684) as well as in meiotic testicular and 
primordial germ ceils. The LOKl gene is located in 
chromcsomie 11 of the mouse near the v/r locus, a region thac 
is associated with reproductive and neurological defaces 
(Yanagisawa, M. et al (1996) Mol , Reprod. and Dev. 45:411- 
420) . LC:< does not: activate any of the known MA?K pathways 
(ERK, JNK and p3 8) nor the NFkB pathway. The upstream 
signaling elem.ents of LOK as well as the extracellular 
scimuli chat utilize this kinase co elicit a biological 
response are also unknown (Kuramochi, S . et al (1997) j. 
Biol. Chem. 272: 2 2 6 7 9-22684). 

Kum.an GEK2 is highly related to murine LOK, but based 
on sequence divergence in the non-catalytic domain, it 
appears to be a distinct member of this STE20-subf amily . 
GEK2 m.ay signal "hrough a pathway that remains to be 
defined. The presence of potential coiled-coil regions at 
the C-terminus cf GEK2 could play a key role in regulating 
the functions of this kinase. 

Human ?AK4 and PAK5 

The p21 activated protein kinases (PAK) are a closely 
related subgroup of the STE2 0 family of serine/ threonine 
kinases. Extensive genetic and biochem^ical analysis of the 
budding yeast STE20 has shov;n the critical role this 
serine/threonine kinase plays at the junccure of several 
important: innracellular pathways required to appropriately 
respond to extracellular signals; STE2C links the 
transcriptional response by mediating the accivation of the 
appropriate dov/nstream MA?K pathway as- well as coupling 
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changes in cellular morphology via ics control of the actin 
cytoskelet on . 

A hallT.ark of the PAK subgroup is their small G 
protein-binding domain (?BD) that confers G pro^iein- 
dependent activation upon chis group of kinases. Via the 
?BD, PAFCs bind to activated small G proteins resulting in 
the derepression of the PAK' s intrinsic kinase activity. 

Until recently, there were three known PAK kinases: 
PAKi , a 6S kD protein whose expression is restricted 
expression to brain, muscle, and spleen; PAK2 (PAKI, PAK65), 
a 62 kD protein whose expression is ubiquitous; and PAK3 , a 
65kD protein whose expression is restricted to the brain. 
Similar to STE20, the m:amimalian PAKs (1,2, and 3) have been 
shown CO respond to extracellular signals {growth factors, 
mitogens, cytokines and a variety of cellular stresses) 

(Bagrodia, et al . (1995). J. Biol. Chem, 270: 22731-22737; 
Zhang, S., et ai . (1995), J. Biol. Chem. 270: 23934-23936, 
Frost., J. et al . (199S) J. Biol. Chem. 273: 23191-2S19S; 
Galisteo, M. et al . (1996) J. Biol. Chem. 271: 2 0 997-21000), 
and are linked to TCR activation (Yablonski, D., et al . 

(1998) EM3C J. 17: 5647-5657), and he t erocr imer ic G protein- 
coupled receptors (Knaus, U. et al . (1995) Science 269: 221- 
223) . 

The PAKs were originally identified as effectors for 
members of the Rho fam.ily of sm.all G proteins (such as Rac 
and Cdc42), hence their namie, p2 1 -accivaced kinases (PAK) 
(Manser et al Nature 367:40-46) . The recruitment of the 
PAKs to the appropriate intracellular- location is critical 
to cheir function. Arcempts to elucidate the role played by 
PAKs m intracellular signaling and morphological changes is 
complicared due to the com.plex interactions by v/hich they 
can be recruited by such factors as activaced small G 
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proceins (rac, cdc42). adaptors (nek) and eyichange prczeins 
(FIX, Cocl) . 

The adapccr molecule. Nek, is consci tut ively* bound via 
ICS SH3 domain to the proline-rich motif in nhe M-cerminal 
5 portion of . PAKl . Binding of the Nck-?AK complex to 

activated growth factor receptors in response co grov.-th 
factor stimulation provides a mechanism to link growth 
ractor-stim.ulated and stress -response pathways (Galisteo, M 
et al . {1996) J. Biol. Chem. 271:20997-21000). 
10 The ?BD found at the N- terminus of PAKl is responsible 

for Its high-affinity interaction with the GT?-bound form.s 
of Cdc-;2 and Rac (Burbelo, P. et al . (1995) J. Biol. Chem. 
270:29071-29074). The exact mechanism through which the 
small GTPases activate PAKs may involve, in part, 
association of the kinase with activated growth factor 
receptors through guanine nucleotide exchange factors 
(GEFs) . GEFs activate small GTPases by catalyzina the 
formation of their GTP-bound state, thereby orcm.otinc their 
association with, and activation of, PAKs. The known 
mammalian PAK kinases, as well as Drosophlla. and C. elecrans 
PAKs, all conserve an N-term.inal extracat alyt ic motif 
responsioie tor a high-affinity interaction with the GEF, 
PIX. The PAK-Cdc42 interaction and subsequent PAKs occurs 
as a PIX/PAK complex (Manser, E. et ai . (1993) Molecular 
Cell , 1 , 133-192 ) . 

PAK signaling stimulated by heterotrim^er ic G- proteins 
is mediated through the interaction between a short 
conserved amdnc acid region located at the C- terminus of 
PAKl with the G-protein (3-subunit (Leeuw, T. et ai. (1993) 
Nature, 391: 191-195). 

A variety of studies have indicated that the human PAKs 
are involved* m mediating the ac t iva t ion ' of stress -act ivated 
protein kinase pathways (JNK and to lesser extent o38) . 
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PAKs are also potential niediators in the crosstalk becv/een 
t:he pachv;ays regulated by che Rhc family of small G proteins 
and the signaling pathways directly downstream of Ras 
leading to the activation of the ERK pathway (Bagrcdia, et 
al, (1995). J. Biol. Chem. 270: 22731-22737; Zhang, S., en 
al, (1995). J. Biol. Chem. 270: 23934-23936; Brown, J., et 
al. (1996) Curr Biol. 6:598-50596; Frosc, J., et al . (1996). 
Mol . Cell. Biol. 16: 3707-3713). 

PAKl has been implicated in phcsphorylat ing a 
regulatory site in MEKl that is necessary for MEKl ' s abiliry 
tc inceract v;ith Rafl (Frost, et al. (1997) EMBO J.. 16:6426- 
6433) . PA?:3 has been shown to phcsphorylat e. Rafl on a site 
thac is important for Rafl acuivicy (King, A., et al . 
{1993). Nacure 396: 1S0-1S3;. 

PAKs play an imporcant role in controlling 
m^orphological changes in cell shape mediated by the actm 
cytoskeleton . Such morphological changes are required for 
cellular functions ranging from, cell division and 
proliferation to cell mo-tilicy and vesicle -ransporc . PAK 
activity has been implicated in the localized assembly 
(leading . edge ) and disassemb-ly (retracting edge) of focal 
adhesions necessary for cell motilicy (Frost J. et al (199S) 
J. Biol. Chem. 2 7 3:28191-23198). 

PAK2 m.ay ha^ve a role in the morphological changes 
induced during apcptosis (Memirane and morphological changes 
in apoptoEic cells regulated by caspase -m.ediaced accivation 
of PAK2 . (Rudel, T, (1997) Science. 276:1571-4)), and PAKl 
may be important in preventing apcptosis (Faure S, et al . 
(1997) EMBO J. (1997) 16:5550-61). In addicion to 
overcoming mioogen- and anchorage - independent growth, tumor 
cells need to escape che programm.ed cell death that 
accomtpanies deregulated cell growch. Thus, inhibition of 
PAKs may be effective in triggering apcptosis in tum.ors . 

^^7 
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A direcc requirement for ?AKs in nhe cransf or-a- ion of 

mammalian cells has been shewn for ?AK1 and PAK2 . Kinase- 
dead alleles of PAP:i block ras cransf crmacion of RATI and 
Schv/ann cells (Tang, Y., at al . (1997) Mol . Cell. Bid. 17, 
5 4454-4464) . Dominant - negat ive alleles of PAK2 have been 

shown CO interfere with ras -mediaced transformation of 
mammalian cells (Osada, S, , (1997) FEES Lett 404:227-233) 

Mucacions m ?AK3 have been implicaced in nonsvndromic 
X-imked mental retardation suggesting a role for PAK3 in 
10 cognitive function (Allen, K. et al . (199S) Nat. Genet. 20: 

25-30) . PAKl has been implicated in neurice oucgrowth in 
?C12 cells (Daniels, R. en ai . (199£) EMBO J. 17: 754-754; 
Nikclic, et al , (1998; Nacure 3 9 5:194-193). 

Finally, FA.K-like STKs may also play a role in AIDS 
pathogenesis since che myr is toy la ted 27kD m:em±>rane- 
associated HIV Nef gene product directly interacts with and 
activates these kinases via cdc42 and Rac . The Nef -mediated 
activation of PAK-like STKs correlates with the induccion of 
high viral titers and the development of AIDS in infected 
20 hosts (Cullen, B. R. (1996) Curr . Biol. 6:1557-1559). 

Cur results show that ?AK4 is expressed m thymus, 
ctenarocyt es , mtasc cells, monocytes, as well as in T cells 
(TK2 -restricted cells and M0LT4 ) and the B cell line RPMI . 
PAK5 is f ound in m.ast cells and in the T cell line MOLT4 . 
25 These data "suggest potential roles for PAK4 and PAK5 m the 

imim:une systemi . ' 

PAK4 and PAK5 share with the known PAKs a potential 
cdc42 -binding motif at their M-termiini. Both PAK4 and PAK5 
display sequence hcmioiogy in their C-termiini to a motif 
30 responsible for an interaction between PAKl and the (3- 

subunic of he terotrim.ic G-proteins \^L^a±no acid residues 665- 
675 in PAK 4, and amino acid residues 336-398 m FAKS). 
Consequently, ?A?C4 , and possibly PAK5 , could mediate 

IAS 
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signaiirxg events originating from growth factors as well as 
from ligands thac scimulace G-procein-linked recepcors . 

PAK4 conserves a leucine (leu 44), that when mutated to 
a phenylalanine renders the kinase activity of human PAKl 
5 const itutively active, bypassing its cdc42 -binding 

requirement for activation (Brown J. et ai (1996) Current 
Biol. £:59S-605). PAK5 contains an isoleucme at the 
equivalent position. Therefore, the mechanism by which 
cdc42 potentially activates humian PAKl, FAK4 , and. possibly 

10 PAKE, may be very similar. 

PAK4 and PAK5 however, lack the PIX- binding motif, and 
consequently cdc42 -act ivacing GE?s other than PIX (fox- 
example pbl and Cool) must be responsible for the activation 
of these kinases. Alternatively, PAK4 and PAK5 m.ay be 

15 activated by another . GTPase , such as Racl which uses the 

Tiarr.l GE? for its activation to the GT?-bound state. 

PA:K4 and PAK5 also lack the PxxP motif responsible for 
the Nek- PAKl association. Between zhe PBD or cdc4 2 -binding 
N- terminal motifs and the putative GEF-binding regions, PAK4 

20 and PAK5 have long insertions (135 *and 12 3 am.ino acids for 

PAK4 and PAK5 , respectively) relative to PAKl. This region 
probably confers different binding characteristics to 
adaptor molecules and/or GEFs from, those exhibited by known 
mamm^alian PAKs . . - 

25 PAKs have been show^n to be upstream in pathways leading 

to activation of both the JNK (Bagrodia, S., et al . (1995) 
J. Biol. Chemi. 270: 22731-22737) and ERK kinase pathways 
(Brown, J., et al . (1996) . Curr Biol. 6:598-605) .. PAKl was 
shown to synergize v;ith ras in activation of- the ERK pathway 

30 through phosphorylation of MEKl (Frost, J. et al . vl997). 

EMBO J. 16:6425-5433). Our data shows that MEKl serves-as 
an in vitro substrate for PAK4 , suggesting a potential role 
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r.^r ?AK4 in the acciva-ion of che ERK pathway and 
mitogenesis . 

PAfC5 rr.ay also- have a mitcgenic role, and be linked to 
cancer, based on ics expression profile (elevated RNA and 
protein levels in a v/ide variety of tumor cell lines), its 
interaction with cdc42 via its PBD, and the abilicy of a 
kinase-dead allele (Lys350, 351 Ala) zo block ras 
transformation of NIH3T3 cells. Thus, a screen for small 
molecule inhibitors of PAK5 kinase activity may yield 
compounds with therapeutic potential for intervention in 
cancer derived from a wide variety of tissue tvpes . 

PAK4 and PAK5 may also play a role rn HIV pathogenesis 
as potential mediators of Nef signaling, since none of the 
known PAKs correspond to che PAK-like kinase shown to 
interact with, and be activated by, the HIV nef protein (Lu, 
X. et al. (1996) Current Biology 6:1677-1684) 

The 3' untranslated region of PAK4 contains a CA repeat 
that is prone to undergo expansion. CA dinucleocide reoeac 
instability has been associated with disease (Toren, M.Z. er 
al {1992} Am. J. Hematoi . 57: 148-152), and expansion of 
such repeat in the 3' untranslated regi-on of PAK4 could 
im:plicare this kinase in as yet unknoivn pathologies. 

C linical aool icat ions 

Human 5TLK 2 , ST LK3 , STI.K4 . STLKS . STTiKn . and STr.K7 , 
STLK3, STLK5, 3TLKS and STLF:7 , as well as other 
homologues of the STLK subfamily of STE2 0 protein kinases 
such as STLK4, may play an im.portanc role as m.ediators of 
the immune response. Thus, zhey are targecs for the 
developm.enr of specific sm.all molecule inhibitors to treac 
immunological diseases, including, but not limited' to, 
rheumatoid archritis, chronic inflammatory bowel diseases 
(e.g. Crohn's disease), chronic inf lamm:acory pelvic disease, 

/50 
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multiple sclerosis, asthma, osteoarthricis , psoriasis, 

atherosclerosis, rhinitis and autoimmunity, as well as in 
organ transplantation. Ocher diseases include 
cardiovascular diseases . 
5 The human STLKs may also play an important role in cell 

growth regulation. Thus, they are cargets for developing 
small m.olecule kinase inhibitors for the treatm.ent of cancer 
and metastases. STLKS miaps tc a chromtosomal region 
frequentlv am.olified in a variety of tumors including those 

10 from, non- small ceil lung cancer, breast cancer and 

peripheral nerve tumors. This suggests chac STLK5 could 
plav a. role in the development, maintenance., or progression 
of hum.an cum.ors . 

The potential role of human STLKs 2,3, and 4 in 

15 m.ediating oxidative scress strongly suggests chat drugs 

targeting these kinases could prove useful in the creacmenc 
of myocardial infarction, arrhythmia and other 
cardiomyopathies, stroke, renal failure, oxidative stress - 
relared neurodegenerative disorders such amyotrophic lateral 

20 sclerosis, Parkinson's disease and Leigh syndrome , a 

necrotizing mitochondrial encephalopathy, as v;ell. 

Human ZCl , ZC2 . ZC3 , and ZC4 

ZCl m.ay be a component of the CD2 8 - signaling pathway 

25 and therefore im.portant in T cell activation. As such, ZCl 

as v/ell as other 2C subfam.ily kinases, are targets for the 
development of specific smiall m:olecule inhibitors to treat 
imimunological diseases, including, but not limited to, 
rheumatoid arthritis, chronic inflammatory bov-el diseases 

30 (e.g. Crohn's disease), chronic inf lamm.atory pelvic disease, 

multiple sclerosis, asthmia, osteoarthritis, psoriasis, 
atherosclerosis, rhinitis and autoimmunity, as well as organ 
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" ransplancat ion . Other diseases include cardiovascular 

diseases . 

ZCl and ZC2 are also irrtplicated in cell grov/ch 
regulation. Thus, ZC subfamily kinases are cargens for 
5 developing small molecule inhibitors for the creacment of 

cancer and metastases. ZC2 maps zo a chromosomal region 
frequently amplified in a variety of tumors including those 
from non- small cell lung cancer, small cell lung cancer, and 
cervical cancer. This suggescs that ZC2 could play a role 
10 in zhe development,, m.aintenance , or progression of human 

tum.ors . 

The role of human ZCl, ZC2 , ZC3 , and ZC4 in rhe 
inf la-T.macory and stress - response pathv^ays, scrongly suggescs 
that drugs targeting chese kinases could have strong 

15 immunosuppressive actions. These drugs can prove valuable 

for che treatm.enr of rheum.acoid archritis, ar therosclerosis , 
autoimm.une disorders and organ - ransplancat ion among others. 

At least one very imtporranc class of immunosuppresant s , 
corticosteroids, functions by blocking SA?K activanion ac an 

20 as yet undefined site on this pathv/ay {Swantek, J.L. et al 

(1997) Mol. Cell. Biol. (1997) 5274-6282). Other 
immunosuppresive drugs like zhe pyridinyl imidazoles 
specifically target che p3S kinases (Kum.ar, S. et ai (1997) 
Biochem. Biophys. Res. Comm.un . 235: 533-528). Drug 

25 targeting of the MAPPC and p3S pa-hv/ays could lead to the 

development of novel immunosuppresant s , 



30 



Human SUI^U and GEK 

The potential role of chese novel 3TE2 0 - related protein 

.y sugg-i 

agents nhac specifically inhibit chese kinases could be 
useful for cancer and mecastases treatm.enc , 
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The close homology of human STLK5, GEK2 , SULiUl and 
SULu3 CO STE2G-subf anrdiy kinases involved in the stress and 
oxidative response pathway strongly suggests that drugs 
targeting these kinases rr^ay also be useful as 
immunosuppressants as well as to treat ischemic disorders. 

Human KHS2 

The role of human KHS2 in the inflammatory and stress - 
response pathways, strongly suggests that drugs targeting 
this and related kinases could have strong immunosuppressive 
actions. These drugs can 'prove valuable for the treatment 
of rheumatoid arthritis, artherosclerosis , autoimm-une 
disorders and organ transplantation among others. At least 
one very important class of immunosuppresant s , 

corticosteroids, functions by blocking SA?K activation at an 
as. yet undefined site on this pathway (Swantek, J.L. et al 
(1-997) Mol . Cell. Biol. (1997) 6274-5282).. Other 
immunosuppresi ve drugs like the pyridinyl imidazoles 
specifically target the p3S kinases (Kumar, S. et al (1997) 
Biochem. Biophys . Res. Com^m^un. 235: 533-523). .Drug 
targeting of the MA?K and p3 8 pathv;ays could lead to the 
development of novel immvunosuppressant s . 

Human PAK f a m.i 1 v 

PAK5 has a role in cancer based on its expression . . 
profile (elevated RNA and protein levels in wide' variety of 
tumor lines), its interaction with Cdc42 via its PBD, and 
the ability of the kinase-dead allele of PAK5 (Lys350, 
351Ala/ to block ras trans forma^t ion of NIH3T3 cells. Thus, 
a screen for small molecule inhibitors of PAKE kinase 
activity may yield compounds v/ith therapeutic potential for 
intervention in cancers and metastases derived from, a wide 
range of tissue types. 

133 
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?AK5 "aps "O a chrorr:03orrial region frequently amplified 

in a variety cf tumors including chose from non-s"all cell 
lung cancer, and small cell lung cancer. These findings 
suggest that PAK5 could play a role in the development, 
5 maintenance, or progression of human tumors and/or 

metastases . 

The role cf human PAK4 , and ?a:<5 in the inflammatory 
and scress - response pathv/ays also strongly suggests chac 
drugs ta^rgeting these kinases could have strong 

10 immunosuppressive accions. These drugs can prove valuable 

for the treatment of rheumatoid arthritis, artherosclerosis , 
autoimmune disorders and organ transplantation among others. 

At lease one very imporcan- class of imtmunosuppresancs , 
cort icosceroids , functions by blocking 3A?K ac"ivacion a- an 

15 as yet undefined site on chis pathway (Swantek, J.L. et ai 

(1997} Mol . Cell. Biol. (1997) 6274-5232). Other 
immunosuppresive drugs like the pyridinyl im.idacoles 
specifically targec the p38 kinases (Kum:ar, S. et a.1 (1997) 
Biochem. Biophys . Res. Commun . 235: 533-52S). Drug 

20 targeting of che MAPK and p3 3 pathv/ays could lead to the 

developm.ent cf novel immunosuppresant s . In addition, drugs 
targecing PAK4 or PAK5 could prove useful as 
im.munosuppresant s as -.veil as in AIDS treatmenc . 

25 ' VT 1 1 . Transgenic Anim.als . 

A variety of miothods are available for the production 
of transgenic animals associated with this invencion. DNA 
can be injected inco zhe pronucleus of a fertilized egg 
before fusion of the male and female pronuclei, or injected 

30 into the nucleus of an eTb>ryonic cell (e.g., the nucleus of 

a two-cell embryo) following the initiacion of cell division 
(Brinscer et al . , Proc . Nat. Acad. Sci. USA 82: 4433-4442, 
1985) . Embryos can be infected with viruses, esoeciallv 
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retroviruses, [rodified zo carry inorganic - ion recep-or 
nucleotide secruences of zhe invencion. 



Pluripctenc seem cells derived from che inner cell 



rr.ass 



of the embryo and stabilized in culcure can be manipulated 
5 in culture to incorporate nucleotide sequences of che 

invencion. A transgenic anirr.al can be produced from such 
cells through implantation into a blastocyst that is 
implanted into a foster mother and allov/ed to come to term. 
Animals suitable for transgenic experim^ents can be obta^ined 
10 from, standard commercial sources such as Charles River 

(Wilmington, MA), Taconic (Germiantov;n, NY) , Harlan - Sprague 
Dawley (Indianapolis, IN), etc. 

The procedures for manipulation of the rodent embryo 
and for microin;] ec t ion of DNA into the pronucleus of the 
15 zygote are v;ell known to those of ordinary skill in the art 

(Hogan et al . , supra). Microinjection procedures for fish, 
amphibian eggs and birds are detailed in Houdebine and 
Chourrout (Experientia 47: S97-9C5, 1991). Other procedures 
for introduction of DNA into tissues of animals are 
20 described in U.S. Patent No., 4,945,050 (Sandford et ai . , ' 

July 30 , 1990) . 

By way of example only, to prepare a transgenic m^ouse, 
female mice are induced to superovulate . Females are placed 
v/ith males, and the m.ated fem.ales are sacrificed by CO-, 
25 asphyxiation or cervical dislocation and embryos are 

recovered from excised oviducts. Surrounding cumulus cells 
are removed. Pronuclear emb-ryos are then washed and stored 
until the tim.e of miection. Randomly cycling adult female 
mice are paired with vasectomized m^ales. Recipient females 
30 are mated at. the same tim.e as donor females. Smbrvos then 

are transferred surgically. The procedure for generatina 
transgenic ra.ts is similar to that of mice {Hammier et al . , 
Cell 63:1099-1112, 1990). 

|3S 
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Metnods for the culcuring of embryonic seem (es) cells 
and the subsequent production of transgenic animals bv che 
introduccion of DNA into ES cells using methods such as 
eieccropora-ion, calcium phosphate/DNA precipitation and 
direct injeccion also are well known to those of ordinary- 
skill in the art (Teratocarcinomas and E-brvonic Scerr. Cel^s 
A Practical Approach, E.J. Robertson, ed . , lEL Press, 19S7). 

In cases involving random gene incegration, a clone 
containing che sequence (s) of the invention is cc- 
transfec-ed with a gene encoding resistance. Alternatively, 
the gene encoding neomycin resiscance is physically linked 
no the sequence (s) of the invention. Transfection ana 
isolation of desired clones are carried ouc bv anv ore -.f 
several methods well known co those of ordinary skill in the 
art (S.J. Robertson, supra). 

DNA molecules introduced into ES cells can also be 
integrated into the chromosom.e through the process of 
homologous recombination (Capecchi, Science 244: 1283-1292, 
1989) . Methods for positive seleccion of -he recc-binat ion 
20 evenc (i.e., neo resistance) and dual posicive-negacive 

selection (i.e., neo resistance and gancyclovir risiscance) 
and Che subsequent identification of the. desired clones by 
?CR have been described by Capecchi, supra and Jovner et al . 
(Nature 333: 153-15S, 1989), the- teachings of which are 
incorporared herein m their encirety including any 
drawings. The final phase of che procedure is to inject 
targeted ES cells into blastocysts and to transfer the 
blastocysts into oseudopregnant females. The resulting 
chimeric animals are bred and the offspring are analyzed bv 
Southern blotting to identify individuals that carry the 
transgene. Procedures for the production of non-rodent 
mammals and other animals have been discussed by others 
(Houdebme and Cnourrout , supra; Pursei et ai . , scxe: 
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Labora-ory, N.Y., 1989; Ausubel et al . ; Current Pronocols i 
Molecular Biology, Greene Publishing Associates and Wiley 
Interscience , 1989). Al cernat ively , reccuibinant 

nucleic acid molecules encoding procein sequences can be 
used as naked DNA or in a reconstituted sysrem e.g., lipo- 
somes or other lipid systems for delivery to carge* cells 

(e.g., Feigner et al , , Nature 337:537-8, 1989). Several 
other methods for the direct transfer of plasmid DNA mco 
cells exist for use in human gene therapy and involve 
targeting the DNA to receptors on cells by complexing irhe 
plasmid DNA. co proteins (Miller, supra) . 

In i~s simiplest form, gene transfer can be performed b 
simply injecting minuce amouncs of DNA inro the nucleus of 
cell, chrough a process of m.icroinn ect ion (Capecchi, Cell 
22:479-88, 1980) . Once recomibinant genes are mcrcduced 
into a cell, they can be recognized by the cell's normal 
mechanismis for transcription and Translation, and a gene 
produce will be expressed. Other methods have also been 
•attem.pted for incroducing DNA into larger numbers of cells. 

These methods include: transf ection, wherein DNA is 
precipitated with CaPO. and taken into cells by pinocytosis 
(Chen ec al . , Mol , Cell Biol. 7:2745-52, 19S7); 
eleccrcporat ion , . wherein cells are exposed to large voltage 
pulses to incroduce holes into the m.embrane (Chu et al . , 
Nucleic Acids Res. 15:1311-26, 1987); lipof eccion/liposome 
fusion, wherein DNA is packaged into lipophilic vesicles 
which fuse with a target cell (Feigner et al . , ?roc . Natl. 
Acad. Sci. USA. 34:7413-7417, 1987); and parcicle 
bombardm.ent using DNA bound to small projectiles (Yang et 
al., Proc. Natl.. Acad. Sci . 87:9568-9572, 1990). Another 
method for introducing DNA into cells is co couple the DNA 
to chemically modified proteins. 
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Ic has also been shovvn chat adenovirus proteins are 

capable cf destabilizing endosomes and enhancing the uotake 
of DNA into cells. The admixture of adenovirus to solutions 
containing DNA complexes, or the binding of DNA to 
polylysine covalently attached to adenovirus using protein 
crosslinking agents substantially itr.proves the uptake and 
expression of the recombinant gens {.Curiel et al . , An^ J 
Respir. Cell. Mol . Biol., 6:247-52, 1992). 

As used herein "gene transfer" means the process of 
introducing a foreign nucleic acid molecule into a cell. 
Gene transfer is com,monly performed to enable the expression 
or a particular product encoded by the gene. The product 
may include a protein, polypeptide, anti -sense DNA or .RMA, 
or enzymatically active RNA . Gene transfer can be oerformed 
in cultured ceils or by direct administration into animals. 

Generally gene transfer involves the process of nucleic 
acid contact with a target cell by non-specific or receptor 
mediated interactions, uptake of nucleic acid into the ceil 
through the n-^embrane or by endocytosis, and release of 
nucleic acid into the cytoplasm from t.he plasma m.embrane or 
endosome. Expression may require, in addition, movement of 
the nucleic acid into the nucleus of the cell and oinding to 
appropriate nuclear factors for transcription. 

As used herein "gene therapy" is. a form of gene 
transfer and is included within the definition of gene 
transfer as used herein and specifically refers to gene 
transfer to express a therapeutic product from a cell in 
vivo or in vizro. Gene transfer can oe performed e:-: vivo on 
cells which are then transplanted into a patient, or can be 
performed by direct administration of the nucleic acid or 
nucleic acid-protem complex into the patient. 

In another preferred embodim.ent , a vector havina 
nucleic acid sequences encoding a ST32 C - related kinase 
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244:1281-1268, 1989; and Simms ec al . , Bio/Technol cgy 6:179- 

133 , 19SS) . 

Thus, che invent: ion provides cransgenic, nonhuman 
rr:ammals containing a ti'ansgene encoding a kinase of the 
inventiion or a gene effecting the expression of the kinase. 
Such transgenic nonhuman mammals are particularly useful as 
an in vi\^o test system for studying the effects of 
incroduction of a kinase, or regulating the expression of a 
kinase (i.e., through the introduction of additional genes, 
ancisense nucleic acids, or ribozymes) . 

A "cransgenic anim.al" is an animal having cells that 
concain DNA which has been artificially inserced inco a 
cell, v.'hich DNA becom.es part of the genome of the animal 
which develops from than, cell . Preferred transgenic animials 
are primaces, mice, rats, cows, pigs, horses, goats, sheep, 
dogs and cats. The transgenic DMA may encode human STE20- 
relaced kinases. Native expression in an animal mtay be 
reduced by providing an amount of ant i -sense RNA, or DNA. 
effective to reduce expression of che receptor. 

IX. Gene Theraov 

STE20-related kinases or cheir genetic sequences will 
also be useful in gene therapy (reviewed in Miller, Nacure 
357:455-450, 1992). Miller states chat advances have 
resulted in practical approaches to human gene therapy that 
have demonstrated positive inicial resulcs. The basic 
science of gene therapy is described in Mulligan (Science 
260 : 925 - 531 , 1993 ) . 

In one preferred eirbodiment , an expression vector 
concainmg STE2 0 - related kinase coding sequence is inserted 
into cells, che cells are grown in I'iLro and then infused m 
large numb-ers into patients. In another preferred embodi- 
ment, a DNA. segmenc containing a promoter of choice (for 

130 
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example a strong prcmocer) is cransferred inco cells 

containing an endogenous gene encoding kinases of the 
invention in such a manner chac the promoter seqmenc 
enhances expression of che endogenous kinase oene (for 
5 example, the promoter segment is transferred co the cell 

such that it becomes direcrly linked zo che endogenous 
kinase gene) . 

The gene therapy may involve the use of an adenovirus 
containing kinase cDNA targeted to a rumor, systemic kinase 
10 increase by implantation of engineered cells, inieccion v;ich 

kinase -encoding virus, or injection of naked kinase DMA into 
appx-opriate tissues . 

Target cell populations may be modified by introducing 
altered form.s of one or more components of the protein 
15 complexes in order co m.odulate tuhe activity of such 

comiplexes. For example, by reducing or inhibiting a comolex 
component accivicy within cargec cells, an abnormal signal 
cransduccion event (s) leading "c a condicion miav^ be 
decreased, inhibited, or reversed. Deletion or mdssense 

20 mutants of a ccmponenc , thac retain che abilitv to interact 

v/ith other components of the protein complexes but cannot 
funccion in signal cransduccicn m-ay be used to inhibit: an 
abnormal, deleterious signal cransduccion event. 

Expression vectors derived from, viruses such as 

25 retroviruses, vaccinia virus, adenovirus, adeno - associated 

virus, herpes viruses, several RNA viruses, or bovine 
papillom.a virus, may be used for delivery of nucleotide 
sequences (e.g., cDNA) encoding recombinant kinase of the 
invention protein into the targeted cell population (e.g., 

30 tumor cells) . Methods v;hich are well known to chose skilled 

m the art can be used to construct recombinant viral 
vectors containing coding sequences (Mani^tis et al . , 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 

133 
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polypeptide is provided in -v/hich the -nucleic acid sequence 

is expressed only in specific tissue. Methods of achievir.g 
tissue - specif ic gene expression are set forth in 
Incernat ional Publication No. WO 93/09236, filed November 3, 
1992 and published May 13, 1993. 

In all of Lhe preceding vectors sez forth above, a 
further aspect of the invention is that the nucleic acid 
sequence contained in the vector may include additions, 
deletions or modifications to some or all of the sequence of 
the nucleic acid, as defined above. 

In another preferred err.bodim.ent, a method of -gene 
replacement is set forth. "Gene replacement" as used herein 
m.eans supplying a nucleic acid sequence ivhich is capable of 
being expressed in vivo in an anim.al and thereby providing 
or augment ing the function of an endogenous gene which is 
missing or defective in the animal . 



20 



30 
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X . Admj.nist rat ion of Substances 

Methods of determining the dosages of com.pounds to be 
admrnistered to a patient and modes of admiinis ter ing • • 

compounds to an organism, are disclosed in U.S. Application 
Serial No. 0S/7C2,282, filed August 23, 1996 and 
International patent publication numJoer WO 96/22976, 
published August 1 1996, both of which are incorporated 
n by reference in their entirety, including any 
ngs, figures, .or tables. Those skilled in the art v/ii; 
appreciate that such descriptions are applicable- to the 
present invention and can be easily adapted to it. 

The proper dosage depends on vario'us factors such as 
the type of disease being treated, the part icular ' composi - 
tion being used, and the size and physiological condition oi 
the' patient. Therapeutically effective doses for the 
compounds described herein can be estimated initially from- 

HI 
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cell culture and animal models. For example, a dose can be 
formulated in animal m.odels to achieve a circular ino 
concentration range chat initially takes into account -he 
IC;o as determined in cell culture assays. The animal m.odel 
data can be used to more accurately determine useful doses 
in humans . 

Plasma half-life and biodistribut ion of zhe drug a.nd 
m.etaboli-es in che plasma, tumors, and major organs can be 
also be determined to facilitate the selection of drucs most 
appropriate to inhibit a disorder. Such measurem.ents can be 
carried cut. For example, HPLC analysis can be performed cn 
the plasma of animals created with the drug and the location 
of radiolabeled compounds can be determ.ined using detection 
methods such as X-ray, CAT scan, and MRI . Compounds chat 
15 show potent inhibicory accivity in che screening assays, but 

have poor pharmacokinetic characteristics, can oe optimized 
by altering the chemical structure and retesting. in chis 
regard, compounds displaying good pharm.acokinet ic character- 
istics can be used as a model. . 

Toxicicy studies can also be carried out by measuring 
the blood cell com.position . For example, toxicity studies 
can be carried out in a suitable animal m.odel as follows: 1) 
the compound is administered to mice (an untreated control 
m.ouse should also be used) ; 2) blood samples are 
periodically obtained via the tail vein from one mouse m ■ 
each trea-ment group; and 3) che samples are analyzed for 
red and white blood cell counts, blood cell composition, and 
tne percent of lymphccyces versus polymiorphonuclear cells. 
A comparison of results for each dosing regime with the 
con-rcls indicates if coxicicy is cresenc. 

At the termination of each coxicity study, furcher 
studies can be carried out by sacrificing the animals 
(prererably, in accordance wich the American Veterinary 
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Medical Association guidelines Report of nhe American 
Veterinary Medical Assoc. Panel on Euthanasia . Journal of 
American Veterinary Medical Assoc., 202:229-249, 1993). 
Representative animals from each treacm.enc group can then be 
examined by gross necropsy for immediate evidence of 
metastasis, unusual illness, or toxicity. Gross 
abnormalities in tissue are noted, and tissues are examined 
histologically. Compounds causing a reduction in body 
weight or blood components are less prefei'red, as are 
compounds having an adverse effect on major organs. In 
general, the greater the 'adverse effect the less preferred 
the compound. 

For the treatment of cancers the expected daily dose of 
a hydrophobic pharm.aceut ical agent is between 1 to-5CC 
mg/day, preferably 1 to 250 mg/day, and most preferably 1 to 
50 mg/day. Drugs can be delivered less frequently provided 
plasma levels of the active moiety are sufficient to 
maintain therapeutic effectiveness , 

Plasma levels should reflect the potency of the drug. 
Generally, the more potent the compound the lower the plasma 
levels necessary to achieve efficacy. 



The examples below are not limiting and are mterely 
representative of various, aspects and features of the 
present invention. The examples below demonstrate the 
isolation and characterization of the STE20 - related kinases 
of the invention. 



EXAMPLES 
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EXAMPLE l : l3olacion of cDNAs gnrn d ina M^^r^^l^^-. 

3TE2C-relaned Pr otein Kit^h^^^ 
Materials and Methods 
Identification of novel rl nr^f^c. 

Total RNAs v/ere isolated using the Guanidine 
Saits/Phenol extraction protocol of Chomczynski and Sacchi 
{?. Chorr:CEy-nski and N. Sacchi, Anal. Biocherr;. 152, 156 
(1987)) from primary human tumors, normal and tumor cell 
lines, normal human tissues, and sorted human hematopoietic 
cells. These RNAs were used to generate s ingle - scranded 
cDNA using the Superscript Pream.plif icaticn System (GI3C0 
BRL, Gaithersburg, MD; Gerard, GF et al . (1939), FCCUS 11, 
65) under conditions recommended by the manufacturer. A 
typical reaction used 10 ,ug total RNA with 1.5 ug oliQo(dT),. 
xs in a reaction volume of 60 uL. The product v.'as treated 
with RNaseK and diluted to 100 with H,C . ■ For subsequent 
PGR amplification, 1-4 ,uL of this sscDNA was used m each 
reaction. 

Degenerate oligonucleotides were synthesized on an 
Applied Eiosystems 3 948 DNA synthesizer using established 
phosphoramidite chemistry, precipitated with ethanol and 
used unpurified for PGR. The sequence of som.e of the 
degenerate oligonucleotide primers and the amino acid motif 
they encode is as follows: 

TRKl 5' -CTGA.ATTCGC-MGCMTTYGGNAARGT-3 ' GAFGK^J (sense) 
TRK4 5' -GGTGGATCGYTCNGGNGGG.2VTCCA-3' WMPPE (antisense) 
ROS 1 5 ' - GGMTTYGGNGARGTNTAYG.ARGG - 3 ' AFGEVYEG ( sense ) 
CGK4b 5' -GCTGGATGCYTGKTGGM3WGATCGA-3' WMSPE (antisense) 
CGfC4 c 5 ' - GAGTT YGGNGARGTNTTY YTNGC - 3 ' E FGE VYEG (sense) 

These prim.ers v/ere derived from the sense and antisense 
strands of conserved motifs within the catalytic domain of 
several protein kinases. Degenerate nucleotide residue 
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designacicns are: N = A, C, G, or T; R = A orG; Y = c 

T/ K = A, C or T not G; D = A, G or T not: C; S = C or G; and 
W = A or T . 

PGR reactions were perfornn.ed using degenerace primers 
5 applied to multiple single -stranded cDNAs . The primers were 

added at a final concent racion of 5 uM each co a rndxture 
containing 10 mM TrisHGl, pH 8 . 3 , 50 mM KCl , 1.5 wM MgCl--, 
200 uM each deoxynucleoside triphosphate, 0.001% gelarin, 
1.5 U AmpliTaq DNA Polym.erase ( Perkin-Elmer/Cetus ) , and 1-4 

10 LiL cDNA. Following 3 min denaturation at 95 "^C, the cycling 

conditions were 94 'C for 30 s, 5 0 "^G for 1 mtin, and 72 ^G for 
1 min 4 5 s for 3 5 cycles. PGR fragmen::s magrating between 
300-350 bp were isolated from; 2% agarose gels using che 
GeneGlean Kit (BiolOl), and T-A cloned inco the pGRII veccor 

15 (Invitrogen Gorp . U.S.A.) a.ccording to the manuf accurer ' s 

prococol . 

Golcnies were selected for mini plasm.id DNA- 
preparations using Qiagen columns and che plasmid DNA was 
sequenced using a cycle sequencing dye - terminator ki~ v/ich 
20 AmipliTaq DNA ?olym;erase, FS (ABI, Foster Gicy, GA) . 

Sequencing reaction products were run on an A.BI Prism 3 77 
DNA Sequencer, and analyzed using the BLAST alignm.ent 
algorithm (Alcschul, S.F. ec al . , J.Mol.Biol. 215: 403-10). 

Additional PGR strategies were employed to connect 
various PGR fragm;ents or ESTs using exact or near exact 
oligonucleotide prim.ers as detailed in the results section 
for each cDNA. PGR conditions vyere as described above 
except the annealing temperatures were calculated for each 
oligo pair using the formula: Tm. = 4 (G-i-G) -f 2 {A+TJ , 
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volation of cDNA clones: 

Hum.an cDNA libraries were probed v/ith PGR or EST 
fragments corresponding to STE2 0 - related genes. Probes we: 
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'''P- labeled by randcrr. primrng and used ac 2xl0' cpm/mL 
follov/ing standard cechniques for library screening. Pre- 
hybridization (3 h) and hybridization (overnighc ) * were 
conducted ac 42 ^^C in 5X SSC, 5X Denhart ' s solucion, 2.5% 
dex-ran sulface, 50 mM Na,?0,/NaHPO, , pH 7.0, 50% forniamide 
wirh 100 mg/mL denatured salmon sper-m DNA . Stringenc washes 
were performed at 65 '"^C in 0 . IX SSC and- 0,1% 3DS . DNA 
sequencing was carried out on both scrands using a cycle 
sequencing dye - 1 erm.inat or kit with AmpliTaq DNA Polym.erase, 
FS (ABI, roster City, CA) . Sequencing reaction products 
v/ere run on an A.3I Prism 377 DNA Seauencer. 



Make^ene 5ioinf ormacics EST assemhl^-y- 

The EST reports were downloaded from, ncbi 
( v/ww. ncbi . nlm. nih . oov ) . Aft er uncompressing the files, the 
program >report2est' was scripted to extract the following 
information: 1) EST names, 2) GenBank Accession numbers, 3; 
GenBank gi nurri^ers , 4) Clone Id numbers, 5; the nucleotide 
sequences of the HSTs 6) the organisrr,, 7) the licrary name, 
20 8) the name of the lab, and 3; the inscitution. The output 

of ^report2esc' is a file in PASTA format wirh all of the- 
information listed above in the first line of each entrv 
except the sequence, which is listed in the second line of 
each entry. The resulting file is formatted for BLAST using 
'pressdb' (available as par- of the ncbi tool kit) . 

To build a gene or part of a gene from ESTs, the 
program 'makegene' was developed. Input co this program is 
a query sequence and the organism/species for which a gene 
is to be built. .An initial search of the formatted EST 
database described above is perform^ed using BLAST (blastn^- . 

Any results that contain warnings, such as polyA tails or 
other repeat .elements, are eliminated from future queries. 
The program ' blast_parse_repcrts ' was de-veloped cc extract 

W6? 
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the PASTA header line from the search resuics and zhe oucpuc 

is then, filcered to excracc only FASTA header lines for zhe 
desired species. 

The initial results, having been filtered for warnings 
5 and species, go into a loop in which searches against the 

database are repeated until no new ESTs are found. The looc 
consists of the following seeps: 1) when possible the names 
of both ends of the ESTs are extracted from the database by 
searching using the 'Clone Id' -field or che part of che 'EST 

10 narr.e ' field before "he .r or .s postscript, 2) any ESTs zhaz 

have been used as queries in previous loops a.re removed from 
the current query by the program 'subtract' , 3) the 
resulting list of ESTs is used to extract the sequences from 
zhe dacabase by the program batch_parse_f as t a , 4; EL.AST is 

15 run against the dacabase using each sequence, 5) .the oucput 

files from BLA.ST containing warnings are removed, 6} che 
■results are filcered by species, and 7) the loop is 
reentered if there were new ESTs found in the previous pass 
through the loop. 

20 The ESTs chosen by 'makegene' are used as inpuc for, the 

program. ' m:pd2_cluster ' {Hide, W,, Burke, J, and Davison,. D. 
U. of Houston, unpublished) which clusters overlapping 
sequences. The programts ^contig' (Kerlavage, T., TIGR, 
unpublished), 'gde2mulc' and 'gde2sing' (Smtith, S.W., et 

25 al . , CABIOS 10,. 671-675 (1994)), are used to make an 

alignment and consensus sequence of the overlapping ESTs. 

RESULTS 

cDMA cloning and characterization of STLK2 
30 The human STLK2 cDNA sequence is comioosed of tv-/o 

overlapping EST fragm^ents, AJ\191319 and W16504, tnat were 
identified using a Smd ch- Waterman search of the EST database 
with STL?:i fMST3 GE:Ar 024636) as a query. The com^plete 
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sequence of both clones was determined and used co gsnera-e 
the full-lengch human STL2 sequence. 

EST clone AA19131S concaxns a 1327 bp insert and an ORF 
of il4S bp (382 amine acids) . EST clone Wi6504 contains a 
24 74 bp insert (not including che poly-A tail) ana an ORF of 
637 bp (382 amino acids) . 

The full-length human STLK2 cDNA (SEQ ID NiO . 1) is 3253 
bp long. .aA191315 spans positions 1-1327 and W15504 
positions 743-3216. The overlap between these cwo clones 
exhibits 100% sequence identity. The human STLK2 cDNA 
constains a 1243 bp ORF flanked by a 131 bp 5' UTR (1-181) 
and a 17S4 bp 3' UTR (1433-3215) chat is followed by a 52 
nucleotide polyadenylaced region. A polyadenylat ion signal 
(AATA-AA) IS found an positions 3193-319S. The sequence 
flanking -he first ATG conforms zo che Kozak consensus for 
an ini-iacing m.ethionine , . and is believed to be the 
translacional start sice for STLK2 . Furthermore, human 
STLK2, and the related SOK-1 and MST3 proceins, conserve che 
amano acid sequence immediately following this presumed 
initiacing methionine. 

Several EST fragments span che com.plete STLK2 sequence 
wich A_A1S1319 a- the 5' end and W16504 at the 3' end. 

All searches against the public nucleic. acid database 
(NRN) and protein dacabase (NRP) were conducted using the 
Smith-Waterman gap alignment program ((Smith, TF and 
Wacerman, MS (1981) J. Mol . Biol, 147, 195-197).) with the 
PA^ilOO m.atrix and gap open and extension penalties of 14:1, 
respectively . 

CDNA Cloning and Cha ^.g^ c eri zat ^ on of STt,k- 

A mam.m:alian STLK3 clone, 135-31-19, was first 
idencified from a PCR screen with the degenerace oligos, 
TRKl and TRK4 , applied to a sscDNA generated from adulc rat 
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brain subscantia nigra. Sequence analysis cf zhe 4zl bp 
inserc indicated that it represented a novel merrJ^er of the 
STE2G-3ubf amily of STKs . 

A Srrdch- Waterman search of che EST database wich the 
rac STLK3 fragment and human STLKl (MST3 G3:AF02 4 63 6) as ■ 
queries identified several overlapping ESTs spanning mosc of 
the human STLK3 cDNA sequence. A Makegene analysis 
generated a 3037 bp contig from approximately 44 EST 
sequences. Since the 3' ESTs were not commercially 
available, a pair of prim^ers ( 5 ' -CA.CAGAAJ\CGGTCAGATTCAC- 3 ' 
and 5 ' -GATCAGGGTGACATCAAGGGAC- 3 ' ) v/ere derived from this 
region to generate PGR clone 3R21-2C-6 from human fecai 
liver sscDNA. This clone and EST A-^27S967 were fully 
sequenced cc. generate the full-length STLK2 cDNA. sequence. 

AA27S967 is a 337 bp EST isolated by the IMAGE 
consortium from cDNA made from GD204-/IgD- germ.inal center B 
cells sorted from human tonsillar cells. 

PGR clone 3R21-20-6 was isolated from human fetal 
sscDNA and contains a 1116 bp insert, including a 10S6 bp 
ORE encoding the 362 G-terminal amino acids of STLK3 . 

The full-length hum.an STLK3 cDNA (SEQ ID NO. 2) is 3 03 0 
bp long. AA27S9d7 spans positions 1-S14 and 3R21-20-O spans 
positions 464-1579. The overlap betv/een these two clones 
exhibits 100% sequence identity. The remaining 1452 bp of 
3' UTR is derived from, an assembly of multiple unconfirmed - 
EST fragments. 

The near full-length human STLK3 cDNA (SEQ ID NO. 2) is 
3030 bp long and consists of a 154S bp ORE flanked by a 1476 
bp 3' UTR (1550-3025) and a 5 nucleotide polyadenylated 
region. A polyadenylat ion signal (A^z^TAAA) begins at 
position 3004. Since the coding reqion is open throughout 
the 5' extent of this sequence, this is apparentlv a partial 
cDNA clone lacking the N-term;inal start methionine. Six 
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copies of a "GGCCCC" repeat were observed in positions 21- 

57. Five independent ESTs (AA15033S, AA2S6S79, AA251o79, 
AA252004, AA27S957) showed zhe same repeac, suggesting chaz 
chis sequence may ^ be an integral region of the hurrian STLK3 
gene. Stronger evidence for this being the case is providea 
by the sequence of the murine orthclogue of STLK3 
represented by a 875 bp EST W20737. 

Multiple EST fragnnents span the complece STLK3 sequence 
with AA278967 at che 5' end and AAo2S477 and others at the 
3 ' end . 

cDNA Cloning and Characneri za t ion of 3TLK4 

The human 5TLK4 cDMA sequence is composed of cwo 
overlapping EST fragments, AA297759 and AA100434, chac were 
idencified using a Smtith- Waterman search of the EST dacabase 
with STLKl (MST3 GB:AF024 63 6) as a query.. The complete 
sequence of boch clones was determined and" used co generate 
the near full-length hum^an STLK4 sequence. 

AA1004S4 is an IMAGE consortiuTi cDNA clone isolated 
from the T-84 colonic epithelium cell line. It has an 
insert of 3694 bp and a coding region of 1146 bp (382 am.ino 
acids) . A Smith- Waterman sequence alignment against the NRN 
database shov/ed this EST to be 71.4% identical to the human 
STE20-like kinase (G3:X99325) . 

Wlo504 is an IMAGE consortium clone isolated from, a 
hum.an fetal heart cDNA library. It has an insert length of 
2474 bp (not including Che poly-A tail) and a coding region 
of 5S7 bp (22 9 amino acids) . A Smi th- Wa term.an seouence 
alignment of W15504 against the NRN database shov/ed this EST 
to be 5 9.2% identical to tne human STS20-like kinase 
(GB:X99325) . 

The full-length humian STLK2 cDNA (SEQ ID NO. 1) is 3253 
bp long. AJ^.191319 spans positions 1-1327, and W15504 

ISO 
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positions 743-3216. The overlap between chese cv;o clones is 

585 bp long v/ith 100% sequence identity. 

AA10G484 is an IMAGE consortium cDNA clone isolated 
from the T-S4 colonic epithelium, cell line. AA1004S4 covers 
5 the bulk of Human STLK4 with its 3 6 94 bp, which spans 

positions 146-3839 of SEQ ID NO : 3 . A second EST, AA297759, 
isolated from a Jurkat T cell cDNA. library, spans positions 
1-271 of the human STLK4 contig. The cwo ESTs overlap over 
a 126 bp scretch that has only one nucleotide discrepancy az 

10 • posicion 149 (G in AA297759 and T in AA1C04S4) . A T at this 
position was chosen for the SEQ ID MO : 3 based on sequence 
data generated from A100484. The 5' 145 bp of human STLK4 
contains three sequencing ambiguities (N's in SEQ ID NO : 3 ; 
arising from sequence errors in the GenBank encry for 

15 AA297759. Three amiino acid sequence ambiguities in the N- 

terminus of hum^an STLK4 are present also in SEQ ID NO : 7 as a 
consequence of the sequence inaccuracies from che EST entry. 

The coding region of human STLK4 is 124 2 bp long (2- 
20 1243), capable of encoding a 414 ammo acid polypeptide,, and 

is followed by a 2596 nucleoside 3' UTR (1244-3839). Human 
STLK4 ends in a polyadenylaced screech char has 16 adenines 
(3840-3857) . A pclyadenylat ion signal {AATAJ^J^.} is found 
between positions 3822-3827. Targeted-?CR cloning 
25 identified one rat orthologue of human STLK4 , clone 135-31- 

IS.. In addition, one m:urine orthologue of human STLK4 was 
recognized m the EST database as AA1174B3. . None of these 
orthologues add additional M-cermdnal sequence co the human 
STLK4 . 

30 The near full -length human STLK4 cDNA (SEQ ID NO. 2; is 

3 8 57 bp long and consists of a 124 2 bp OR? flanked by a 2 5 96 
bp 3' UTR (1244-3839) and an 18 nucleotide polyadenylaced 
region. Pclyadenylat ion signals (AJ\TAAAJ begin ac posicions 
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2131 and 3322. Since the coding region xs open chroughcuc 

the 5' extent of this sequence, this is apparently a parcial 
cDNA clone lacking che M-terminal start methionine. A near 
full-length murine STLK4 cDNA is represented in the 1773 bp 
5 EST AA.11743S. It extends an addinicnal 21 nucleotides 5' of 

the human STLK4 consensus, buc since its coding region is 
open chroughoun the 5' extent of the sequence, this is also 
probably a partial cDMA. clone lacking the N-terininal scart 
me uhionine . 

10 Several EST fragments span the complete STLK3 sequence 

v/iuh AA2 97759 at the 5' end and AA1004S4 and others at the 
3' end. 

cDNA Clonmo and Characterization of STLK5 
15 The human STLK5 cDNA. sequence is composed of four 

overlapping sequences, AI413298, 2R96-13-i, 3R25-45-3 and 
R466S5. A human STLK5 clone, F07734; was firs- identified 
using a Smith- Waterman search of nhe EST database with 
SPS_sc (U33057) as a query. 
20 AI4 1S2 98 is an IMA.GE consortium cDNA clone with an 8 95 

bp insert . 

?CR clone 2R96-13-1 was isolated from human brain 
sscDNA using prim.ers 5 ' - CTCATCTGTACACACTTCATGG and 5'- 
GATTCCCACACTGTAGATGTC derived from. F07734. 2R96-13-1 
25 contains a 330 bp inser" and an ORE of 33 0 bp (110 amino 

acids) . 

EST clone R46c85 was identified using a Smith- Waterman 
search of the EST database wich the G-cerminus of SPS_sc 
(GE:U33057) as query. Sequence: analysis of zhe 1047 bp 
30 inserc identified this EST to contain an ORE of 285 bp (95 

am.ino acids) encoding the G-termiinus of hum.an STLK5 . 

PGR clone 3R25-45-3 was isolated from human fecal brain 
sscDNA using primers 5'- GGGGGTCGAGTAGATGGAGGAGA.T and 5'- 
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CAACGAA.2^CTAACACAGCAT;^J^.GG derived frcm 2R96-13-1 and R466S5, 
respectively. ■3R25-45 -3 concams a 330 bp insert: and an OR? 
of 750 bp (250 amine acids) . 

The full-lengrh human 3TLK5 cDNA (SSQ ID NO: 96) is 2110 
bp long and consists of a 1119 bp ORF flanked by a 229 bp 5' 
UTR and a 762 bp 3' UTR . The sequence flanking the first 
ATG conforms to the Kozak consensus {supra) for an 
initiating methionine, and is believed to be the 
transiatipnal start site for STLK5 . 

Several EST fragments span the complete STLK5 sequence 
with AA297059 and r07734' at the 5' end and R466S6 and rC3423 
and others at the 3' end. 

STLK5 displays a 100% manch over a 41 bp stretch 
(position 2-42, SEC ID NO. 97) to a human CpG island repeat 
(Z61277) . 



cDNiA GlonincT and Characterization o f STLK6 

Human STLK6 was first identified in the translated EST 
database (AA219667) as a novel serine threonine . kinase . 

20 The partial human 3TLK5 cDNA (SEQ ID K^O:93) is 2,001 bp 

long and- consists of a 1,254 bp ORF flanked by a 75 bp 5' 
UTR and a 673 bp 3' UTR, The sequence flanking the first 
ATG conforms to the Kozak consensus (Kozak, M., Nucleic 
Acids Res. 15, 8125-8148 (1987)) for an initiating 

25 methionine, and is believed to be the translat lonal start 

site for STLK6 . 

At the tim.e of filing, inventors believe that 3TLK6 
does not have any significant mvatch m the nucleic acid 
database . 



30 



cDNA Cloning and Characterization of STLK7 

Hum.an STLK7 was first identified in the translated EST 
database (AA938954) as a novel serine threonine kinase. The 

(2)3 
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original clone v/as noc available through public sources, so 
a PGR fragment: arriplified from the sequence of AAS83S54 
vielded. 5R54-21-2. 

The partial human STLK7 cDNA (SEQ ID NO: 100) is 311 bp- 
long and consiscs of a 309 bp ORP". Since che coding region 
is ocen chroughouc the 5' and 3' excenc of chis sequence, 
chis appears to be a partial cDNA clone lacking che N- 
terminal start methionine and C- terminal stop codon. 

STLK7 shares S0% sequence identity to hum.an SFAK 
(AFC99935) over a 167 bp region and 50% nucleotide sequence 
ident i 



to SLTK7 (SEQ ID NO. 101) over 391 nucl-^ccides. 



cDNA Cloning and Characterizatio n of ZCl 

The human ZCl cDNA sequence is composed of two 
15 overlapping ?CR clones, 3R25-24-2 and R65-12-2. 

A humian ZCl clone, 125-33-5, was first identified from 
a PCR screen with degenerate oligos, TRKl and TRK4 , applied 
CO sscDNA generated from; hum.an small airway epithelial cells 
(Cloncech) . Sequence analysis of nhe 503 bp insert 
20 identified a 501 bp ORF (167 amino acids) with che pocential 

to encode a novel human STK relaced co the C. elegans 
ZC504.4 gene product. 

PCR clone 3R25-24-2 v/as isolated from human SN319 
glioolastoma sscDNA us mg ■ primers 5 ' - ATGGCGAACGACTCTCCCGCGAA 
25 and 5 ' -ACACCAAAATCAACAAGTTTCACCTC derived from the N- 

term.inus of a murine orthclogue of ZCl (NIK, GB:USS9S4) and 
the original humian ZCl clone 125-33-5, respecc ively . 3R25- 
24-2 contains a 527 bp mserc and an OR? of 519 bp (173 
ammo acids) . 

30 PCR clone Rc5-12-2 was isolated as follows: A Smich- 

Watermian search of the EST database with the C. elegans 
ZC504.4 gene (GB:Z50029) as a query identified a human EST 
{V?31656"i whose ORF is related to the C. elegans gene and 
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.^rninates an identical residue (Trp) . A primer was 
designed 3' .o this SCop codon ( 5 ' - AGTTACAAGGAATTCCAAGTTCT ^ 
and used in a PGR reaction with a primer derived frorr. tne 
■ original' human ZCl clone 125-33-5 (5'- 
ATg1aGAGGAAGA.aJ^TCAAA.CTG) using sscDNA frorr. human SK319 
glioblascoma as a template. PGR clone RS5-12-2 was 
idencified and was found co contain a 3611 bp mserc with a 
3534 bp ORF encoding the C-terminal portion of human ZCi 

(1173 amino acids) . 

The full-lengch hum.an ZCl cDNA (SEQ ID NO. 9) is 3798 
bp long. Clone 3R25-24-2 spans posicions 1-527, and clone 
r'65-12-2 spans positions 188-3798. The overlap between 
■ chese two clones exhibits 100% sequence identity. The human 
ZCl contains a 3717 bp ORF (17-3723) flanked by a 5 bp 5' 
UTR and a 75 bp (3724-3798) 2' UTR . No polyadenyiation 
signal (AATAA.A) or polyadenylated region are present m the 
3 'UTR. The sequence flanking the first ATG conforms co the 
Kozak consensus for an initiating methionine, and is 
believed to be the translational start site for numan ZCl. 

Multiple EST fragments (W81656) m.atch the 3' end of the 
hum.an ZCl gene, but at the time of filing, the inventors 
believe that none exist in C-enBank or the EST database that 
match its 5' end. 

."DivA nonino .=ind Char -.^nr.^^rizat- i on of ZC2 

The hum.an ZC2 cDNA sequence is com.posed of four 
• overlapping PGR clones, G75-31-17, R55 - 24 - 5 , ■ 2R2S - 8 - 1 , and 
R99-5-10. 

A human ZC2 clone, G75-31-17, was first identified from 
a FCR screen with degenerate oligos, ROSl 
GCMTTYGGNGARGTNT.AYGARGG! and CCK4b 5 ' - 

GCTGGATCCYTCNGGNSWCATCCA) , applied to sscDMA generated from 
the hum.an HLT333 prim.ary non-sm.all cell lung cancer tissue. 

15b 
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ceqij^rice analvsis of che 492 bp insert idenrifiea a 492 ORr 

,c encode a ncvel hu"=n 
,o the C. elegans ZC504.4 gene produc- . 
?CR clone R59-6-10 was isolated as follov/s: A Sniiih- 
5 VJanerman search of the EST database wich C. elegans ZC504.4 

gene {GE:Z50 02 9) as a query idencified zvj'd overlapping human 
EST f ragmen- s (AA115844 and R51245) whose ORFs v/ere related 
to che C. elegans gene and terminate in an idencical residue 
(Tro) . A orimer was designed 3 ' to che scop codon round in 
10 R51245 ( 5 ' -AGATGGACTGTACTGGGAGG) and used in a PGR reaction 

wich a primer derived from AA115S44 (5'- 

A.CTTTGTGCAGCTCTGTGGG) using hum.an fetal brain sscDKA. as a 
• template.. PGR .clone R9976-IO was identified and was found 
CO contain a 1095 bp insert with a 93 0 bp ORF encoding the 
15 G- terminal portion of hum.an ZC2 (310 amino acids) , 

PGR clone R65-24-6 v;as isolated frd-m human HT29 colon 
cancer cell line sscDNA using prim.ers 5'- 

AAGGTTA.TGGATGTCAGAGGG and 5 ' -AGATGGACTGTACTGGGAGG derived 
from G75-31-17 and R51245, respectively. The 3' prim.er used 

20 in this PGR reaction masprimed between positions 1634-1653 

of this gene leading to the formiacion of a truncaced 
product, R65-24-6 contains a 1593 bp insert and an ORF of 
159 3 bp (531 amdno acids) . 

PGR clone 2R28-S-1 was isolaced from hum.an colon cancer 

25 cell line HT29 sscDNA using primers 5 ' - CTCACAAGGTTGCCAACAGG 

and 5 ' -AGTCCCCA.CCAGAAGGTTTAC derived from R65-24-6 and R99- 
6-10, respectively. 2R2o-3-l ccncains a 1538 bp insert and 
an OF.F of 153 5 bp (512 ammo acids) . 

The parcial 'human ZC2 cDNA (SEQ ID MO. 10) is 4055 bp 

30 long. Clone G75-31-17 spans pes ic ions 1-4 92, c' -^ne R55-24-6 

spans posicions 58-1650, clone 2R23-S-1 spans p ."iicions 
1466-3003 and clone R99-6-10 spans positions 2961-4055. The 
overlaoino reaions betv/een these clones exhibic 100% 
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sequence idencicy except for a single guanine (G7d-21-17; co 
adenosine (R65-24-6) mismacch at position 280 resulting in a 
Glu to Lys change. Based on the presence of an acidic 
residue in this position in human ZCl and ZC3 and C. elegans 
5 ZC504.4, the sequence encoding the Glu is probably correct. 

The human ZC2 gene contains a 3891 bp OR? (1-3851) flanked 
.bv 164 bp (3892-4055) 3' UTR . No polyadeny lac ion signal 
{AKTI^A?.) or polyadenylaced region is present in the 3 ' UTR . 

Multiple EST fragments (R51245) match the 3' end of the 
10 human ZC2 gene, but at the time of filing, the inventors 

believe that none exist in GenBank or the EST database that 
match its 5' end. 

cDNA Cloning and Characterization of ZC3 
15 The human ZC3 cDNA sequence is composed of. four 

overlapping PCR clones, G75-30-30, 3R33-5-3, 3R19-17-6, and 
R99-43-11. 

A human ZC3 clone, G75-30-30, was firsz identified from 
a PCR screen with degenerate oligos, ROSl and CCK4b, applied 
20 to sscDNA generated from a human HLT370 primary non- small 

cell lung cancer tissue. Sequence analysis of the 492 bp 
insert identified a 492 ORF (164 amino acids) with the 
potential co" encode a novel human STK related to the C. 
elegcins ZC504.4 gene product. 
25 PCR clone R99-43-11 was isolated as follows: A Sm.ith- 

Waterman search of' the EST database with the C. elegans 
ZC504.4 gene (C-B:Z50029) as a query identified a human EST 
(R54563) whose ORF is related to the C. elegans gene and 
terminates in an identical residue (Trp) . A primer was 
30 designed 3' to the stop codon found m R54563 (5'- 

TC.AGGGGTCAGAC-GTCACG) and used in a FCR reaction wich a 
crim.er derived from: the 5' end of R54563 (5'- 

AACCCTACC.ACAAATTC : ■ using sscDN.a frcmi hum.an fetal cram 

151 
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as a cerr.piace. PCR clone R99-43-11 vjas identified and v/as 
found to contain a 719 bp insert with a 564 bp OP.F encoding 
che C-cerniinal portion of human ZC3 (13S arr.ino acids). 

PCR clone 3R19-17-6 vyas isolated from hurrian A549 luno 
5 cancer cell line sscDNA using primers 5'- 

CCCCCGGGAAACGATGACCA and 5 ' -AGCCGCTGCCCCTCCTCTACTGT derived 
from G75-30-30 and R99-43-11, respectively. The 3' primer 
used in this PCR reaction misprim.ed leading to the formiatior 
of a cruncated product. 3R19-17-5 contains a 1172 bp inser: 
10 and an ORF of 1170 bp (390 amino acids) . 

PCR clone 3R33-5-3 was isolated from humian A.54 9 lung 
cancer cell line sscDNA using primers 5'- 

ACCGCAACATCGCCACCTACTAC and 5 ' - CTCGACGTCGTGGACCACC derived 
from G75-30-30 and 3R19-17-6, respecc ively . 3R33-5-3 
15 contains a 2465 bp insert and an ORF of 2453 bp (321 amdno 

acids) . 

The full-length human ZC3 cDNA (SEQ ID NO. 11) is 4133 
bp long. Clone Q7 5-30-30 spans positions 1-433, clone 3R3 3- 
5-3 spans posit ions . 1 34 - 2 5 98 , clone 3R19-17-6 spa.ns 

20 positions 2356-3512 and clone R99-43-11 spans pcsicions 

3415-4133. The overlaps between chese clones exhibic 100% 
sequence idencity. The human ZC3 gene coniiains a 3 97S bp 
ORF {1-3 973} flanked by a 152 bp 3' UTR (397 9-4133). No 
pclyadenyla- ion signal (AATAAJ^.) or polyadenylated region is 

25 present m che 3 ' UTR. . 

Multiple EST fragments (R54553) match che 3 ' end of che 
human ZC3 gene, but at: the time, of filing, the mvencors 



30 



match ICS b' end. 
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r^T^MZ. Cloai ^^ ^r-.^- Ch? ^ r i za M on of ZC4 

The human ZC4 cDNA sequence, represented by PCk 
fraarr^ent 3R25-27-1, was f.rsc idencified in che human 
genomic cosmid 82oll {GE:ZS33350) containing excn sequences 
thac displayed strong homology to che ZC504.4 C. elegans 
Qene . 

PGR Clone 3R25-27-1 was isolated from hum.an fetal liver 
sscDNA and primers 5 ' -C.A.ATGTTA^XCCACTCTATGTCTC and 5'- 
AGTTTGCCGATGTTTTTCTTTTC derived from a potential ORF 
(cosicions 25729-25852) from the 82J11 .cosmid and from an 
EST (R98571) encoding the C-terminus of the human ZC4 gene, 
resoecuively . 

" The- partial human ZG4 cDNA (SEQ ID N0.12) is 1459 bp 
long and consists of a 1047 bp ORF (2-1043) flanked by a 411 
bo (1049-1459) ^'UTR region. No polyadenylation signal 
(A_ATAAA) or polyadenylated region is present in the 3 ' UTR . 

The N-cermmal coding sequence for ZC4_h was expended 
by building a contiguous DNA sequence of 233,137 bp 
containing ZS385G and four other sequences: CUS4B10 and 
CU230B10 (from the Sanger Human Genome Sequencing Project, 
v..^ p■//www■sa ^^-- ^^.uk/HGP/; and Z97356 and Z69734 

(available from che National Instituce for Eiocechnoiogy 

Inf ormat ion , 

H^ ■ / /WWW ■ n ^H. .1 ■ nih ■ on^WP^^r :^ez /p.^r-l -^.-r ^ de . hcml ) . The 
position of each, sequence m the contig is represented in 
the table below. 



i Accession ' 


: zt: ■ 

i^engt n 


Start 


End i 


1 CU84B1C 


4 3 2 7 3 


G 


4 3 2 7 3 i 


i Z97356 


2184 3 


4 3171 


! 65C18 


■ Z69734 


37077 


; 63073 


! 100149 j 


i CU230310 


11S41 


! S 8 4 1 6 


\ 100256 


ZS3S50 


13 2 9 31 


1 10015b 


i .233137 



30 



Sequences in ZC4 genom.ic contig. 
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The 233,137 bp ccncig v;as analyzed for exons usmc che 
programs FGENES 1.5 and FGENE3H, human gene scruccure 
prediction software available from The Sanger Cencre 
( htto : / /genomic . sanaer . ac . uk/af / of . shtml ) . 

The resulting human ZC4 coding sequence (SEQ IB MO: 104) 
is 3,5 31 bp long (excluding the stop codon) and encodes for 
a STE20 kinase of 1227 am.ino acids. 



10 The human rCHS2 cDNA sequence is composed or tour 

overlapping clones, 3R25-51-2, 3R16-34-2, 3R16-31-2, and 
T79916. 

A human KHS2 clone, AA250S55, was first identified 
using a Smith- Waterman search of the EST dacabase with KHSl 
15 (GE:U77129) as a query. Sequence analysis of the 1112 bp 

insert identified a 613 bp ORE (206 amino acids) v;ith the 
potential to encode a novel STK related to the human KHSl 
gene product. Using AA250355 as a query, a second EST 
(AA446022) v;as found whose sequence was shown to contain the 
20 initiator mLethionine for humtan KHS2 based on a com.parison 

with PCHSl . 

?CR clone 3R25-51-2 was isolated from hum.an tescicular 
cancer sscDMA using primers 5 ' -CCGCCATGAACCCCGGCTT and 5'- 
CGATTGCCAAAGACCGTGTCA derived from. AA446022 and AA250855, 

25 respectively. 3R25-51-2 contains an 850 bp insert and an 

ORE of 849 bp (2S3 amino acids) . 

EST clone, T79916, was identified using a Smiith- 
VJatermian search of the EST database with che C-cermtinus of 
KHSl (GB:U77129) as a query. Sequence analysis of t:he 2107 

30 bp insert identified this EST to contain an ORF of 345 bp 

(115 amiino acids disrupted by a single snop codon) encoding 
the G-cermtinus of hum.an KHS2 , followed by 1762 bp 3 ' UTR . 



{^0 
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PGR clone 3R16-34-2 was isolated from human ces-is 
sscDNA using prioiers 5 ' -AC-.^^.GTTGCAGCTGTTGAGAGGA and 5'- 
TATGGCCCGTGTAAGGATTTC derived from AA250885 and T79516, 
respeccively. 3R16-34-2 contains an 1516 bp insert and an 
5 ORF of 1128 bp (376 amino acids) . 

PC? clone 3R15-31-2 was isolated from, normal human 
colon SSCDNA using primers 5'- GTGCCAGAAGTGTTGTGTTGT.^-^. and 
5' -TATGGCCCC-TGTAA.GGATTTC derived from EST T79916. 3R15-31-2 
contains a 723 bp insert and an ORF of 669 bp (223 ammo 
10 acids) . This clone lacked the stop codon present within EST 

T79916 (postion 2662 in the KKS2 sequence) . 

The full-length human KHS2 cDNA (SEQ ID NO. 17) is 4023 
bp long. •■ ■Clone' 3R25- 51 -2. spans positions 1-855, clone 
AA250S85 spans positions 336-923, clone 3R16-34-2 spans 
15- positions 545-2051, and clone T79916 spans positions 1917- 

4023. The overlaping regions between these clones exhibit 
■ 100% sequence identity, except for 4 nucleotide differences, 
two of which are silent, a third corrects the internal stop 
codon at position 2662, and the fourth at position 247 (T to 
20 C change) results in a Pro to Leu change. The human KHS2 

cDNA contains a 2682 bp ORF (6-2687) flanked by a 5 bp (1-5) 
5'UTR and a 1336 bp (2688-4023) 3' UTR . A potential 
polvadenylation signal (AATAAA) is found at positions 4 008- 
4013. No polyadenylated region is present in the 3'UTR. The 
25 sequence flanking the first ATG is in a poor context for 

translational initiation, however, a 134 bp. S'UTR sequence 
from EST A_A445022 did not reveal any additional ATG's and 
displayed two in-frame stop codons 5' to the putative start 
ATG for hum^an KHS2 . 
30 Multiple EST fragments match the 5 ' end , ( AA.445022 ) as 

v;ell as the 3' end (R37625) of the hum.an PCHS2 gene. 
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cDMA Cicninc; and Characterization cf SULUl 

The human SULUl cDNA sequence is composed of three 
overlapping clones, N40091, 2R90-1-1 and R90907. 

A human SULUl clone, N40091/ v/as first identified using 
5 a Smich-Wacerman search of the EST database v/ith che C. 

elegans SULu gene {GB: U32275) as a query. Sequence 
analysis of the 1321 bp insert identified a 905 bp ORF (302 
amino acids) with che potential to encode a novel human STK 
related co the C. elegans SULU gene product. 

10 EST clone R90907 was firsc identified using a Smith- 

Vvaterm.an search of che EST database wich the 3 ' end of the 
C. elegans SULU gene (GB: U32275) as a query. Sequence 
analysis -of the 1647 bp insert identified a 578 bp CRF (192 
amino acids) v/ith che potencial to encode che C-cerminus cf 

15 the human SULUl gene product, 

PGR clone 2R90-1-1 was isolated from hum.an HT2 9 colon 
cancer cell sscDNA using primers 5 ' - TATTGAATTGGCGGAACGGAAG 
and 5'- TTGTTTTGTGGTCATTCTTTGGAG derived from. N40091 and 
R90907, respectively. 2R90-1-1 contains a 1625 bp insert 

20 and an ORF of 1623 bp (541 amino acids) . 

The full-length human SULUl cDNA (SEQ ID NO. 19) is 4177 
bp long Clone N40091 spans positions 1-1321, clone 2R90-1-1 
spans positions 1048-2671, and clone R90907 spans positions 
253 1-4177. The overlaping regions between these clones 

25 exhibit 100% sequence identity. The hum.an SULUl cDNA 

contains a 2694 bp ORF (415-3108) flanked by a 414 bp (1- 
414) 5'UTR and a 1069 bp (3109-4177) 3' UTR followed by a 19 
nucleotide polydenylaced region. A potential 

polyadenylat ion signal (AATAAA) is found ac positions 4164- 
30 4169. The sequence flanking the first ATG conf c/'-'^.s to the 

Kozak consensus for an init lacing m.ethionine, ana is 
believed to be the cranslat ional start sice for human SULUl. 
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MulciDie EST fragments iP.acch the 5 ' end (N27i = 3) as we.. 



as tne 3 enc 



(R90903) of the human 3ULU1 gene. 



^n^a nonina ?r^H rharac r ^^'^^ ^a- ion of M'lrine 5ULU3 
5 The rr.urme SULU3 cDNA sequence is represented by FCR 

fragment 2R92-1-6. 

A murine SULU3 clone, G83-4-5, was first identified 
from a PCR screen with degenerate oligos, CCK4c and CCK4b; 
applied to sscDNA generated from murine day-12 embryos. 
10 sequence analysis of the 473 bo insert idencified a 471 ORF 

(157 am.inc acids) wich che potential to encode a novel human 
STK related co the C. elegans SULU gene (GB: 'J32275) 
produce - The antisense s.trand of G83-4-5 is identical at 
Che nucleic acid level to the 5 ' UTR of the murine ecsl 
15 orotooncogenic transcription factor (GB:X53953). This 

homology is likely -.he result of a cloning artifact attached 
to the 5' -end of the database entry for murine ecsl . 

PCR clone 3R1S-17-6 was isolated from hum.an A549 cell 
3SCDNA using prim.ers 5 ' - CCCCCGGGAAACGATGACCA and 5 ' - 
20 AGCCGCTGCCCCTCCTCTACTGT derived from G75-30-3C and R99_-43- 

•11, respectively. The 3' prim.er used in this PCR reaction 
misprimed leading to che formation of a cruncated product. 
•3P.i9.i7-6 contains a 1172 bp mserc and an ORF of 1170 bp 

(390 amdnc acids) . 
25 PGR clone 2R92-1-6 was isclaced from murine d8 embryo 

sscDNA using primers 5 ' - ACCGC.AACATCGCCACCTACTAC and 5'- 
GATTGCTTTGTGCTCATTCTTTGG derived from the 5' UTR of che etsi 
aene and the human EST A.A2 3 4 62 3, respectively. The latter 
(shown herein) encodes the C-terminus of hum.an 3ULU3 . 2R92- 
30 1-6 concams a 22^^ op ^n^e-^ c^-.u 

amino acids) . 

The partiao- murine So-Luj. cuNi-. v^-s. i.--.^-/ 
long and consist:s of a 2244 bp ORF (6-2249; flanked by a 5 

\(p3 
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bp (1-5) 5 ' UTP. . The sequence flanking the first: ATG 
conforms to the Kczak consensus for an iniciacmg 
methionine, ana is believed to be zhe translat ional scare 
sice for murine SULU3 . 
5 One EST fragmenc (AA446022) matches the 3' end of che 

partial murine SULU3 gene, but ac che tim^e of filina, che 
inventors believe that none exist in GenEank or the EST 
database chac macch its 5' end. 

10 cDMA Cloning and Characterization of Human S'JLlT3 

The hum.an SULU3 cDNA' sequence is composed of two 
overlapping clones, 2R90-22-1 and AA234623. 

A human SULU3 clone, AA234623, was firsc identified 
using, a Smi ch- Waterman search of the EST database v/ith the 

1^ eieqrans SULU gene (GE: U32275) as a query. ' Seauence 

analysis of the 2652 bp insert idencified a 11S5 bp ORF (395 
amino acids) v/ith che potential to encode che C~cerminus of 
a -novel human STK related to the C. eiegans SULU gene 
product . 

-0 PCR clone 2R90-22-1 v;as isolaced from hum.an SKMell28 

m.elanomia cell line sscDNA. using crimpers 5'- 

TATTGAATTGGCGGAACGGAAG and 5 ' - TTGTTCTAAGAGTGCCCTCCG derived 
from, che m^urine SULU3 2R92-1-6 clone and from AA234523, 
respectively. 2R92-1-6 contains a 1397 bp inserc and an ORE 

25 of IS 9 6 bp (63 2 am.mo acids) . 

The partial human SULU3 cDMA (SEQ ID NO. 20) is 3824 bp 
long. Clone 2R90-22-1 spans positions 1-1897 and clone 
A.^234623 spans posicions 1173. The overlaping region 
becween these clones exhibics 100% seauence idencitv. '^'pc^ 

30 human SULU3 cDNA contains a 2 3 53 bp ORF (2-22E9) flanked bv 

a 1 ^ 6 b op (2360-3824) 3 ' UTP. foil owe d b v a 19 nucleocide 
polydenylated region. A potencial polyadenylat ion s ional 
(AATAAA) is found ac positions 2602 -2507, Since the coding 
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region is open throughout "he 5' e:-:cent of tnis sequence, 
this is apparently a partial cDNA clone lacking che N- 
terminal start methionine. 

Multiple EST fragrrients (R02283) match the 3'end of the 
5 human 3ULU3 gene, but ac the time of filing, the inventors 

believe chac none exist in GenBank or the EST database that 
match its 5' end. 

cDMA Cloning and Characterization of GEK2 

10 The human GEK2 cDMA sequence is composed of three 

overlaopinq clones, AA45944S, 3R25-4S-1 and. GEK2_h#3 . 

A human GSK2 clone, AA459443, v/as first identified 
using a Smi th- Waterm.an search of the EST database with the 
human SLK gene (G3: A3002804; as a query. Sequence analysis 

15 of the 1286 bp insert identified a 1227 bp ORF (409 amino 

acids) with the potential to encode the N-terminus of a 
novel human STK related to the human SLK gene product. An 
additional Sm:i t h- Wat ermian search using the C-termLinus of the 
SLK gene as a query yielded three additional EST's, 

20 AA3236S7, AA3S0492 and AAlo8869, that encode the C- terminal 

region of hum.an GEK2 . 

?CR clone 2R93-41-17 was isolated from humian testis 
sscDNA using prim.ers 5 ' - AAGACCATGCCGTGCGCCG and 5 ' - 
ATTCCTTCAGGTTCTGGTTATGG derived from AA323637 and from. 

25 AA38C492, respectively. 2R98-41-17 contains a 851 bp insert 

and an GRF of 849 bp 1283 am^ino acids) . 

PGR clone GSK2_h#3 was isolated from, human sscDMA m^ade 
from the H23 tumor cell line using prim.ers 5'- 

GCAGCAAGTGGAGAA.GA.TGG and 5 ' - GGAAGCATCCCCAGAGCTGTAG derived 
30 from the sequence of clone 3R25-43-1 and from the 3' end of 

mrurine LOK (G3 : DS972S ; , respectively. GEK2_h#3 contains a 
■ . 1042bp insert and an ORE of 1041 bp (347 amiino acids) . 
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The fuil-length human GEK2 cDMA (SEQ ID NC:iC6.' is 29£, 
bp icng. Clone AA45944S spans posicions 1-1236, clone 3F:25 
43-1 spans positions 1100-2449 and clone GEK2_hff3 ' spans 
posicions 1920-2962. The overlapping regions between these 
5 clones exhibit 100% sequence identity. 

The ftuman GEK2 cDNA contains a 2904 bo ORF ( 5'^ -2 ->6'' ; 
flanked by a 5 8 bp (1-5S) 5 ' UTR . The sequence flankinc the 
first ATG conforms to the Kozak consensus for an initiating 
methionine, and is believed to be the translationai scart 
10 site for human GEK2 . 

Multiple EST fragments (AA465671) match the 5 ' end of 
zhe sequence, but only one (AA380492) matches zhe 3' end of 
che human GEK2 Qene . 



cDNA Cloning and Characterization of ?AK4 

The human ?AK4 cDNA sequence is represented by clone 



SK-B2#1 . 

A human PAK4 clone, R38460, was first idencified usinc 
a Smith-Wacerman search cf the EST database with zhe hum.an 
20 PAK gene (GB: U24152) as a query. Sequence analysis of che 

2332 bp insert idencified a 930 bp OR? (310 amino acids) 
with che potential to encode che C-cerm.inus of a novel human 
STK related co the human P.AK gene product . 

CDNA clone SNB2#1 was isolated from human gl ioblastom.a 
25 cell line SNB75 cDNA library using a probe derived from 

R88460. SNB2#1 contains a 3604 bp insert and an ORF of 2043 
bp (631 am.ino acids) . 

The full-length humian ?AK4 cDNA (SEQ ID NO. 27) is 3604 
bp long and consists of a 2043 bp ORF (143-21S5) flanked by 
30 a 14 2 bp ;l-14 2} 5 ' UTR and a 1419 3' UTR followed by a 22 ' 

nucleccide pclydenylaced region. A pocenCxal 

polyadenylation signal (A_ATTAAA) is found at positions 3532 - 
3538. The sequence flanking the first ATG conforms co che 
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Kozak consensus for an initiating methionine, and is 
believed zo be che cranslat ional start sice for huir^an FAK4 . 

The 3' UTR of the PAK4 gene contains a GT dinucleocide 
repeat prone co undergo expansion based on the nunnber of 
5 repeats found in clones SNBhI and R8S460; 32 and 23, 

respectively. Several neurologic disorders have been 
correlated wich the expansion- of di- or tri -nucleotide 
repeats similar to those found in the ?AK4 se^au^nce , 
suggesting ?AK 4 may also be a disease cargec and char rhis 
10 repeat in ics 3 ' UTR may serve as a diagnostic m.arker. 

Multiple EST fragments (AJ^.535791) macch "he 3 ' end of 
the human PAK4 gene, but at the time of filing, che 
inventors believe chac none exist in GenBank or "he EST 
database that: match ics 5' end. 

15 

cDMA Cloning and Characterization of PAK5 

The full-length hum^an PA.K5 cDNA sequence is composed oi 
two overlapping clones, H4 50#l-1 and SN38#5. 

A human PAP'E clone, R13S25, v;as firsc idencified using 

20 a Smd ch- Wacermian search of che EST database with the hum.an 

PA.K4 gene as a query. Sequence analysis of the 1248 bp 
inserc identified a 420 bp ORF (140 am^inc acids) v/ith the 
pocencial to encode che C-term.inus' of a no'/el hum.an STK 
related zo the human PA?C4 gene product . 

25 cDNA clone SNiBS#5 was isolated from human SNB75 cDNA 

library using a probe derived from; R18S25. SN32#1 contains 
a 2028 bp insert and an ORE of 1194 bp (398 amino acids) . 

The partial human PAK5 cDNA (SEQ ID NO. 28) is 2028 bp 
long and consiscs of a 1194 bp OR? (2-1195) flanked by an 

30 S33 bp (1196-2023) 3 ' UTR followed by a 22 nucleotide 

polydeny-la t ed region. A potenciai polyadenylat ion signal 
(AATTAAA) is found at positions 2004-2010. Since che codinc 
region is ooen chroL^crhou" che 5'- extent of chis seauence, 
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tnis is apparently a partial cDNA clone lacking che N- 
terrrdnal start methionine. 

Clone H4 6C#l-i was isolated frora a hurnan lung H4 5C cDMA 
library using a probe derived from the partial £iv32iii cDt\- 
clone described above. Sequence analysis of the 252b bp 
insert identified a 1773 bp ORF (592 amino acids) with che 
potential co encode a full-length PAK5 . 

The human PAK5 cDNA (SEQ ID NO: 102) is 2, SOS bp long 
and consists of a 1,773 bp ORF flanked by a 201 bp 5' UTR 
and a S33 bp 3' UTR. The sequence flanking tne first ATC- 
conforrr.s to the Kosak consensus {Kozak, M., Mucleic Acids 
Hes. 15, 8125-8145 (1987).! for an initiating methionine, and 
is believed to be the translational start site for ?AK5 ! 
FAK5 shares 99% sequence identity over 2 7 95 bo 

recent database '^ntr-^- 2i~'^{^^nAa -r-^-.^ 

, .".X .^L, _.u4o . Tnese sequences are 

oresumed to b-= fr-.rr- t-h- c^-i^ r^^^^ ■ 

j.x_rr, cn_ sa^.^e gene, with minor polvmor!: 

variations . 



i- w a 



20 



25 



30 



Exnressio:. Analv.n^ ^-r Mammal i^^ ^T E2 0 - . 
Protein Kinases; 
Mater ials .qnd Merhodc; 
Norther n blot ^n.^l..^So 

Mcrthern blots were prepared by running 10 a total RNA 
isolated from 60 hum.an tum.or cell lines (HOP-92, EKVX, MCI - 
H23, NCI-H226, NCI-K322M, MCI-H46C, MCI-H522, A^49 --^QP-so 
OVCAR-3, OVCAR-4, OVCAR-5, OVCAR-8, IGROVl , SK-OV-3, 
SNB-75, U251, SF-268, SF-295, SF-539, CCRF-CEM, K-562, MOLT- 
4, HL-50, RPMI S226, SR, Dy-145, PC-3, KT-29, HCC-2998, KCT- 
116, SW62C, Colo 205, HTC15, KM-12, UO-31, SN12C, A493! 
CaK.l, RXF-393, ACHN, 736-0, TK-iO, LOX IMVZ , Malm.-.f./ .v. 
MEL-2, SK-HEL-5, 3K-MEL-2.5, UACC-62, UACC-257, .M14, MCF-. 
MCF-7/ADR RES, Hs57ST, MDA-M3-231, MDA-.ME-435, MDA-N ='r/ 
549, T47D), from 22 human adult tissues (t.hym.us, lung, 



BNSDOCID: <WO 9953036A2J_> 



wo 99/53036 ■ ^ PCT/US99/08150 

duodenum, colon, cestis, brain, cerebellurrw cortex, salivary 

gland, liv^er, pancreas, kidney, spleen, scomach, ucerus, 
prostate, skeletal muscle, placenta, mammciry gland, bladder, 
lymph node, adipose tissue), and 2 human fecal norm.al 
5 tissues (fetal liver, fetal brain ), on a denaturing 

f ormta^ldehyde 1.2% agarose gel and transferring no nylon 
membranes . 

Filters were hybridized with random, primed [a'^PidCT?- 
labeled probes synthesized from the inserts of severed of 

10 the STE2 0 -related kinase genes. Hybrid! zac^-on was performed 

at 42 overnight in 6X S3C, 0.1% SDS, IX Denhardt ' s 

solution, 10 0 Lig/mL denatured herring sperm DNA with 1-2 x 
10'" cpm,/T.L.of ^^?-labeled DNA probes. The filters were 
washed in 0 . IX £3C/0.1% SDS, 65 ^C, and exposed on a 

15 Molecular Dynamics phosphcrimager . 

Quantitative PCR analysis 

RNA ivas isolated from, a variety of normal humtan tissues 
and cell lines. Single- stranded cDNA was synthesized from 

20 10 Lig of each RNA as described above using' the Superscript 

Pream^plif icat ion System (GibcoBRL) . These single strand 
templates were then used in a 25 cycle PCR. reaction with 
pi'imers specific to each clone. Reaction products v;ere 
elect rophoresed on 2% agarose gels, stained with ethidiumt 

25 bromide and photographed on a UV light box. The relative 

intensity of the STK-specific bands were estimated for each 
sample . 

DNA Array Based Expression Analysis 
30 Plasm.id DNA. array blots -were prepared by loading 0.5 ug 

denatured plasm.id for each STE2 0 - related kinase on a nylon 
mem.brane . The [c/^^PldCT? labeled single stranded DNA probes 
were synthesized from the total RNA isolated from several 
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human immune tissue sources or cumor cells (thvmus, 
dendrccy-es, mast cells, monocytes, E cells (primary, 
Jurkat, RPMIS226, SR) , T cells {CD3/CD4-f, THl , TK2 , CEM, 
M0LT4), K562 (megakaryocytes). Hybridization ivas performed 
at 42 °C for. 16 hours m 6X 3SC, 0.1% SDS , IX Denhardt ' s 
solution, 100 j.ig/m.L denatured herring sperm DMA wich lO'' 
cpm/mL of ia''P]dCTP labeled single stranded probe. The 
filters were washed in 0 . ix SSC/0.i% SDS, 55 °C, and exposed 
tor quantitative analysis on a Molecular Dynam.ics 
phcsphorinager . 

RESULTS 

Distriburion of STE20-Rel ar e d Gene TT-ansc- ^ nr'.^ m No^-r^.^ ^ 

2- s 3 u 'g s 5.nc Tu -TtO r 1 A Lii^e^ 



ZCl, ZC2, and ZC3 RNA expression was analyzed by 
quantitative PCR from m.ultiple hum.an normal tissues, 
cultured primary epithelial and endothelial cells, and tumor 
ceil lines. The results are summarized in Tables l and 2, 
with relative expression values ranging fro- o 
20 (undetectable) to 23 (very strong) . An "x" refers to samcle 

not tested. ZCl, ZC2 , and ZC3 were all expressed at very" 
low levels in most ncrm^al hum.an tissues, however ZCl and ZC2 
were more abundant m cultured epithelial cells and ZC3 in 
normial kidney and breast tissue. 

Expression of these 3 genes was also examined in a 
panel of human tum.or cell lines representing a diverse 
sampling of tumor types (Table 2). ZCl and ZC2 showed 
strong expression m cell lines from most melanomas and 
renal tumors and from some non- small cell lung cancers and 
colon tumors. ZC3 expression was consistently lower in the 
tumor cell lines except for high expression m m.ost breast 
cancers and leukerriias ^-h^ ^oHt-qt- -.r^> ^ • 

\-10 
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and ZC3 in cumcr cells versus normal tissues rriay provide an 

attractive target for oncology drug development. 

Expression of all che novel STE20 - related kinases was 
examined in a panel of human im.mune tissues /eel Is by 
5 hybridization to a DMA array bloc containing plasmids 

encoding each of these genes. STLK2 was broadly expressed 
in all 14 immune samples, whereas STLK4 and ?Arv4 were highly 
expressed in a subset of 6-7 of the samples (Table 3) . 
Several other kinases (SULU3, ZC4 , KHS2) had more rescriczed 

10 expression, while others were expressed in only a single 

imimune source (STLK3, thym.us; ZCl , dendrocytes; ZC3 , 
monocytes; PAK5 , masc cells and M0LT4 ) , and several more 
were absent from, all the imimrane sources assayed (GEK2, 
SULUl , ZC2 , STLK5) , These expression paccerns v;ere quite 

15 distinct am.cng m^embers of the same subfam.ily (.i.e., ZCl , 

ZC2, ZC3 and ZC4 , or PAKl , PAK2 , PAK3 , PAK4 , PAK5 ) . This 
analysis suggests that some of these kinases may be 
candidate targets for various im.mune disorders, and that 
som^e, which are more broadly expressed, miay m.ediate 

20 functions vital to the basic biology of most proliferating 

cells . 

TABLE 1 

ZCl, ZC2 and ZC3 Expression 
in Normal Hum;an Tissues and Cells 



Sample 




ZCl 


ZC2 


ZC3 












Brain 


Tiss 


2 . S 


0 . 6 


0 . S 


Duod 


Tiss 


3 . 8 


1 . 5 


0 . 3 


Heart 


Tiss 


1 . 2 


G . 3 


0 . 0 


Kidney 


Tiss 


i'\ —} 
'•^ . / 




'7 ("^ 


Lung 


Tiss 


1 . o 


.-^ o 




Pancreas 


Tiss 


2 . 0 


,-1 


2 . 5 
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Placenca 


Tiss 


i. . 4 


0 . 0 j 0 . C 

t 


Sal gl . 


Tiss 


3 . 0 


0 . 3 


3 . 2 


Sk mus . 


Tiss 


2 . 3 


0 . 1 


0 . 1 


Spleen 


Tiss 


0 . 4 


0 . 0 


X 


is iiomach 


Tiss 


0 


0 . 0 


0 . 0 


Thymus 


Tiss 


2 . 5 


0 . 4 


1 . 5 


Cereb 


Tiss 


2 . S 


1 . 1 


4 . 4 


Liver 


T:ls3 


1 . 8 


0 . 0 


0 . 4 


Uterus 


Tiss 


1 . 6 


0 . 0 


1 . 4 


Prostate 


Tiss 


1 . 4 


C . 0 


1 . 6 


Testis 


Tiss 


X 




5 . 8 


f Brain 


Tiss 


X 


X 


3 . 1 


Mam gl 


Tiss 


X 


X 


7 .2 


HCAEC 


ENDO 


1 . 0 


0 , 0 


0 . 0 


HMVEC-d 


ENDO 


0 . 7 


0 , 0 


0 . 4 


HMVEC - L 


ENDO 


2.2 


1 . 6 


1 . 8 


HPAEC 


ENDO 


9 . 3 


5 . 3 


6 . 4 


HMEC ] 


SPI 


4 . 1 


2 . 3 


1 . 9 


RPTEC Iept 
i 


3 . 6 


2 . 2 


0 .2 


HRCE I 




5 . 3 




1 - o 


HSAE I 




0 . 9 


3 . 3 


4 . 8 



TABLE 2 



ZCl , 2 


22 and ZC3 Expression in Tumor Cell iLnes 


Sample 


Origin 


ZCl 


ZC2 


ZC3 




Sample 


Origin 


ZCl 


ZC2 


ZC3 


HOP- 92 


LunQ 


9 . 3 


7 . 2 


3 .3 


HCC-2 993 


Cclon 


2 . 4 


3 . 3 


3 . 0 


EKVX 


Lunc 


10.713.7 


3 . 5 


HCT 115 




^ 

^ - *i 


2 . 1 


5 . 4 


MCI -H2 3 


Luna 


5 . 8 


6 . 3 




SW-52C 


Colon 


7 . 3 


12 . 1 


3 


NCI -H2 2 6 


Lung 


€ . 5 


5 . 3 


3 . 3 


COLO 2 05 


Colon 


9 . " 


16 . 2 13 . C 



SNSDCOD: <WO 9953036A2J_> 



wo 99/53036 



PCTAJS99/08150 



NCI -K3 2 2M 


Lung 


3.5 j5 . 3 


4 . 9 




HCT - 1 5 


Colon j 13 . S i4 . 9 

f i 




NCI -H460 


Lung 


4 . 5 


3 . 7 


2 . 9 




KM- 12 


Colo- 


7 . 0 


13 . 2 j3 . 1 


NCI -H522 


Lung 


4 . 7 


3 . 3 


4 . 6 




UO-31 


Colon 


10 . 4 


10.6 


0 . 9 


A54 9/ATCC 


Lung 


3 . 8 


3 . 6 


4 . 1 


SN12C 


Renal 


8 . 1 


3 . 4 


2 . 8 


KOP-62 


Lung 


4 . 3 


3 . 8 


4 . 2 


A496 


Renal 


6.2 


3 . 1 


2 . 9 


OVCAR-,2 


Ovary 


2 . 9 


3 . 1 


1 . 5 


Caki-1 


Renal 


9 . 2 


14 . 4 


2 . 3 


OVCAR-4 


Ovary 


3 . 3 


1 . 0 


3 . 8 


RXF 3 93 


Renal 


10 . 6 


4 . S 


2 . 8 


OVCAR-5 


Ovary 


2 . 6 


3 . 6 


2 . 2 


ACHN 


Renal 


9 . 3 


6 . 0 


3 . 9 


OVCAR-S 


Ovary 


3 . 6 


2 . 0 


4 . 7 


786-0 


Renal 


3 . 8 


15 . 6 


5 . 6 


IGROVl 


Ovary 


3 . 8 


1 . 7 


3 . 2 


TK-10 


Renal 


20 . 9 


21.2 


5 . 0 


SK-OV-3 


Ovary 


4.9 . 


0 . 0 


3.5 


LOX IMVI 


Mel 


2.3 


2 . 4 


3 . 3 


SNB- 19 


CNS 


5 . 1 


5 . 4 


4 . 2 


Malrr.e- 3M 


Mel 




X 


2 . 2 


SNB-75 


CNS 


2 . 5 


0 . 9 


C . 7 


SK-MHL-2 


Mel 


15 . 7 


14 . 1 


2 . 9 


U2 51 


CNS 


1 . 5 


1 . 2 


0 , 6 


SK-MEL-5 


Mel 


7 . 9 


7 . 0 


0 . 0 


S?-268 


CNS 


5 . 8 


2 . 7 


3 . 0 


SK-MEL-28 


Mel 


16 . 5 


23 . 1 


0 . 0 


SF-295 


CNS 


6 . 4 


1 . 1 


3 .2 


UACC-62 


Mel 


12 . 1 


18 . 3 


5 . 3 


SF-539 


CNS 


5 . 1 


2 . 9 


4 . 3 


UACC-2 57 


Mel 


10 . 8 


9 . 4 


6 . 2 


CCRF - CEM 


Leuk 


3 . 4 


2 . 7 


3 . 1 


M14 


Mel 


4 . 4 


0 . 9 


7 . 9 


K-562 


Leuk 


4 . 1 


6.3 k . 3 

i 


MCF7 


Breasc 


4 . 8 


1 . 3 


7 . 7 


MOLT - 4 


Leuk 


7 . 1 


3.4 j4 . 2 


MCF-7/ADR 


3reas c 


3 . 8 


3 . 4 


7 . 7 


KL-60 


Leuk 


X 


X |0.4 
f 


Hs 57ST 


Breasc 


c . 9 


2 . 6 


5 . 7 


RPMI 8226 


Leuk 


0 . 5 


0 . 2 


1 . 4 


MDA-MB-231 


Breast 


5 . 7 


1 . 9 


6 . 4 


SR 


Leuk 


3 . 5 


7 . 2 


5.4| 




MDA-MB-435 


Breasz 


4 . 8 


6 . 7 


9 . 1 


DU- 14 5 


Pro 


X 


X 


1 

3.4! 




MDA-N 


Breasc 


7 . 3 


6 . 3 


9 . 1 


PC-3 


Pro 




X 


3.4 j 




ET-549 jSrease 




1 . 9 


8 . 0 


HT-2S jCoion 
t 


2 . 4 


5 . 9 


6 . 6 i 


T-47D 


Ereasc 

i 


ii 


12.3 


9 3 i 
1 



113 
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Table STE2 0 - re j^ated kinase expression in a human imrr.une 

Danel 



KINASE 


thymus 


Dendro- 
cytes 


Mast 
cells 


Mono- 
cytes 


B 

cells 


CD84- 
CD4-f 


THl 


TH2 


GEK2 


350 


350 


350 


350 


350 


350 


3 50 


350 


SULUl 


350 


350 


3 50 


350 


3 50 


3 5 0 


3 50 


3 5 0 


SULU3 


350 


350 


350 


350 


12149 


3 50 


5115 


350 

1 


STLK2 


117770 


13771 


27620 


92036 


18 3 0 5 


39109 


5405 


3564 


STLK3 


8624 


350 


350 


350 


350 


350 


350 


3 50 


STLK4 


8 52'4 


350 


350 


'350 


350 


3685 


5642 


350 


STLK5 


XXX 


XXX 


xx>: 


XXX 


350 


350 


3 5 G 


XXX 


• ZCl 


3 5 0 


■ 3377 


3 50 


350 


3 50 


3 50 


3 50 


350 

i 


ZC2 


3 5 0 

i 


350 


3 50 


350 


3 50 


3 5 0 i 


3 5 0 ^ 


350 ! 


ZC3 


350 


350 


350 


20156 


350 


3 5 0 


3 5 0 


350 


ZC4 


XXX 


XXX 


XXX 


XXX 


350 


350 - 


350 


XXX 


KHS2 


8766 


2508 


3 50 


56575 


350 


35 0 


3 50 


350 


PAK4 


3265S 


7684 


3729 


10 0 94 8 

1 


350 


350 


350 


1604 


?AK5 


350 


350 


4905 


35 0 


350 


3 50 


3 50 


350 



KINASE 


CEM 

(T cell) 


M0LT4 
(T cell) 


iJURKAT 
(B cell) 


RPMI822 6 
(B cell) 


SR 

(B cell) 


K562 
(MO) 


GEK2 


350 


350 


350 


350 


350 


350 


SULUl 


350 


350 


350 


350 


350 


350 


SULU3 


350 


350 


350 


350 


350 


350 


STLK2 


47236 


53262 


47605 


22560 


55936 


3 0 3 9 0 


STLK3 


350 


3 50 


350 


350 


3 50 


3 5 0 


STLK4 


3 64S 


3 50 


26772 


157 0 


3 50 


350 


STLK5 


350 


350 1 350 


x:<:x 


350 


3 50 


ZCl : 350 


3 50 


! — ^ 

3 5 0 : 3 5 0 
i 




3 50 



1^^ 
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ZC2 


350 


350 


350 


350 i 350 
! 


3 50 


ZC3 


350 


350 


350 


350 


350 


350 


ZC4 


1094 


7813 


14945 


XXX 


350 


6385 


KHS2 


350 


350 


350 


350 


350 


350 


PAK4 


350 j 10246 


350 


3229 


350 


350 


PAK5 


350 


12672 


350 


350 


350 
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Transcript size from Northern data 



Kinase 


(kb) 


STLK2 


3 . 8 


STLK4 


5 . 0 


ZCl 1 6.9/4.7 


ZC2 I 6.0/8.0 


ZC4 




KHS2 


■4.4 


SULUl 


4.5 \ 


SULU3 


10 . 0 


GEK2 


5 . 5 


PAK4 i 4,8 


PAK5 i 3.5 



STLK2is widely expressed; the highest expression levels 
5 were found in placenta, spleen and PBL . 

STLK4 is also widely expressed in normal cissues ^ 
including heart, brain, placenca,. lung, liver, skeietal 
muscle, kidney, pancreas, spleen, thymus, prostate, testis, 
ovary, small intescine, colon, and peripheral blood 
10 lymphocytes. 3TLK4 was also decect:ed in Jurkac T cells. 

ZCl is highly overexpressed in the following human 
cancer cell lines: HCP-92, EKVX, MCI-H23, NC1-H226, MCl- 
H322M, NCI-H522, A54S, HO?-o2 (lung); OVCAR-3, CVCAR-4, 
OVCAR-5 (ovary); SMB-19, u2 5 1 , SF-26S, SF-295, 3F-53S (CMS); 
15 K-562, RPMI-3226 (leukemia); DU-145, ?C-3 (prosrace); HT-29, 

HCC-299S, HCT-115, 3W620, COLO-205, HCT-15, K:^-12 (colon); 
UO-31, CAKi-1, RXF-393, 736-0, TK-10 (renal); LOXIMVI , 
Malme-3M, SK-MEL,-2, SK-MEL-23, UACC-62, UACC-257, M14 



^15 
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(melanoma); and MCF-l, MCF-l/ADR, HIS 573T, MDA-MB-231, MDA- 
MB-431, MDA-N, BT-549, T-47D (breasc). 

ZC2 is expressed in brain and csscis. It: is hichly 
overexpressed in the follov/ing human cancer cell lines: TK- 
5 10 (renal); SK-MEL-2S, UACC-62 (melanoma );• T4 7D (breas::). 

Mcderace expression in HOP92 (lung); OVCAR4 , IGROVl 
(ovary); DNB75, U251 (brain); K-562 (leukemia); and COLO205 
(colon) . 

SULUl i.s cverexpressed in che following human cancer 
10 cell lines: HOP-92, EKVX, NCI-H23, NCI-K225, NCI - H3 2 2M , NCI - 

H522, A549, HO?-o2 (lung); OVCAR-3, OVCAR - 4 , 3VCAR-5, SK-OV- 
3 (ovary); SNB-19, U251, SF-263, SF-295, SF-539 (CNS); K- 
562, RPMI-8226 (leukemia); DU-145, ?C-3 (pros-ace;; HT-29, 
HCC-299S, HCT-116, SW52C,. COLO-205, HCT-15, KiM-12 (colon); 
15 UO-31, CAKi-1, RXF-393, 7S6-0, TK-10 (renal); LOX , IMVI , 

Malme-3M, SK-MEL-2, SK-MEL-2S, UACC-o2, UACC - 2 5 7 , vil4 
(melanom.a) ; MCF-7, MCF-7/ADR, HIS 578T, MDA-M3-231, mdA-MB- 
431, MDA-N, BT-549, T-47D (breast) 

SULU3 showed a broad pattern of expression in t-e 
20 normial tissue panel of RNAs . 

GEK2 v/as expressed in spleen, thymus and testis. 
Expression v^as high in the cell lines RBL-2K3 and H441. 

?AK4 was expressed in che normal tissues: brain, testis 
and prostate, and in the human cancer cell lines: HNCI-H23 
25 (lung); OVCAR-3 (ovary); SNB-19, U251 (CMS); RPMI-3226 

(leukemia); DU-145 (prostate); COLO-205, HCT-IE (colon). 

PAV.5 shov;ed weak expression levels in the norm.al 
tissues: brain, testes, bladder, colon, adrenal m.edulla, 
spleen, fetal liver, breast, cerebral cortex, cerebellum, 
30 thymus, salivary gland, lung, stomach, duodenum, uterus, 

prostate, skeletal muscle and placenta. PAK5 v/as 
overexpressed in the human cancer cell lines: HOP-92, EKVX, 
NCI-H23, NCI-H226, NCI-H322M, MCI-H522, A549, HOP-62 (lung); 
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OVCAR-3, OVCAR-4, OVCAR-5, SK-OV-3 (ovary); U2 5 1 , 

SF-26S, SF-295, 3F-539 (CNS); K-562, RPMI-S225 (leukemia); 
DU-145, PC-3 (proscace) ; KT-29, HCC-2993, KCT-llo, SW620,. 
COLO-205, HCT-15, KM-12 (colon); UC)-31, CAKi - 1 , RXr-393,, 
5 736-0. TK-10 (renal); LOXIMVI , Malme-3M, SK-MEL-2, SK-MEL- 

23, UACC-62, UACC-257, M14 (melanoma); MCF-7, MCF-7/ADR, HIS 
578T, MDA-MB-231, MDA-MB-431, MDA-M, ET-549, T-47D (breasr). 

EXAMPLE 3 : STE2 0 - re 1 a t ed Protein Kinase Gene Expression 
1 0 Veccor Construct ion 

Materials and Methods 
Expression Vector Construction 

Several expression constructs were generated for some 
of the human 3TE2G - related cDNAs including: a) full-length 
15 clones m a pCDNA expression vector; b) a GST- fusion 

construct containing the catalytic domain of the novel 
STS2 0 - related kinase fused to the C-term.inal. end of a GST 
expression cassette; and c) a full-length clone containing a 
Lys to Ala (K to A) mutation at the predicted ATP binding 
20 site within the kinase domain, inserted m the pCDNA vector. 

The to A" mutants of the STE2 0 - related kinase might 

function as dom.mant negative constructs, and v;ill be used to 
elucidate the function of these novel STKs . 

25 RESULTS 

Constructs for ZCl , 2C2 , ZC3 , SULUl , SULu3 , PAK4 and.PAKS 
have been generated. 

Numerous additional constructs have been generated for 
the various STE2 0 - subf ami ly kinases, including full length, 
30 kinase inactive and tagged versions. In addi t ion , ' t he 

follcv/mg three constructs v/ere designed for specific 
applications based on their uniaue dcmiain structure: 
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Construct 1: SULUl-coiled-coil2 
Vector: pGEX-4T 
Insern : CoileG-coil2 
Sequence: Amino acids 752-398 
Purpose: phage display 
Result: Interacts with GEK2 CCl 



Construct 2: SULU3 - coi led - coi 1 2 
Vecccr: pGEX4T 
10 Insert : coiled-coil 2 doiriain fused zg GST 

Sequence range of insert: amino acids 802-898 of SEQ 
Purpose: phage display 

Resulc: Inceraccs with coiled-coiled region of human 3LK 

15 Conscrucc 3 : PAK5 Dominant N'egative 

Vector: pCANS 

Inserc: Full length coding sequence of human PAK5 concaining 
Che following mutation: K350,351A (Lys at aa positions 350 
and 351 changed to Ala) . 
20 Purpose: to determ.ine role of human PA.K5 kinase activity in 

cell growth and cransf ormat ion . 
Result: Interferes with Ras transformation. 

EXAMPLE 4: Generation of Specific Im^m.unoreaQencs to 
STE20-Relaued Protein ?:inases 

Macerials and Methods 

Specific immunoreagencs were raised in rabbits agamsc 
KLH- or MAP- conjugated synthecic peptides corresoonding zo the 
hum.an STE2 0 - related kinases. C-cerminal peptides were 
conjugated to KLH v/ich giucaraldehyde , leaving a free C- 
term^inus. Internal peptides were MA? - con j ugaced with a 
blocked N- terminus. Additional immunoreagent s can also be 
generated by im.m.unizing rabbits v;ith the bacteriallv expressed 

1*7?: 



30 
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GST- fusion ' proteins containing the cyt cpl astrii c domains of ea-; 

novel STK. 

The various immune sera are first tested for reaccivicy 
and selectivity to recombinant protein, prior to testing for 
endogenous sources. 



10 



Western blots 

Proteins in SDS PAGE are transferred to im.mcbilon 
m.embrane . The v/ashmg buffer is PBST (standard phosphate- 
buffered saline pH 7.4 + 0.1% triton x 100). Blocking anc 
antibody incubation buffer is PBST -f5% milk. Antibody 
dilutions varied from. 1:1000 to 1:2000. 



15 



20 



25 



30 



RESULTS 

Three SULUl antisera (against both 53 9A and 540A) and 
tv;o SULU3 antisera (542A) reacted specifically with the 
peptide antigens. Antisera binding was competable with 
peptide. Experimencs with extracts from cells transfected 



vjxzn 



epitope -tagged SULUl and 3UL.U3 genes are underv;a] 



Antisera against the PAK4 C-termLinal peptide 554A 
reacted wi-:h purified Gst-?AK4 and detected a protein of the 
correct molecular weight from, tissue culture cells. 
Specific immunoprecipi tat ion experiments are ongoing to 
determine the reactivity v;ith native protein. 

Simdlar immunization and antisera testing experiments 
are underway for each of the other novel STE2 C - kinases . 

3TE2 0 - related protein kinase peptide im.mranogens and cheir 
specificity in recognizing endogenous protein by Western 
blots or immunoprecipi tations . 



j Protein 

i 


Sequence 


1 positions 


1 Con- : 

" 1 


West . 


IP ! 

1 


i STLK2 


EKFQrCCSADESr 


i t' D - 1 O 


i KLH 1 


V 


W i 


\ 3TLK4 


SISMSELFPTTDPVGT 


i 252-2€7 


1 KLH ; 




Y i 


i SULUl 


LDFPKEDYR 










i SULUl 


HGD PR ? E ? R FTQ 




i KLH 1 


Y 
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SULU3 j DPRTRAS DPOSPPnvQPUK 
PAK4 ' 



2-14 



j PAK5 
i PAK5 



PLMRQNRTR 



KLH 



411-429 
666-680 



KLH 



Mr 



KLK 



39G-39S 



_ggggRRAG?EKRPKSS ] 148^ 



I (C) RRKSLVGTPYWMA PE i 471 .40^ 

ND = no t done yen ^ ^ ' — 



KLH 



KLH j 
KLH ! 



V 



i ND 



cD^^^O-rcilacea orotein kinase rq^ 

imnuroQ^n<:; ^nrs -v^^- ~ ^-L^.asc; ub^ lasion Drone in 

rT^.-n^Q^ns c.na .neir specif ioitv in ^..coqp . nr. ;>.^- 

orocein by Wesr^^-r v.ir^^o ^ ■ ----gn^^^ng enaogenous 



Protein 



iomain 



2C1 



Coiled-coil/pro/B/ C 
3 



ZC2 



2C2 



! Coiled-coil /pro/B 



3 



PAK4 



?AK4 



Nterm 



PAK5 



Kinase/Cterrri 



PAKE 



A/ Nterm 




A/Nterm I ^ 



ND=noc done ye: 

The 5 0kD STLK2 o-or-ii- t;^- ^ 

p.ou.m wa^ expressed highlv m s-verai 
he[nacoooie^ ic 1 ^-.'r^^^ • . - 

^ ^-ic c.ll .ines including Jurkat, pGLlO, Rarros 

A20, W:.HI-231, K562, .KEL and fresM-. ^so"'^-- -h 
rrom C57/BL6 ^i.-e R^r-^ 1^1- - . 

_ ot b-L-.2 expression v/e-^e 

also de.e=.ed .n several t.^cr .eU lines .ncl.....o c.^ue, 

^^^o2ld, LS130, MDAM231 and A549. 

, ^ , -s detected .n .urka. T ^^^n. 

C0I02OB, HCTIIS, RX.-l, 293T, MDA^E231, andS.-..r.. 

Tne 170 kD ZC2 protein was defected in SK-Me^o. 
UACC-62. ■ 

Elevated levels of the 64 kD PAK5 oroce^n w^^e 

confirrr^ed m che breasc cancer eel", lin^. m^. , 

^ . , .^l^A-231 ana iMCF-7 

ctnd in the luno ^-nc-^ ^---'^ 

- -s^nc^r ^c^i line A549. 
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Exarr.ole : Recombinanr Sxoression and BiolocTical Assays 



Materials and Me c hods 

Transient: Expression of the Sce2 0 - relaced Kinases in 
MamiTialian Cells 

The pcDNA expression plasmids (10 |.ig DNA/100 rr-m plate) 

containing the STE20-related kinase constructs are 

incroduced inco 293 cells v/ith lipof ectamine (Gibco 3?^L) . 

Afner 72 hours, the cells are harvested in 0.5 mL 

sclubilizacion buffer (20 mM KEPES, pH 7.35, 150 nruM NaCl , 

10% glycerol, 1% Tricon X-100. 1.5 m^'I MgCl., 1 mM SGTA. 2 hlM 

phenylmechylsulf onyl. fluoride, 1 ug/mL aprotinin) .. Sample 

aliquocs .were resolved by SDS polyacrylamide gel 

eleczrophcresis (PAGE) on 6% acrylamide / 0 . 5 % bis -acrylamide 

gels ana elect rophorec ical ly transferred co nitrocellulose. 

Non-specific binding was blocked by preincubat ing blots in 

Blotto (phosphate buffered saline containing 5% w/v non-fat 

dried milk and 0.2% v/v nonider P-40 (Sigma)), and 

recombinant procein v;as detected using the various anti- 

peptide or anci-GST-f usion specific antisera. 

In Vicro Kinase A^ssavs 

Three days afcer cransf ect ion with the STE2 0 - related 
kinase expression concruccs, a 10 cm plate of 293 cells v/as 
washed wich ?3S and solubilized on ice with 2 mL PBSTDS 
containing phosphatase inhibitors (10 mM NaHPO^ , pH 7.25, 150 
mM NaCl , 1% Tricon X-100, 0 . 5% deoxycholate , 0.1% SDS, 0.2% 
sodium azide, 1 mJM MaF, 1 mM EGTA., 4 m^A sodium 
orthovanadate , 1% aprotinin, 5 ug/mL leupepcin) . Cell 
debris was removed by cencrif ugat ion (12 000 x g, 15 m.in, 4 
*^C) and the lysate was precleared by two successive 
incubations v.^ith 50 uL of a 1:1 slurry of protein A 
sepharose for 1 hour each. One -half mL of the cleared 



for 



STE20 - related Protein Kinases 
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supernatant was reacted with 10 of procein A purified 
kinase- specif ic aniiisera (generated frcrr: the GST fusion 
protein or antipeptide antisera) plus 50 uL of a 1:1 slurrv 
of protein A-sepharose for 2 hr at 4 °C. The beads were 
5 then washed 2 times in PBSTDS, and 2 times m HNTG (20 mM 

HEFSS, pH 7.5/150 mM NaCl , 0,1% Tricon X-100, 10% glycerol}. 

The immunopurif ied kinases on sepharose beads vyere 
resuspended in 2 0 uL HNTG plus 3 0 mM MgCl,, 10 m.^ MnCl , , and 

10 20 LiCi [a/=P]AT? {3000 Ci/mmol). The kinase reactions were 

run for 3 0 min at room temperature, and sccpped by addition 
of HNTG suppiemenced with 50 maM EDTA . The samples v/ere 
-washed 6 times in HNTG, boiled 5 min in SDS sample buffer 
and analyzed by 5% SDS-PAGE followed by aucoradicgraphy . 

15 Phosphoamino acid analysis v/as perform.ed by standard 2D 

m.ethcds on ''?-labeled bands excised from, the SDS-PAGE pel 

Similar assays were performed on bacterially expressed 
GST- fusion constructs of the kinases. 



20 



25 



30 



ZCl Assay buffer: 2 0 mM Tris pH 7.4, 2 00 mM NaGl , 0.5 
mM DTT, 3 mM MgC12 , 0.3 m.M MnCl2, lOOuM "^PvATP. 

Substrates: myelin basic procein (MEP) at 0.2S m.g/m^L 
and phosphorylated ZCl peptide RTVGRRNTFIGT- PPYWMAPE at 17 
yiM (bold underlined residue shows sice of phosphorylation) . 

At higher concentracions of MgCl, (3 miM) , the activity 
or ZCl (both full-lengch and recomjDinant: kinase dom.ain) is 
up zo 10 -fold greater rewards exogenous subscrate MEP. in 
contrast, the aucophosphorylac ion and the phosphorylation oj 
nhe activacion loop peptide substrate are boch inhibited. 
Mn-- does not inhibit the aucophosphorylat ion and che 
peptide phosphorylation by the truncated kinase domain form. 
However, boch the M3P phosphorylation, Mn^^ -preferring 
activity AND the autcphosphorvlat ing , MgT--f ~c-ref erri'-c 



BNSDOCID: <WO 9953036A2_!_> 



wo 99/53036 PCT/US99/08 1 50 

activity are elirranated with mutation of the ATP-bindma 

lysine in ZCl (Lys54Ala) indicating chat both activities ar 
attributable to the ZCl kinase domain. 

3ULU1 Assay buffer : This buffer is identical to cha- 
fer ZCl, except for 5 roM MgCl2 . Under these condicions, 
other STE20 family members (PAK4, ZCl) v/ere inhibiced for 



auccphcsphorylac ion and required reducing the [Mnj co 



<0 . 3 



n^\^': for an efficient autophosphcryiat ion reaccion. 
10 Substrates: MBP, phosvitin, or a-casein at 0.28 mg/mL, 

FAK4 . PAK5 Assav Buffer ; 2 0m>I Hepes pH 7 . 2 , 13 0 m.M KCl , 
10 rvJy'i MgCl2, 1 mM NaF, 20 rrM B -glycerolphospha t e , 0.5 hlM 
DTT, 5 0 iiM ATP, 0.5 yiC± ''PyATP. 
15 Subs- rates: MBP at 0.2 3 mg/mL and peptide substrates 

derived from PAK5 activation loop at 2.5 ).iM. 

STLK2 Assav buffer ; Similar to that described . above , 
except for the inclusion of 5 mM MgCl., 5 rcM MnCl. and 5 uCi 
20 ^^PyA.TF. 

Transformation i PAK exoerim.ent s ) 

Lov/-passage NIH3T3 fibroblasts displaying normal 
morphology (flat, non-ref ract ile cellular morphclogv) , as 

25 well as low rates of spontaneous trans form.at ion , were used 

in transformation assays. NIH3T3 cells were maintained in 
Dulbecco's modified Eagle's medium . supplemented with 10% 
(v/v) fetal calf serum., penicillin (100 U/m.L) and 
streptomycin (ICQ U/miL) and kepc in an humidified incubacor 

30 at 3 7 ^C and 5% CO,. 

Cells were transfecced with DMA-lipid complexes. As 
per manufaccurer instructions, 1 icof ec tamine was utilized to 
cransfect NIH3T2 cells. All r ransf ec" ions were wi"h eaual 

1^^ 
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10 



amounts of plasmzd DNA (DMA from the appropriate expression 
vector without insert was used to give equivalent amounts o 
DNA per cransf eccion) . 1 fig of activated allele of H-Ras 
was co-cransfecned wich increasing amounts of various 
alleles of PA.K5 . 

Foci were scored after 3 weeks by fixing 10 mm in 10% 
methanol, 10% acecic acid for 10 min. followed by stainmo 
wich 0.4% (w/v) crys-al violet in 10% methanol • for 10 mm, 
and washing with deionized water and drying at room 
temperature . 



iransrect ion; 



s t imu] 



15 



20 



acions. and Juciferase 



assays (7n 
experiment ) 

Cells {10^} were transiently transfected by 
eleccroporacion using a Gene Pulser (Bio-Rad Labs) with the 
setrmg of 960 _F and 250 V. 20-40 hours later, transfected 
cells (about 10^) were stimulated wich various stimuli. 
After a 6-hour simulation, cells were lysed, and luciferase 
activities were measured using the MicroLumat Plus (EG6.G 
Berrhold) . (j. Exp. Med. 183:611-620, 1996, hereby 
incorporated by reference herein in its entirety including 
any drawings, tables, or figures.) 
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RESULTS 

Protein expression and kinase 



related protein kinases 



ctivicy of novel STE2 0- 
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! 75 i 




1 y 




I ?AK5 1 64 


j 64 j 
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i V 




ZCl : Reoulation of 


kinase ac'iivitv 







ZCl is const icutively active as a full-length kinase 
when expressed either in vitro (TNT rabbit reciculocyte 
5 system) or in NIH 3T3 , 293T, or K1299 tissue culture cells. 

The endogenousiy expressed kinase is also active when 
irPLrrranoprecipi taced from carcinoma cell lines. 



ZCl signal incr Pathways 

10 Using human leukemic *T cell line Jurkac as a model 

system, the impac- of co transf ected wild- type ZCl on the 
activation .of cv;o reporter genes.; RE/A?-lucif erase and NFkB 
iucif erase / was examined. RE/AP is a composite in nhe IL-2 
gene promocer containing both a NFK3-like site and an AP-1 

15 site. 

Optimal accivacion of both RE/A?-lucif erase and MFk3- 
luciferase reporter genes in Jurkat T cells requires signals 
generated from, stimulation of both T cell receptor and the 
costimulator receptor CD2S. Cot ransf ec t ion of wild-type ZCl 

20 with either the RE/AP - luci f erase or the NFkE- lucif erase 

reporter results in the activation of RE/AP or NFk3 when 
costimulated with the anti-T cell receptor monoclonal 
antibody or the pharmacological reagents PMA and ionomycin 
that bypass proximal T cell receptor. No activation was 

25 seen when costimulated with an anti-CD28 monoclonal 

ant ibody . 

These results suggest that wild-type ZCl, when 
cverexpressed, was replacing a CD2 3 - specif ic signal to 
activate RE/AP or MFkB . These results imply that ZCl is 
30 involved in the CD2S signaling pathway. Since MFkE is one 

cr the major pathv/ays also activated by the pro- inf lamtmatorv 
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cytokine TNt- -a signaling, .it is also likely thac 
a component in the TNF-a signaling pathways. 



tTiav oe 



10 



15 



20 



?AK5 : Design of specific pentide substrates 

To aid in the development of in W tiro kinase assays for 

screening small molecule libraries to identify specific 

inhibitors, the search for specific peptide substrates for 

PAK5 v/as undertaken. 

The rationale used to design such peptides is based on 
the h\^o::hesis that upon binding activated small G protein, 
PA.K5 undergoes a conformational change that results in 
derepression of its kinase activity followed by 
aucophcsphcrylanion on the activation loop resulting in a 
fully active kinase. The site of aucophosphoryla t ion for 
related family members has been identified by biochemical 
and/or genetic means (e.g. Wu, C, et al. J.Biol.Chem 
270:15984-15992 and S zczepanowska , et al . Proc . Nat 1 . Acad . Sci 
94, 8503-8508, 1997) . Specific peptide substrates for ?AK5 
were designed from the sequence of the activation loop of 
this kinase . 

An activation loop PAKE peptide phosphorvlated on the 
Thr residue of the T?Y motif served as a high-af f mity 
substrate for PAK5 . 
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Transf ornrta t ion 
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Trans.f oroiati ich of lovz-passage l\'IH3T3 cells by ras in 
the presence or absence of various alleles of PAK5 showed 
thac -he don:^inant negative, kinase-dead allele of PAK5 v;as 
able to block ras transf ormac ion of NIH3T3 cells. Thus, PAK5 
accivicy is required for ras transf ormac ion c-f NIH3T3 cells. 

Inhibition of PAK5 activicy may have cherapeucic value as 
an anr i -prolif erat ive aoenc for treacina cancer. 
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PAK4 and PAK5 



■ nt^ 



. ion wic:i Cac4 



PAK 4 interacts wich CDC42 small G-protein but noc Rac , 
RhoA, or P^.as as decern^ined by co- transf eccion of recombinant 
genes and decection by kinase assays. PAK5 also interacts 
wiuh Cdc42 . Coding sequences of activated alleles of small 
G procems (ras, Cdc4 2, Rac , Rhc) cagged with a Myc epitope 



were transienclv exoressed in 29 3 T cell: 



^/ a r 1 o u s a 1 



1 1 



35S-labeled PAK5 cagged wit 
vi nro with the reticulocvce 



h HA e p i cope vi ere e >io 2 
(TNT) system. 
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Sxamole 6: Chromosomal Lo cal i zat: ion of Sre2 0 -Related 
Protein Kinases 

Macerials And Methods 

STE20 protein kinases STLK3 , STLK4 , ZCl , ZC2 , ZC3 , 
KHS2, SULUl, PAK4 , and PAPC5 were mapped using che GeneBridge 
4 Radiacion Hybrid Panel, RH02. 05 (Research Genecics:. The 
GeneB-ridge 4 Panel consists of 91 hybrid panel samples, m 
addition cc one human positive concrol (HFL) , and one 
hamsrer negative concrol {A23) . The sz-andard reaction 
condicicns used zo zest and conduce PGR reactions using che 
GeneBridge 4 Panel are available from Research Genecics. 

Oligonucleotide sequences (all 5' to 3') used fcr PGR 
m.apping v/ere : 

AGAGGGAGTATTGTGAGATGTA 
TTGGTAGA-ATTGAGATGAGGTACA 
CAJ\GGTGTTCTTTGGGTCTGTT 
AGAJ\GAGGACTTGGGACTTATG 
CCAGTGAJ^.TGAA.^.GGAGTAJUU\ 
ATTTGTGGT.AGTGGGATTCTGTG 
TAGCAAJ\A-A.GAGCGAJiuA.^.GTGTG 
GATGTTGTGTGGATTCTGTA-AAG 
CAGGTGGAGTAGTGATTTGG 

Positive reactions were assigned a score of ''1", 
negative reactions are assigned a score of ''0" , and 
amibiguous reactions are assigned a score of "2". Results 
were submitted to che Whitehead Instituce 

(www'^'genome.wi .mic.edu) for position analysis. Ghromosomal 
localizations for ZC4 , SULU3 , STLK2 , STLK5 and STLK5 were 
available publicly (for example, from Unigene) . The 
chromosomal locations of GEK2 and 3TLK7 have not been 
de t ermiined . 



1 STLK2 h 1 


XC25-27 . 1 


: (Public! 


STLK3 ; 


2q31.3 


: (SuQen: 


STLK4 h : 


3p22 . 3 -p22 . 2 


■ iSupen' 


STLK5 h : 


17q23 .2-24,2 


i (Public; - 





3TLK3 : GTGCGATTTGGTA.GGAJLAATGA , 

STLK4 : GCACA.GATGCGTATGTGTGTTG , 

ZGl : ATCGGTGGATGACAGTGGTTGT, 

ZC2 : AGATGGAGTGTAGTGGGAGGG , 

ZG3 : CATCA.TGA-a CTGGTGAGGGG , 

SULUl : GA^JLACGTGGGGGTGTGTTGTA.TT, 

KH3 2 : GAATAGGGGTACGATGATAGAATA \ 

PAK4 : CTCAGTAlTTGTGTGGAJLAGATTG , 

PAK5 : GATCAGTGGAAGTGTGGAGTG , 
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Many of the STE 2 0 kinases were mapped zo regions 
associated v/ich various human cancers, a.s shov;n belov;. 

The regions were also cross-checked 'wich the Mendalian 
5 Inheritance in Man database, which tracks genetic 

informanicn for many human diseases, including cancer. 
References for association of the mapped sites with 
chromosomal abnormalities found in human cancer can be found 
in: Knuutila, et al . , Am. J Pathol, 1993, 152:1107-1123, 
10 hereby incorporated herein be reference in its entirety 

including any figures, tables, or drawings. Association of 
these m.apped regions with other diseases is documented in 
the Online Mendalian Inheritance in Man 
(CMIM) ( htto: / /www. ncbi . nln . nih . gov/htbin-pos t /Omimi . 

15 

STLK2_h , Xq2 5-2 7.1, (Public^ 
Osteosarcoma, Xq25-qter, 2 of 31. 

Lymphoprolif erat ive syndrom.e , X- linked (OMIM No. 3 0324 0) 
human STLK3, 2q31.3, (Sugen) 
20 Squamous cell carcinoma of Head and Neck, 3 of 30. 

STLK4_h . 3p2 2 . 3-p22 . 2 , t Sugen ^ 

Mantle cell lym.phoma . 3pl4 -p22 1 of 27 

Squamous cell carcinoma of Head and Meek 3p22~p24 1 of 14 
25 Cardiomyopathy, dilated (OMIM 601154/ 

1 
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STLK5_h , 17q2 3 .2-24.2, (Public) 
Cervical cancer, 17q, 1 of 3 0 

Gascroesophageal j unc^zion adenocarcinoma xenograf , 17q , l 
5 of 5 

Breast carcinoma , 17ql2 -qter , 1 of 16 
Bladder carcinoma , 1 7q22 -q23 , 1 of 14 
Breasc carcinoma, 17q22-q25, 3 of 101 
Non-srpail cell lung cancer , 17q24 -q25 , 6 of 50 
10 Tescis, 17q24-qcer, 2 of 11 

Malignant peripheral nerve sheath cumcrs, 17q24-qter, 5 of 7 
Alzheimer disease, susceptibility co {Or4IM 106130) 

STLK6_h . 2q32.2 -q33.3, (Public) 
15 Non-small cell lung cancer, 2q31-q32, 1 of 50 

Squamous cell carcinom.a of Head and Neck, 2q31-q33, 3 of 30 
Small cell lung cancer, 2q32-q35, 1 of 22 

ZCl_h , 2pll.2, (Sugen) 
20 non-sm.all cell lung cancer, 2pter-ql3, 1 of 10 

non-smiall cell lung cancer, 2pcer-q21; 1 of 10 
Pulmonary alveolar proteinosis, congenital (OMIM 173640). 

ZC2_h, 3q26 .31-3q26 . 32, (Sugen) 
25 Non-small cell lung cancer, 3q26 . l-q26 . 3 , 26 of 103 

Cervical cancer, 3q26-.l-q27, 4 of 30 
Small cell lung cancer, 3q26.3-qter, 3 of 35 

Squamous ceil carcinoma of Head and Neck, 3q26.3-qter, 3 of 
13 

30 Marginal zone E-cell lymphoma , 3q2S -q27 , 1 of 25 

Pares ceal osteosarcoma , 3q2 6 -q2 3 , 1 of 1 
Gastrcinc est inal scrom.al cumor, 3q26-q29, 1 of 16 
Mantle cell lymphoma, 3q26-q29, 1 of 5 
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■ ZC3_h 17pl3.2-13.3 (Sugen/ 

Malignant fibrous hist iccytonria of sofc cissue, i7p, 2 of 53 
Leiomyosarcoma, 17p, 7 of 2 9 
5 Non-sm.all cell lung cancer, 17p, 1 of 50 

ZC4_h , Xq22, (Public) 

Diffuse large cell lymphoma, Xq22-i:sr, 1 of 31 
Deafness, X-linked 1, progressive. (OMIM 304700). 

10 

KHS2_h , 2p22-2p22.2, (Sugen) 
Synovial sarcom.a, 2p21-ql4/ l_of_67 
Follicular lymphoma, 2p22-p24, i_of_4 6 

Colorectal cancer, hereditary nonpoiypcsis , ~ype 1, Ovarian 
15 cancer (MSH2 , COCAl , FCCl), (CMIM 120435). 

SULUl__h , 12q24.21 (Sugen) 

Neuroglial tumors, 12q22-qter, 1 of 15 

Gastroesophageal junction adenocarcinomia , 12q2 3-qcer, 1 of 
20 5. 

Non-small cell lung cancer, 12q24 . 1 -24 . 3 , 2 of 50. 
SULU3_h 17pli.2 (Public) 

Malignant fibrous hist iocytomia of soft tissue, 17d, 2 of 5S 
25 Leiomyosarcoma, 17p, 7_of_2 9 . • 

non-small cell lung cancer, 17p, l__of_5C 
Diffuse large cell lymphoma, 17pll.2, l_of_3 2 
Osteosarcoma, 17pll.2-pl2, 4_of 31 

30 ?AK4_h : 15ql4 (Sugen) 

Schizophrenia, (OMIM 113511) . 



BNSDOCID: *:WO 99S3036A2J_> 



15 



wo 99/53036 

P„„^ , PCT/US99/08150 

t->.r.D_h : iSql3 .2-ql3 .3 (Sugsn) 

Follicular lymphoma, i9ql3, i of 45* 
Mantle cell lymphoma, 19qi3, 1 of 5 
Hepatocellular carcinoma, 19ql3.1, 2 of 50 
Small cell lung cancer, 19ql3.1, 10 of 35 
Breasc carcinoma, iggi 3 . i ^ t 3 3 

cervical cancer, 19ql3 . 1 -qter , 1 of 30 
Testis, 19ql3 . 1-qcer, 1 of 11 
Chondrosarcoma, 19ql3.2, 1 of 29 

Malignanc fxbrous histxocytoma of soft tissue, I9al3 2-a^e. 
2 of 53 " ~' 

Non-small cell lung cancer, 1 9qcen-ql3 . 3 , € of 104 

:.ourherr- ElorrTn^t ' ^ 

Materials and M^-n^Ho 

Nylon membranes were purchased from Boeh>-Hna^r 
Mannhexm. Denaturing solution contains 0.4 M Na^H and 0 6 
NaCl. ^.eucrail.ation solution concams 0.5 M Tris-HCL 
v.. and 1.5 M NaCl. Hybridization solution contain. 50.' 
formamide, SX SSPE, 2 . 5X Denhardt ■ s solution, 0.2 mo/mL 
aenacured salmon DNA, 0.1 mg/mL yeasc cRNA, and 0 . 2 % sodium 
docecyl sulfate. Rescriccion en.ymes were purcnased from 
Boenringer .Mannheim. Radiolabeled probes were prepared 
us.ng the Prime-it IZ ... Stratagene. The beta ac^m B.A 
rragm.enc used for a probe template was purchased f>-o. 
Clontiech . 

_ Genomic DNA was isolated from 20 different tumo^- c^i . 

30 t^-^^'- Caiu-., A549, HCT-IS, HT-29, ^olo 

^^^^ LS-iec, DLD-1, HCT-115, PC3, CAPAN-2, MIA-PaCa-o o..,. 
1. AsPc-1, BxPC-3, OVCAR-3, SKOV3 , SW S2e and PA-I, and ^-^n. 
two normal cell Imes: human mammary epithelia. cells an/^ 
numan umbilical vein endothelial cells. 
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A 10 |.ig aliquot: of each genomic DNA sanple was digesce 

wich EcoR I rescriccion enzyme and a separate 10 |.ig sample 
was digesced with Hind 111 restriction enzyme. The 
restriction-digested DNA samiples were loaded onto a 0.7% 
agarose gel and, follov/ing elect rophoret ic separation, the 
DNA. was capillary- transferred to a nylon memtbrane bv 
standard m.ethods (Sambrook, J. et al (19S9) Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor 
Labor^^cory) . 

? A K 5 Amo 1 i c o n : 

A 50 0 base pair fragment (EcoR I - Sac i) of the ?AK5 
gene was used as a template for a radiolabeled DNA probe 
v;hich was hybridized to the blots at 42 for 4S hours in 

hybridization solution using standard methods (supra}. The 
blots were exposed to a phosphorimLager screen for 4 davs , 
then scanned and analyzed using a Molecular Dynamics Storm 
84 0 phosphc-rimager . The relative miass and gene copy number 
values of the PA.K5 DNA fragments were calculated from the 
band density values obtained. The blots were re -hybridi zed 
with a radiolabeled probe copied from a fragmient of human 
beta act in DNA and developed as above to confirm the sam.ple 
m^a s s 1 o a d i ng e qu i va 1 e n c y . 

RESULTS ■ 

The PAK5 gene was determined to exhibit 3 -fold 
amplification compared to the norm.al DNA copy number in 
PAJ^JC-1 (pancreatic epithelioid carcinoma) and OVCAR-3 
(ovarian adenocarcinoma) hum^an cell lines, and ape r ox i mat el 
2 times the norm.al copy num.ber in the BxPC-3 (primary 
pancreatic adenocarc-inoma ; human cell line. 

Similar Southern analyses can be performed for other 
STE20 kinases. 

\^-3 
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h^xamcle 3 : Dececcio n uf P rotein- Procein Interaction 

Through Phage Display' 

Macerials And Methods 
5 Phage display provides a method for isolating molecular 

incerac-ions based on affinity for a desired bait. cDNA 
fragments cloned as fusions to phage coat proceins are 
displayed on the surface of che phage. Phage (s) inceracrinq 
v/ith a bait are enriched by affinity purification and che 
10 insert DMA from individual clones is analyzed. 

T7 Phaoe Dis clav Libraries 

Ail libraries were constructed in the T7Selectl-lb 
vector (Novageni according to the m.anuf accurer = s directions. 

15 

Bait Presentation 

Protein domains to be used as baits were generated as 
C-termmal fusions to GST and expressed in E. coll. 
Peptides were chemtically synthesized and biotinylated at the 
20 N-terrr.inus using a long chain spacer biotin reagent. 

Selection 

Aliquots of refreshed libraries (10^"-10-^ pfu) 
supplemented with PanMix and a cocktail of E, call 
inhibitors (Sigmia P-8465) were incubated for 1-2 hrs at room 
25 temperature witn the imim;Obil i zed baits. Unbound phage v/as 

extensively wasned (at least 4 tim.es) with v/ash buffer. 

After 3-4 rounds of selection, bound phage was eluted 
in 100 |.lL of 1% SDS and plated on agarose plates to obtain 
single plaques. 

30 

Identification of insert DNAs 

Individual plaques were picked into 2 5 uL of 10 mM EDTA 
and tne phage was disrupted by heating at 70 for 10 min. 
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2 |.lL of zlie disrupted phage v:ere added co 5C uL ?C?, 
reaction miK. The inserc DNA ivas amoiified tv 3 5 rounds cf 
thermal cycling (94oC, SOsec; 50cC, Imin; 72cC, Imin) 

5 Composition of Buffer 

lOx PanMix 
5% Tricon XlOO 

10% non-fac dry milk (Carnation) 
10 mM EGTA 
10 2 50 PolA NaF 

250 Lig/mL Heparin (sigma; 

250 f.ig/mL sheared, boiled salmon sperm DNA (sigma) 
O..G5% Na azide 
PreTDared in PBS 
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Wash Buffer 

PBS supplemented with: 

0.5% \*?-4 0 

2 5 ul g/mL heparin 



PGR reaction mix 

^•^ ICx ?CR buffer ( Perkin-Elmer , v/ith 15 n-\S^: Ma) 

0.2 mL eacri dNTPs flO mM scock) 

0 . 1 mL T7UP primer (15 pmol/|,tL) GGAGCTGTCGTATTCCAGTC 

25 0 . 1 mL T7DrJ primer (15 pm.ol/uL) AACCCCTCAAGACCCGTTTAG 

25 tl^! MgCl, or MgSC, co compensate for SDTA 
Q-S. CO 10 mL v/ich discilled wacer 
Add 1 unic cf Taq polymerase per 5 0 uL reaccion 

30 LIBRARY : T7 Seleccl-H441 
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RESULTS 



Phage display baits and inceractors 



Bait j Domain j Aa j Pacent 
i 1 SEQ ID 


CDNA 
library 


Interactor 


Sequence 

Range 
& SEQ ID 


SULUi Colled- 752-396 j 22 


H441 


GEK2 cc dcm ti) 


677-820 
SEQ ii26 


SULU3 i Coiled- | 755-89S j 23 
i ccil2 ! i 


H4 41 


SLK isof arm 


M837S0 

i 



interacced zo a lesser extent with the coiled-coil domain of an SLK 
isof orrn . 

The phage display data suggest: potential interactions 
of SULU3 -wich SLK and SULUI with GSK2 chrough their coiled- 
coil' domains'. ' Therefore two members of che SULU subfamilv 
of STE20 kinases interact wich two members of a separate 
STE20 family, the prototype being SLK. 

These results suggest a specificity in the interaction, 
and imply that these STE2 0 kinases may interact with each 
other ' through homo- and hetero-dimerizat ion . Alternatively 
SULU- related kinases could act imim.ediately up- or down- 
stream of the SLK-related kinases in a signaling cascade. 

One skilled in the art would readily appreciate that 
the present invention is well adapted to carry out the 
objects and obtain the ends and advantages mentioned, as 
well as those inherent therein. The molecular complexes and 
the r-thcds, procedures, treatments, molecules, specific 
25 comp.:.unds described herein are presently representative of 

preferred emx)odim.ents are exemplary and are not intended as 
limitations on .the scope of the invention. Changes therein 
and other uses will occur to those skilled in the art v;hich 
are encompassed within che spirit of the invention are 
30 defined by the scope of the claims. 

It v.'ill be readily apparent to one skilled in the art 
tr.at varying substitutions and modificacicns m»av be m.ade to 
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chs invencicn disclosed herein without deparcmg fro:r. che 

scope and spirit of the invention. 

All patents and publications mentioned in the 
specification are indicative of the levels of chose skilled 
in the art to which the invencicn pertains. 

The invention illustraciveiy described herein suitablv 
may be practiced in the absence of any element or elem.ents, 
limitation or limitations v.^hich is not specifically 
disclosed herein. Thus, for example, in each instance 
herein any of the terms ''comprising", "consisting 
essentially of" and ''consisting of" may be replaced v;ith 
either of the other tv;o terms. The term.s and expressions 
which have been employed are used as terms of descriction 
and not of limLitation, and there is no intention that in the 
use of such term.s and expressions of excluding anv 
equivalents of the featui-es shown and described or portions 
thereof, but it is recognized that various modifications are 
possible within the scope of the invention claim.ed. 

In particular, although some f ormrulat ions described 
herein have been identified by the excipients added to the 
formulations, the invention is m.eant to also cover the final 
formulation form.ed by the comjoination of these excipients. 
Specifically, the invention includes formulations in v/hich 
one to all of the added excipients undergo a reaction during 
formrulation and are no longer present in the final 
f orm;ulation, or are present in modified forms. 

In addition, where features or aspects of the invention 
are described in terms of i^arkush groups, those skilled in 
the art will recognize that the invention is also thereby 
described m terms of any individual member or subgroup of 
members of the Markush group. For exam.ple, if X is 
described as .selected from the group consisting of brom.me, 
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chlorine, and iodine, claims for X being oronme and claims 
for X being bromine and chlorine are fully described. 

Other embodiments are within che following claims. 
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CLAIMS 



1. An isolated, enriched, or purified nucleic acid 
molecule encoding a kinase polypept ide selected froir. che 
group consisting of STLK2 , STLK3 , STLK4 , STLK5 , STLK5 , 
STLK7, ZCl, ZC2 . ZC3 . ZC4 , KKS2, SULUl , SULU3 , GEK2 , ?AK4 , 
and PAK5 . 

2. The nucleic acid molecule of claim 1, wherein said 
nucleic acid molecule comprises a nucleotide sequence chat: 

(a) encodes a polypepcide comprising uhe amino acid 
sequence sec forch m SEQ 'ID NO : 5 , SEQ ID NG : 6 , SEQ ID NO : :^ , 
SSQ ID MO: 13, SEQ ID NO: 14, SEQ ID NO : 1 5 , SEQ ID NO: IS, SEQ 
ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 
NO: 31, SEQ ID NO: 97, SEQ ID NO : 9 9 , SEQ ID NO: 101, SEQ ID 
N0:1C3, cr SEQ ID N0:1C7; 

(b) is the complem.enc of the nucleocide sequence of 

(a) ; 

(c) hybridizes under highly scringent condicions to 
the nucleotide molecule of (a) and encodes a naturally 
occurring kinase polypeptide; 

(d) encodes a kinase polypeptide having the am.mo acid 
sequence of SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID NO : 7 , SEQ ID 
NO: 13, SEQ ID NO: 14,. SEQ ID NO : 1 5 , SEQ ID NO : 1 S , SEQ ID 
NO:22, SEQ ID NO : 2 3 , SEQ ID NO : 2 4 , SEQ ID NO : 2 9 , SSQ ID 

NO: 31, SEQ ID NO : 9 , SEQ ID NO : 9 9 , SEQ ID NO: 103, or SEQ ID 
NO:lC7, except that it lacks one or miore, but not all, of 
the follov;ing segm.ents of amiino acid residues: 1-21, 22-274, 
or 275-415 of SEQ ID NO : 5 , 1-31, 32-303, 309-489 or 490-516 
of SEQ ID NO:^, 1-178 or 179-414 of SEQ ID NO ; 7 , 1-22, 23- 
239, 290-526, 527-640, 641-395, or 397-1239 of SEQ ID N0:13, 
1-255, 255-442, 443-526, 627-954, or 955-1297 of . SEQ ID 
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MO:14, i-255, 256-476, 47--5a0, 631 -535, or 334 -:32-5 of SEQ 
ID MO:15, 1-13, 14-273, 274-345, 347-534, or 535-8£^4 of SEQ 
ID MO:lS, 1-21, 22-277, 273-427, 42S-637, 63S-751, or 752- 
S98 of SEQ ID NO:22, 1-66, 57-215, 216-425, 426-539, 540- 
786, or 737-337 of SEQ ID NO:23, 1-25, 26-2-:^3, 274 -422, 4 2 1- - 
632, or 633-748 of SEQ ID NO:24, 1-51, 52-224, 225-353, 394- 
658, or 659-631 of SEQ ID NO : 2 9 , 1-25, 26-281, 232-430, 431- 
640, 641-754, 755-901, or 902-1001 of SEQ ID MO : 3 1 , 1-lC, 
11-321, or 322-373 of SEQ ID NO:97, 1-37, 58-369, or 370-413 
of SEQ ID NO:99, 1-52, 53-173, 174-307, 308-572, or 573-591 
of SEQ ID NO:103, or 1-33, 34-294, 295-337, 333-472, 473- 
724, or 72 5-968 of SEQ ID NO : 107; 

(e) is the compl en-ier.t of the nucleotide secuence of 

(d) ; 

(f) encodes a polypeptide having the amino acid 
sequence sec forth in SEQ ID NO : 5 , SEQ ID NO : 5 , SEQ ID NO : 7 , 
SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO : 1 S , SEQ 
ID NO:22, SEQ ID NO : 2 3 , SEQ ID NO:24, SEQ ID WO : 2 9 , SEQ ID 
NO: 31, SEQ ID NO: 97, SEQ ID NO : 9 9 , SEQ ID NO: 103, or SEQ ID 
NO:107 from amino acid residues 1-21, 22-274, or 275-416 of 
SEQ ID NO:5, 1-31, 32-308, 309-439 or 490-516 cf SEQ ID 
N0:6, 1-178 or 179-414 of SEQ ID NO : 7 , 23-285, 290-526, 527- 
640, 641-S96, or 397-1239 of SEQ ID NO: 13, 1-255, 256-442, 
443-626, 527-954, or 955-1297 of SEQ ID NO:14, 1-255, 256- 
475, 477-530, 631-933, or 934-1326 of SEQ ID NO:15, 1-13, 

14 -273, 274-346, 347-534, or 535-394 cf SEQ ID NO : 1 S , 1-21, 
22-277, 278-427, 423-637, 633-751, or 752-393 of SEQ ID 
NO:22, 1-66, 67-215, 216-425, 426-539, 540-786, or 7S7-SS7 
of SEQ ID NO:23, 1-25, 26-273, 274-422, 423-632, or 533-748 
of SEQ ID NO:24, 1-51, 52-224, 225-393, 394-553, or 659-631 
of SEQ ID NO:29, 1-25, 2 6 - 2 3 1 , . 2 8 2 - 4 3 G , 431-540, 541-754, 
755-901, or 902-1001 of SEQ ID NO: 31, 1-10, ll-32i„ or 322- 
373 of SEQ ID NO:97, 1-57, 53-355, or 370-418 of SEQ ID 

^00 
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NO:99, 1-52, 53-173, 174-307, 308-572, or 573-591 of SEQ ID 

MO:103, cr 34-294, 295-337, 338-472, 473-724, or 725-96S of 
SEQ ID NO : 107 ; 

(g) is the complemenc of the nucleotide sequence of 

5 (f); 

(h; encodes a polypepcide corrprising zhe amino acid 
sequence set forch in SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID NO : 7 , 
SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO : 1 5 , SEQ ID NO: IS, SEQ 
ID NO: 22, SEQ ID NO : 2 3 , SEQ ID NO : 2 4 , SEQ ID NO : 2 9 , SEQ ID 

10 NO: 31, SEQ ID NO: 97, SEQ ID NO : 99 , SEQ ID NO : 1 0 1 , SEQ ID 

N0.':1C3, or SEQ ID NO: 107, except chat it lacks one or uore, 
but noc all, of the domains selected from the group 
consisting of a C-terminal domain, a catalytic domain, an N- 
terminal domain, a spacer region, a proline-rich region, a 

15 coiled-coil structure region, and a 0-termiinal tail; or 

;i) is the ccmplemenc of the nucleotide sequence of 

(h) . 

3. The nucleic acid molecule of claim 1, furrher 
20 comLprising a vector or prcmocer effective to initiate 

transcription in a host cell . 

4. The nucleic acid molecule of claim 1, wherein said 
nucleic acid molecule is isolated, enriched, or purified 

25 from, a m-amtmtal . 

5. The nucleic acid molecule of claim. 4, .wherein said 
mam.meil is a hum.an . 

30 6. A nucleic acid probe for the detection of nucleic 

acid encoding a kinase polypeptide m a samtole, v/herem said 
polypeptide is selected from: the group consiscmg of STLK2 , 



^0/ 
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.STL?-:3, STLK4 , STL-K5 , STLK5 , STLK7 , ZCl . ZC2 , 2C5 , ;KH£2 , 
SULUi, 3uLU3 , GE?C2, PAfC4 , and PAK5 . 

7. The probe of claim 6, wherein said polypeptide is 
3 a fragment of the protein encoded by the amino acid sequence 

set for-h in SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID NO : 7 , SEQ ID 
NO: 13, SEQ ID NO: 14, SEQ ID-iaO:15, SEQ ID NO: IS, SEQ ID 
NO: 22, SEQ ID NO : 2 3 , SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 
MO: 31, SEQ ID NO: 97, SEQ ID NO:99, SEQ ID NO : 1 0 1 , SEQ ID 
10 NO: 102, or SEQ ID NO:lC7. 

a. A recombinanc cell comprising a nucleic acid 
molecule encoding a kinase polypeptide selected 



20 



25 



30 



i-rcm Che 

roup consisting of STLK2 , STLK3 , STLK4 , STLK5, STLKS , 

ZCl, ZC2, ZC3, :<HS2, SUL'Jl , SULU3 , GEI<:2 , PAK4 , anc 
PAK5 . 



1 5 STLK7 . ZC'i - 7^7 



9. The cell of claim 8, wherein said polypeocide is 

fragment of the protein encoded by the arr^ino acid sequence 

set forch in SEQ ID NO : 5 , SEQ ID NO: 5, SEQ ID NO : 7 , SEQ ID 

NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: IS, SEQ ID 

NO:22, SEQ ID NO : 2 3 , SEQ ID NO:24, SEQ ID NO:29, SEQ ID 

NO:31, SEQ ID NO: 97, SEQ ID NO: 99, SEQ ID NOtiOi, SEQ ID 
NO: 103, or SEQ ID NO: 107. 



10. An isolated, enriched, cr purified kinase 
polypeptide selected from the group consisting of STLK2 , 
STLK3, 5TLK4, 3TLK3 , STl^yie , STLK7 , ZCl, ZC2 , ZC3 , 2C4 , KHS2 
SULul, SULU3, GSK2 , PAK4 , and 9A¥.S . 



11. The polypeptide of claim 10, wherein said 
pclypep-ide is a fragment of the protein encoded bv the 
amino acid sequence set forth m SEQ ID NO : 5 , SEQ ID NO : 6 , 
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SEQ ID K'0:7, SEQ ID NO : 1 3 , SEQ ID NO: 14, SEQ ID NO : 1 5 , SEQ 

ID NO: IS, SEQ ID NO : 2 2 . SEQ ID NO : 2 3 , SEQ ID NO: 24, SEQ ID 
NO: 29, SEQ ID NO : 3 1 , SEQ ID NO: 97, SEQ ID NO : S 9 , SEQ ID 
NO:lCl, SEQ ID NO:103, SEQ ID NO:105, or SEQ ID NO:I07. 



10 



15 



20 



2:) 



30 



NO 
NO 



12. The polypeptide of claim 10, v/herein said 
polypeptide connprises : 

(a) the amino acid sequence sen forch in SEQ ID NO : 5 , 
SEQ ID NO: 6. SEQ ID NO : 7 , SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID 
NO: 15, SEQ ID NO : 1 3 , SEQ ID NO: 22, SEQ ID NO : 2 3 , SEQ ID 

^4, SEQ ID NO: 29, SEQ ID NO : 3 1 , SEQ ID NO: 97, SEQ ID 
99, SEQ ID NO: 101, SEQ ID NO : 1 0 3 , SEQ ID NO: 105, or SEQ 
ID NO : 107 ; 

(b) the amino acid sequence set forth in SEQ -ID NO : 5 , 
SEQ ID NO:'£, SEQ ID NO : 7 , SEQ ID NO : 1 3 , SEQ ID NO: 14, SEQ ID 
NO: 15, SEQ ID NO: IS, SEQ ID NO: 22, SEQ ID NO : 2 3 , SEQ ID 

NO: 24, SEQ ID NO : 2 9 , SEQ ID NO : 3 1 , SEQ ID NO : 9 7 , SEQ ID 
NO: 99, SEQ ID NO: 103, SEQ ID NO: 105, or SEQ ID NO: 107, 
excepc that it lacks one or more, but not all, of the 
folloiving segments of amino acid residues: 1-21, 22-274, or 
275-415 of SEQ ID NO : 5 , 1-31, 32-303, 3C9-4S9 or 490-516 of 
SEQ ID NO: 6, 1-178 or 179-4 14. of SEQ ID NO: 7, 1-22, 23-239, 
290-526, 527-540, 541-895, or 897-1239 of SEQ ID NO:13, 1- 
255, 255-442, 443-525, 527-954, or 955-1297 of SEQ ID N0:14, 
1-255, 255-475, 477-6S0, 581-933, or 984-1326 of SEQ ID 
NO:15, 1-13, 14-273, 274-345, 347-534, or 535-894 of SEQ ID 
NO:lS, 1-21, 22-277, 278-427, 428-537, 538-751, or 752-898 
of SEQ ID NO:22, 1-55, 57-215, 215-425, 425-539, 540-785, or 
7S7-387 of SEQ ID NO : 2 3 , 1-25, 26-273, 274-422, 423-532> or 
533-748 of SEQ ID NO:24, 1-51, 52-224, 225-393, 394-558, or 
559-531 of SEQ ID NO:-29, 1-25, 25-281, 282-430, 431-540, 
541-754, 755-901, or 902-1001 of SEQ ID NO : 3 1 , 1-10, 11-321, 
or 322-373 of SEQ ID MO: 97, 1-^57, 58-359, or 370-41-8 of SEQ 
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ID NO:99, i_52, 53-173, 174-307, 30S-572. or 573-591 cf SSQ 
ID NO:i03, 1-24, 25-239, 290-397, 393-62S, 62S-66S, 559-372, 
or 873-1227 of 3EQ ID NG:105, or 1-33, 34-294, 295-337, 338- 
472, 473-724, or 725-968 of SEQ ID N0:107; 

(c) uhe amino acid sequence set forth in SEQ ID NO: 5, 
SEQ ID NO: 6, SEQ ID NO : 7 , SEQ ID NO : 1 3 , SEQ ID NO: 14, SEQ ID 
NO: 15, SEQ ID NO : 1 8 , SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID 
NO: 24, SEQ ID NO : 2 9 , SEQ ID NO : 3 1 , SEQ ID NO : 97 , SEQ ID 
NO: 99, SEQ ID NO: 103, SEQ ID NO: 105, or SEQ ID NO: 107 from 
arrdno acid residues 1-21, 22-274, or 275-416 of SEQ ID NO : 5 , 
,1-31, 32-308, 309-489, or 490-516 of SEQ ID NO : 6 , 1-17S or 
179-414 of SEQ ID NO : 7 , 23-289, 290-526, 527-640, 641-S96, 
or S97-1239 of SEQ ID NO:13, 1-255, 256-442, 443-626, 627- 
954, or 955-1297 of SEQ IDNO:14, 1-255, 256-476, 477-680, 
631-933, or 934-1325 of SEQ ID NO:15, 1-13, 14-273, 274-346, 
347-534, or 535-894 of SEQ ID NO:18, 1-21, 22-277, 278-427, 
423-637, 638-751, or 752-893 of SEQ ID N0:22, 1-66, 67-215, 
216-425, 426-539, 540-786, or 787-887 of SEQ ID NO : 2 3 , 1-25, 
. 25-273, 274-422, 423-632, or 633-748 of SEQ ID NO:24, 1-51, 
20 52-224, 225-393, 394-653, or 659-631 of SEQ ID NO : 2 9 , 1-25, 

25-251, 232-430, 431-640, 641-754, 755-901, or 902-1001 of 
SEQ ID NO:31, 1-10, 11-321, or 322-373 cf SEQ ID NO:97, 1- 
57, 53-359, or 370-418 of SEQ ID NO:99, 1-52, 53-173, 174- 
307, 308-572, or 573-591 of SEQ ID NO: 103, 1-24, 25-289, 
290-397, 39S-528, 629-668, 669-872, or 373-1227 of SEQ ID 
NO:105, or 34-294, 295-337, 338-472, 473-724, or 725-968 of 
SEQ ID NO: 107 ; or 

id) the ammo acid sequence set forth in SEQ ID NO : 5 , 
SEQ ID NO:6, SEQ ID NO : 7 , SEQ ID NO : S , SEQ ID NO : 1 3 , SEQ ID 
NO:14, SEQ ID N0:15, SEQ ID NO:16, SE: ID NO : 1 3 , SEQ ID 
NO: 22, SEQ ID NO: 23. SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 
N0:31, SEQ ID NO:97, SSQ ID NO:99, SEQ ID NO: 101, SEQ IE 
NO: 103, SEQ ID NO: 105, or SEQ ID NO: 107, excep- that ic 



25 
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lacks one or more, bu- noc all, of the domains selectee frorr. 

the group consisting of a C-cern^inal domain, a cacalycic 
domain, an N-terrainal domain, a spacer region, a proline- 
rich region, a coiled-coil scruccure region, and a C- 
cerm.inal tail . 

13. The kinase polypeptide of claim 10, wherein said 
polypepcide is isolated, purified, or enriched from a 
mamimial . 

14. The kinase polypeptide of claim 13, v/herein said 
mammal is a hum.an . . 



15. The kinase polypeptide of claimi 10, wher-ein said 
15 polypepcide xs a STLK2 , STLK3 , STLK4 , STLK5 , 5TLK6 , STLK7 

polypepcide. . - 

16. The kinase polypepcide of claimt 10, wherein said 
polypepcide is a ZCl , ZC2 , ZC3 , or ZC4 polypeptide. 

20 • 

i". '.The kinase polypeptide of claim; 10, wherein said 
polypepcide is a KHS2 polypepcide. 

IS. The kinase polypeptide of claim. 10, wherein sai;': 
25 polypepcide is a SULUl or 5ULU3 polypepcide. 

19. The kinase polypepcide of claim 10, v;herein said 
polypepcide is a GEK2 polypeptide. 

30 20. The kinase polypepcide of claim 10, wherein said 

polypepcide is a ?AK4 or ?AK5 polypeptide. 



l05 



<WO 9953036 A2 J _> 



wo 99/53036 PCT/US99/08150 

21. An ancibody or antibody fragment: having specific 
binding affinity to a kinase poiypepcide selecred from zhe 

, group consistrng cf 3TLK2 , STLK3 , STLK4, STLK5, 5TLK6 , 
STLK7, ZCl, ZC2. ZC3 , ZC4 , KHS2 , SULul , 3ULU3 , GEK2 , ?AK4 , 
5 and ?AK5 or a ki^nase domain polypeptide selected from che 

group consisting of STLK2 , STLK3 , STLK4 , STLK5 , STLK6 , 
STLK7, ZCl. ZC2, ZC3 , ZC4 , KHS2 , SULUl , SULU3 , GEK2 PAK4 , 
and PAK5 . 

22. A hybridoma which produces an annibody having 
specific binding affinity to a kinase poiypepcide selecced 
from the group consiscing of 3TLK2 , STLK3 , STLK4 , STLK5 , 
STLKt, STLK7, ZCl , ZC2 , ZC3 , ZC4 , KHS2 , SULUl , SULU3 , GEK2 , 
?AK4 , and PAK5 . 

15 

23 . A m.echod for identifying a subsnance chat 
modulates kinase actiivity comprising the steps of: 

(a) contaccing a kinase polypeptide selecced from the 
group consisting of 3TLK2 , STLK3 , STLK4 , STLK5 , STLK6 , 

20 STLK7, 2C1, ZC2, ZC3 , ZC4 , KH32 , SULUi , SULU3 , G5K2 , PAK4 , 

and PAK5 with a cest substance; 

(b) measuring the activity of said polypeptide; and 

(c) determining v/hecher said substance modulates the . 
activity of said polypeptide. 



95 



24 . A m.ethcd for identifying a substance that 
modulates kinase activity in a cell com;prising the steps of: 

(a) expressing a kinase polypeptide m a cell, v/herein 
said polypeptide is selected from the group consisting of 
STLK2, 5TLK3, 3TLK4 , STLK5 , STLK6 , STLK7 , ZCl , ZC2 , ZC3 , 
ZC4 , r:HS2, SULUl, SULU3 , GEK2 , PAK4 , and,?AK5; 

(b^ adding a test substance to said ceil; and 
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(c) morxitoring a change in ceil pheno type or the 

interaccion betv.-een said polypeptide and a nacurcil bindmc: 
parcner . 

25. A method for treating a disease or disorder by 
adminiscering to a patienc in need of such treatment a 
substance that modulates the accivity of a kinase selected 
from the group consisting of STLK2 , STLK3 , STLK4 , STLK5 , 
STLKb , 5TLK7, ZCl ZC2 , ZC3 , ZC4 , KHS2 , SULUl , SULU3 , GEK2 , 
?AK4 , and ?AK5 . 

26, The method of claim 25, wherein said disease or 
disorder is selected from the group consisting of immune- 
related diseases and disorders, organ cransplantat ion , 
myocardial infarction, cardiovascular disease, stroke, rena 
failure / oxidat ive stress -related neurodegenerat i ve 
disorders, and cancer. ^ . • ' 



27. .The mechod of claim 25, wherein' said substance 
m.odula~e"s kinase activicy in vitro. 



2S. Tne metinca oi c^aim 27, wnerein said substance 
a kinase inhibitor. 



29. A m.ethod for dececcion of a kinase polypeptide in 
a sample as a diagnostic locl for a disease or disorder, 
v;herein said method com.prises: 

(a) contacting said sample with a nucleic: acid probe 
which hybridizes under hybr idizar ion assay conditions co a 
nucleic acid target region of a kinase polvpeoiide selected 
fromi the group consiscmg of STLK2 , STLK3 , STLK4 , STLK5 , 
STLK6, STLK7,. ZCl, ZC2 , ZC3 , ZC4 , ?:HS2 , SULUl , SULU3 , GEK2 , 
PAK4 , and ?AK5 , said probe comprising the nucleic acid 

loi 
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sequence encoding said pol ypepcide , fragmenr- cherecf, or 
-he conplemencs of said sequences and fragmencs; and 

(b) dececcing the presence or amount of the 
probe :carget region hybrid as an indication of said disease. 

30-. The method of claim 29, v.'herein said disease or 
disorder is selected from che group consisting of immune- 
rela-ed diseases and disorders, organ transplancat icn, 
myocardial infarccion, cardiovascular disease, stroke, renal 
failure, oxidative stress -related neurodegenerac ive 
disorders, and cancer. 



21. A method for detection of a kinase polypeptide in 
a sample as a diagnostic tool for a disease or disorder, 
•wherein said m.ethod comiprises: 

(a) comparing a nucleic acid target region encoding 
said kinase polypeptide in a sample, wherein said kinase 
polypeptide is selected from the group consisting of STLK2-, 
STLrO, STLK4, STLFCS , STLK6 , STLK7,. ZCl , ZC2 , ZC3 , ZC4 , KHS2 , 

20 SULUI, SULU3, GEK2 , ?AK4 , and ?AK5 , or one or more fragments 

thereof, with a control nucleic acid target region encoding 
said kinase polypeptide, or one or more fragm.ents thereof; 
and 

(b) detecting differences in sequence or am.cunt between 
2--> said target region and said' control target region, as an 

indication of said disease or disorder. 

32. The method of claim 31, v.-herem said disease or 
disorder is selected from the group consisting of immune - 
30 related diseases and disorders, organ transplantation, 

myocardial infarction, cardiovascular disease, stroke, renal 
failure, oxidative stress - related neurodegenerative 
aisorders, and cancer. 
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Figure 3A 
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Figure 3B 
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Figure 4 



KHSl h 5 LRPAADILRRNPQQDYELVQRVGSGTYGDVVKARNVHTGELAAVKIIKLEPGDDFSLIQQ 64 

KHS2lh 1 MNPGFDLSRRNPQEDFELIQRIGSGTYGDVYKAKNV^m3ELAAIKVIKLEPGEDFAVVQQ 60 

KHSl_h 65 EIFMVKECKHCNIVAYFGSYLSREKLWICMEYCGGGSLQDIYHVTGPLSELQIAWCRE^ 124 

KHS2_h 61 EIIMMKTCKHPNIVAYFGSVXIUlDKLVnCMEFCGGGSWDIYHVTGPLSELQIAWSI^ 120 

KHSl 125 LQGIAYLHTKGKMHRDIKGANILLTDHGDVKIJ^FGVAAKITATIAKIUCSFIGTPY^ 184 

KHS2lh 121 I^LYYLHSKGKMHRDIKGANILLTDNGHVKlADFGVSAQITATIAKRKSFIGTPYWh^ 180 

KHSl_h 185 EVAAVEKNGGYNQLCDIWAVGITAIEIXSELQPPMFDLHPMRALFLMSKSNFQPPKLKDKT 244 

KHS2_h 181 ETv^AAVEMCGGVNQLCDLWAVGlTAIElAELQPPMFDLHPMRALFIOTKSNFQPPKI^ 240.. 

KHSl_)l 245 KWSSTFHNFVKIALTKNPIOaiPTAERLLTHTFVAQPGLSRAIAVELLDKVNNPDNHAH^ 304 

KHS2„h 241 KWSNSFHHFVKMALTKNPKKRPTAEKLLQHPFVTQH-LTRSIAIELUJKVNMPD-HSTYH 298 



t** • *** 



KHSl.h 305 EADDDDFEPHAIIRHTIRSTNRNARAERTASEINFDKLQFEPPLRKETEARDEMGLS 361 

KHS2Zh 299 DFDDDDPEPLVAVPHRIHSTSRNVREEKTRSEITFGQVKFDPPLRKETEPHHELPDSDGF 358 
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362 SDPNFMLQWNP FV D 375 
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376 GAOTGKSTSKRAIPPPLPPKPRISSYPED-NFPDEEKASTIKHCP — DSESRAPQILRRO 432 
419 DDESKHSTLKAKIPPPLPPKPKSIFIPQEMHSTEDENQGTIKRCPMSGSPAKPSQVPPRP 478 



KHSl h 433 SSPSCGPVAETSSIGNGDGISKL-MSENTEGSA QAPQLPRKNDKRDFPXPAIN 484 

KHS2lh 479 PPPR LPPHKPVALGNCMSSFQLNGERDGSLCQQQNEHRGTNLSRK-EKKDVPKPISN 534 

KHSl_h 4 85 GLPPTPKVIJ1GACFSKVFIXK:PLKINCATSWIHPDTKDQY1IFGTEDGIYTLNIJ«ELHEA 544 

KHS2_h 53 5 GLPPTPKVHMGACFSKVFNGCPLKIHCASSWINPDTRDQYLIFGAEEGIYTLNLNELHET 594 

KHSl h 545 TOEQLFPRKCTWLYVTNNTLMSLSEGKTFQLYSHNI.IALFEHAK-KPGLAAHIQTHRFPD 603 

KHS2Zh 59S SMEQLFPRRCTWLYVMNNCLLSIS-GKASQLYSHNLPGLFDYARQMQKLPVAIPAHKLPD 653 

KHSl_h 604 RILPRKFALTTKIPDTKGCHKCCIVRNPYTGHKYLCGALQSGIVLLQWYEPKQKFMLIKH 663 

KHS2_h 654 RILPRKFSVSAKIPETKWCQKCCVVRNPYTGHKYIX:GAIX)TSIVLLEWVEPMQKFMLIKH 713 

KHSl_h 664 FDFPLPSPLNVFEMLVIPEQEYPMVCVAISKGTESNQWQFETINLNSASSWFTEIGAGS 723 

KHS2_h 714 IDFPIPCPLRMFEMLVVPEQEYPLVCVGVSRGRDFNQVVRFETVNPNSTSSWFTE--SE>T 111 

KHSl_h 724 QQLDSIHVTQLERDTVLVCLDKFVKIVNLQGKLKSSKKIJ^ELSFDFRIESWCLQDSVL 7 83 

KHS2_h 772 PQTlWTHVTQLERDTILVCLIXrCIKIVNI^RLKSSRKLSSELTFDFQIESIVCLODST^ 831 

KHSl h 784 AFWKHGMQGKSFKSDEVTQEISDETRVFRLLGSDRVVVI.ESRPTENPTAHSNLYILAGHE 843 

KHS2Zh 832 AFVnCHGMQGRSFRSNEVTQEISDSTRIFRLLGSDRWVLESRPTDNPTANSNLYlLAGHE 891 
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KHS2.h 



844 NSY 846 
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Figure 5 
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Figure 6 



LOK_m 1 MAFANFRRILRLSTFEKRKSREVEHVRRDLDP^roVWEIVGELGDGAFGKVYKAKNKETGA 60 

GEK2.h 1 ^lAFANFRRILRLSTFEKRKSREYEH^mRDLDPNEVVJEIVGEI/3IX;AFGK^nrKAKNKETGA 60 



LOK_m 
GEK2_h 



61 IJiL^AKVIETKSEEELEDYIVEIEItATCDHPYIVKLLGAYYYDGKLWIMIEFCPGGAVDA 12 0 
61 lAAAKVIETKSEEELEDYIVEIEILATCDHPYIVKLLGAYYHDGKLVJIMIEFCPGGAVDA 120 



LOK_TO . 121 IMLELDRGLTEPQIQWCRQMLEAJLNFLHGKRIIHRDLKAGNVLMTLEGDIRLADFGVSA 18 0 
GEK2_b 121 IKLELDRGLTEPQIQWCRQMLEALNFLHSKRIIHRDLKAGNVLMTLEGDIRIADFGVSA 180 



GEK2_h 



181 KNLKTLQKRDSFIGTPYWMAPEVVLCETMKDAPYDYKADIWSLGITLIEMAQIEPPBffiEL 240 
181 KNLKTL0KRDSFIGTPYV7MAPEVVMCETMKDTPYDYKADIWSLGITLIEMAQIEPPKHEL 240 



LOK_m 
GEIC2„h 



241 NPMRVLLKIAKSDPPTLLTPSKWSVEFRDFLKIALDKNPETRPSAAQLLQHPFVSRVTSN 300 
241 NPMRVLLKIAKSDPPTLLTPSKWSVEFRDFLKIALDKNPETRPSAAQLLEHPFVSSITSN 300 



LOK_in . 301 KALRELVAEAKAEVMEEIEDGREDGEEEDAVDAVPPLVNHTQDSANVTQPSLDSNKLLQD 360 
GEK2_h 3-01 KALRELVAEAKAEVMEEIEDGRDEGEEEDAVDAASTLENHTQNSSEVSPPSLNADKPLEE 360 



LOK»m 361 S-STPLPPSQPQEPVNGPCSQPSGDGPLQTTSPADGLSKNDNDLKVPVPLRKSRPLSMDA 419 

qEK2_h 361 SPSTPLAPSQSQDSVNEPCSQPSGDRSLQTTSPPWAPGNENGLAVPVPLRKSRPVSMDA 420 



LOK_m 420 RIQMDEEKQIPDQDENPSPAASKSQKANQSRPNSSALETLGGEALTNGGLELPSSVTPSH 479 

GEK2_h 421 RIQVAQEKQVAEQGGDLSPAANRSQKASQSRPNSSALETLGGEKLANGSLEPPAQAAPGP 480 



LOK_Tn 
GEK2_h 



480 SKRASDCSNLSTSESMDYGTSLSADLSLNKETGSLSLKGSKLHNKTLKRTRRFVVDGVEV 539 

481 SKRDSDCSSLCTSES^mYGTNLSTDLSLNKEMGSLSIKDPKLYKKTLKRTRKFVVDGVEV 540 



LOK_m 540 SITTSKIISEDEKKDEEMRFLRRQELRELRLLQKEEHRNQTQLSSKHELQLEQMHKRFEQ 599 

GEK2_h 541 SITTSKIISEDEKKDEEMRFLRRQELRELRLLQKEEHRJNQTQLSNXHELQLEQMHKRFEQ 600 

******** * ******************** *** *** ********** ****** « 

LOK_;n 600 EINAKKKFYDVELENLERQQKQQVEKMEQDHSVRRKEEAKRIRLEQDRDYAKFQEQLKQM 659 

GEK2.h 601 EINAKKKFFDTELENLERQQKQQVEKMEQDHAVRRREEARRIRLEQDRDYTRFQEQLKLM 660 

LOK.m 660 KKEVKSE\rEKLPRQQRKESMKQKMEEHSQKKQRLDRDFVAKQKEDLELAMRXLTTErmRE 719 

GEK2_h 661 KKEVKNEVEKLPRQQRKESMKQKMEEHTQKKQLLDRDFVAKQKEDLEIAMKRLTTDNRRE 720 

LOK_m 720 ICDKERDCLSKKQELLRDREAALWEMEEHQLQERHQLVKQQLKDQYFLQRKDLLRKHEKE 77 9 

GEK2_h 721 ICDKERECLMKKQELLRDREAALWEMEEHQLQERHQLVKQQLKDQYFLQRHELLRKHEKE 7 80 

LOK_m 7 80 REQMQRYNQRMMEQLKVHQQQEKARLPKIQRSDGETRMAMYKKSLHIKGAGSASEQREKI 83 9 

GEK2„h 7 81 REQMQRYNQRMIEQLKVRQQOEKARLPKIQRSEGKTRMAMYKXSLHINGGGSAAEQREKI 840 



LOK_m 
GEK2_h 



84 0 KQFSQOEEKRQKAERLQ<:XK)KHEHQMHDM'/AQCESE^SELQQLQNEKCYLLVEHETOKLK 89 9 
6 41 KQFSQQEEKRQKSERLQQQQKHENQblRDMIJ^QCESNMSELQQLQNSKCHLLVEHETQKLK 900 



LOK_Tn 900 ALDESHNQSLKE 911 

GEK2_h 901 ALDESHNQNLKE 912. 
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FIGURE 8A 

SEQ ID NO: 5 STLK2 human Nterm=l-21 kin=22-274 

?a?ml^?2kglSylhsekkihrdikaanvllseqgdvkl^ 



rDFTKSFkEFIDACLNKDPSFRPTAKELLKHKFIVKNSKKTSYLTELIDRFKRWKAEGH^ 
DSIsnlE^DSES^iSrTHPEWSFTTVRKKPDPKKVQNGA^ 

?SelkIqdI^aI?SSieeleksiavaeaacpgitdkm^ 

SEQ ID NO: 6 STLK3 human Nterm=l-31 kin=32-308 
(-iV^v-rn-inq-dRQ ^ insert=327-352) tail=490-516 

?^lA;iSpAilLApSISLQAVGWPICRDAYELQEVIGSGATAWQAALCKPRQE^ 
A?^I^KCQ?S^ELLKEIQAMSQCSHPNVVTYYTSFVV^ 

i^^S^gehSg^^eeaiiatilkevlegldylhrngqihrdlkagnil^^^ 

D?GVSAF£ATGGDWRNKVRKTFVGTPCWmPEVM^^ 

?I?yh^^mk^nStlqndpptletgve^ 

^lkcSfqiS^eylieklltrtpdiaqrakk^ 

dekseeg^sqeksrrvkeenpeiavsastipeqiqslsv^^ 

?^^?SS^iRKELNDIRFEFTPGRDTADGVSQELFSAGLVDGHDVVIVAANLQKIVD 
DPKALKTLTFKIoASGCDGSEIPDEVKLIGFAQLSVS 

SEQ ID NO: 7 STLK4 human Nterm=absent , kin=l-178, 

J T _T 7Q_414 insertl=198-222 , insert2=253-293 
KSGVLDXSTIATILREVLEGLEYLHKXGQIHRDVKAGNILXGEDGSVQIADFGVSAFLAT 

GGDITRNKVRKTFVGTPCWMAPEWEQ 

^S^?2SSppsle?gvqdkemlkk^gksf 

SS^flSS^lI^ptiIerakkvrrvpgssgrlhktedggwewsddefde^^^ 

TSOrisPRVKESISNSELFPTTDPVGTLLQVPEQISAHLPQPAGQIATQPTQVSLPPTAE 
pSSTAlS^icSQETKIPISLVLRLRNSKKEI^ 

^gS2^vIa5£qkiveepqsnrsvtfkiasgvegsdipddgkligfa 

SEQ ID NO: 8 STLK5 human Nterm=absent , kin=l-222 (lacks 
2lcSFilDGMNElilAYILQGVLKAI.DYIHHMGYVHRS^ 

lsS^^hSSrq^^fpSIvkvlpwlspevlqqnlqgydaksdiysvgitacel^^ 

PFraMPATQMLLES^GTVPCLLDTSTIPAEELTMSPSRSVANSGLSDSLTTSTPRP 
DS^S^Y^RTPsiKFifflF^^ 

tpitnfegsqsqdhsgifglvtnleelevddwef 

SEQ ID NO: 13 ZCl human 1/5/98 Nterm=l-22 kin=23-289 
coiled-coil=290-526 pro=527-640 B=641-896 

^II^Il^^dlsslLpagifelvewgngtygqwkgrhvktgqiaai^^ 

deeeeikleinmlkic^shhrniatyygafikksppghddqlwlvmefcgagsito 

kgntlkedwiayisreilrglahlkihhvihrdikgqnvlltenaevklvdfgvsaqldr 

??gr^?igtpywmapeviacdenpdatydyrsdlws 
alfliprnppprlkskkwskkffsfiegclvknymqrpst^ 

S^^Sk^^^r^gISeteyeysgseeeeeevpeqegepssivnvpgestlrrdflrlq 
SeSS^rsealSqqLqeqqlreqeeykrqliaerqkrieqqkeqrrrleeqqrrerel^^ 
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FIGURE 8B 

™S2ooSo9^^^^^^^^"^^"^Q^^^Q^QLQQEQAYLLSLQHDHRRPHPQHSQQPP 
PPQQERSKPSFHAPEPKAHYEPADRAREVEDRFRKTNHSSPEAQSKQTGRVLEPPVPSRS 
ESFSNGNSESVHPALQRPAEPQVPVRTTSRSPVLSRRDSPLQGSGQQNSQAGQRNSTSli 
PRLLWERVEKLVPRPGSGSSSGSSNSGSQPGSHPGSQSGSGERFRVRSSSKSEGSPSORL 
ENAVKKPEDKKEVFRPLKPADLTALAKELRAVEDVRPPHKVTDYSSSSEESGTTDEEDDD 
VEQEGADESTSGPEDTRAASSLNLSNGETESVKTMIVHDDVESEPAMTPSKEGTLIVRRT 
QSASSTLQKHKSSSSFTPFIDPRLLQISPSSGTTVTSWGFSCDGMRPEAIRODPTRKGS 
yVNWPTNTRPQSDTPEIRKYKKRFNSEILCAALWGVNLLVGTESGLMLLDRSGQGKVYP 
L INRRRFQQMDVLEGLNVLVTI SGKKDKLRVYYLSWLRNKILHNDPEVEKKQGWTTVGDL 
EGCVHYKWKYERI KFLVIALKS SVEVYAWAPKPYHKFMAFKSFGELVHKPLLVDLTVEE 
GQRLKViyGSCAGFHAVDVDSGSVYDIYLPTHIQCSIKPHAIIILPNTDGMELLVCYEDE 

GVYVNTYGRITKDWLQWGEMPTSVAYIRSNQTMGWGEKAIEIRSVETGHLDGVFMHKRA 
QRLKFLCERNDKVFFASVRSGGSSQVYFMTLGRTSLLSW ^uk^v t I'lnif^ 

SEQ ID NO: 14 ZC2 human Nterm=missing kin=l-255 
coiled-coil=256-442 pro=443-626 B=627-954 Rab/RacBD=955-1297 



f^'-'-^^-'-a^o o=o^/-s»ofi KaD/KactiD=y55-1297 

SoSSiG5^°^^^^^°Q^^™^^^^^2^EIKQEINMLKKYSHHRNIATYYGAFI^ 
SS^°i^i^^^^^^^^^°^^°^^^K°NTLKEEWIAYICREILRGLSHLHQHKVIHRDI 

5™J'2^*^^^^^°^PPLCDMHPMRALFLIPRNPAPRLKSKKWSKKFQSFIESCLVK^ 
f2SS^J^*3^MKHPFIRDQPNERQVRIQLKDHIDRTKKKRGEKDETEYEYSGSEEEEEEND 
T?^o??J^n?S^SJ^^^^^^°^^'^^^^^^^QQLEQQQRENEEHKRQLLAERQKR 
Jo^S^SS-^^^^^^^^^^^QQ^^^Q^^^EEQMRREEERRRAEHEQEYKRKQLEEQ 
SS^5J^'^^'^^^°^^^^^^^Q"Q^Q^Q^P^^^PLYHYKEGMSPSEKPAWAKEVEERSRL 
???2f ^ SGVQPARTPPMLRPVDPQI PHLVAVKSQGC 

ALTASQSVHEQPTKGLSGFQEALNVTSHRVEMPRQNSDPTSENPPLPTRIEKFDRSSWLR 
S?c2JoS^!?^^^^^^^^^^^^°N^S^^P^^GSQPIRASNPDLRRTEPILESPLQ 
?5n?Sfocf??J^^^°^^^°°^^°^°^Q^^S^^^T^^^SKSEGSPVLPHEPAKVKPEE 
E??S??u5J^^J.^^°^°'^'^^^^^^^^^^^™^PM^^DYSSSSEESESSEEEE 

miretI? 



""^":it;;^fr^'^-^ ^-^v*^-^ ^^^-Li^-LtiAi-usNEgyiWGMVGTHGL^^ 
S,^.^^^^^^^^^P^^QTSPTDEDEEDEESSAAALFTGELLRQEQAKLijEARKISW 



SEQ ID NO: 15 ZC3 human kin=l-255 coiled-coil=256-47S 
pro=477-680 8=681-983 Rab/RacBD =984-1326 

SS?tSJSP^^^'^°^^^^^™^^°^2EEIKQEINMLKKYSHHRNIATYYGAFIKKSP 
PGtTODQLWLVMEFCGAGSVTDLVKNTKGNALKEDCIAYICREILRGIAH^ 

kgqnvlltenaevklvdfgvsaqldrtvgrrntfigtpywmapeviacd^ 

DIWSLGITAIEMAEGAPPLCD^ffiPMRALPT.TDDMDDDl?T.vcvt^.Tc.iCi;™™ 




^T^^SSi^^^'''^''''^^°'^^-^^^^Q^^Q^^QQ^^2DMQALRREEERRQAEREQEviRH 

Rleeeqrqleilqqqllqeqallleykrkqleeqrqserlqrqlqqehaylkslqq6qqq 
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FIGURE 8C 

QQLQKQQQQQLLPGDRKPLYHYGRGMNPADKPAWAREVEERTRMNKQQNSPLAKSKPGST 
GPEPPIPQASPGPPGPLSQTPPMQRPVEPQEGPHKSLVAHRVPLKPYAAPVPRSQSLQDQ 
PTRNLAAFPASHDPDPAIPAPTATPSARGAVIRQNSDPTSEGPGPSPNPPAWVRPDNEAP 
PKVPQRTSSIATALNTSGAGGSRPAQAVRARPRSNSAWQIYLQRRAERGTPKPPGPPAQP 
PGPPNASSNPDLRRSDPGWERSDSVLPASHGHLPQAGSLERNRVGVSSKPDSSPVLSPGN 
KAKPDDHRSRPGRPADFVLLKERTLiDEAPRPPKKAMDYSSSSEEVESSEDDEEEGEGGPA 
EGSRDTPGGRDGDTDSVSTMWHDVEE I TGTQPPYGGGTMWQRTPEEERNLLHADSNGY 
TNLPDWQPSHSPTENSKGQSPPSKDGSGDYQSRGLVKAPGKSSFTMFVDLGIYQPGGSG 
DS I P ITALVGGEGTRLDQLQYDVRKGSWNVNPTNTRAHSETPE IRKYKKRFNSE ILCAA 
LWGVNLLVGTENGLMLLDRSGQGKVYGLIGRRRFQQMDVLEGLN1.LITISGKRNKLRVYY 
LSWLRNKILHNDPEVEKKQGWTTVGDMEGCGHYRWKYERIKFLVIALKSSVEVYAWAPK 
PYHKFMAFKSFADLPHRPLLVDIiTVEEGQRLKVIYGSSAGFHAVDVDSGNSYDIYIPVHI 
QSQITPHAI IFLPNTDGMEMIiLCYEDEGVYVNTYGRI IKDWLQWGEMPTSVAYICSNQI 
MGWGEKAIEIRSVETGHLDGVFMHKRAQRLKFLCERNDKVFFASVRSGGSSQVYFMTLNR 
NRIMNVJ ' ' 

SEQ ID NO: 16 ZC4 human Nterm kin coiled-coil pro B=missing 
Rab/RacBD=l-349 

NVNPLYVSPACKKPLIHMYEKEFTSEICCGSLWGVNLLLGTRSNLYLMDRSGKADITKLI 
RRRPFRQIQVLEPLNLLITISGHKNRLRVYHLTWLRNKILNNDPESKRRQEEMLKTEEAC 
KAIDKLTGCEHFSVLQKEETTYIAIALKSSIHLYAWAPKSFDESTAIKVFPTLDHKPVTV 
DLAIGSEKRLKIFFSSADGYHLIDAESEVMSDVTLPKNPLEI I I PQNI I ILPDCLGIGMM 
LTFNAEAIjS VEANEQLFKKI LEMWKD I PS S I AFECTQRTTGWGQKAI EVRS LQSRVLE SE 
LKRRSIKKLRFLCTRGDKLFFTSTLRNHHSRVYFMTLGKLEELQSNYDV 

SEQ ID NO: 18 KHS2 human Nterm=l-13 kin=14-273 A=274-346 
Pro=347-534 RabBD =535-894 

MNPGFDLSRRNPQEDFELIQRIGSGTYGDVYKARNVNTGELAAIKVIKLEPGEDFAWQQ 
EIIMMKDCKHPNIVAYFGSYLRRDKLWICMEFCGGGSLQDIYHVTGPLSELQIAYVSRET 
LQGLYYLHS KGKMHRD I KGANI LLTDNGHVKLADFGVS AQ I TAT I AKRKS F I GT P YWMAP 
EVAAVERKGGYNQLCDLWAVGI TAI ELAELQPPMFDLHPMRALFLMTKSNFQPPKLKDKM 
KWSNSFHHFVKMALTKNPKKRPTAEKLLQHPFVTQHLTRSLAIELLDKVNNPDHSTYHDF 
DDDDPEPLVAVPHRIHSTSRNVREEKTRSEITFGQVKFDPPLRKETEPHHELPDSDGFLD 
SSEEIYYTARSNLDLQLEYGQGHQGGYFLGANKSLLKSVEEELHQRGHVAHLEDDEGDDD 
ESKHSTLKAKIPPPLPPKPKSIFIPQEMHSTEDENQGTIKRCPMSGSPAKPSQVPPRPPP 
PRLPPHKPVALGNGMSSFQLNGERDGSLCQQQNEHRGTNLSRKEKKDVPKPISNGLPPTP 
KVHMGACFSKVFNGCPLKIHCAS SWINPDTRDQYLI FGAEEGI YTLNLNELHETSMEQLF 
PRRCTWLYVMNNCLLS I SGKASQLYSHNLPGLFDYARQMQKLPVAIPAHKLPDRILPRKF 
SVSAKI PETKWCQKCC WRNPYTGHKYLCGALQTS I VLLEWVEPMQKFMLI KHIDFPI PC 
PLRMFEMLWPEQEYPLjVCVGVS RGRDFNQWRFETVNPNSTS SWFTESDTPQTNVTHVT 
QLERDTILVCLDCCIKIVNLQGRLKSSRKLSSELTFDFQIESIVCLQDSVLAFWKHGMQG 
RSFRSNEVTQEISDSTRIFRLLGSDRVWLESRPTDNPTANSNLYIIiAGHENSY 

SEQ ID NO: 22 SULUl human N=l-21 kin=22-277 A=278-427 

coiled-coill=428-637 

B=638-751 coiled-coil2=752-898 

MRKGVLKDPEI DDLF YKDDPEELF I GLHEI GHGS FGAVYFATNAHTNE WAI KKMS YS GK 
QTHEKWQDI LKEVKFLRQLKHPNTI EYKGCYIiKEHTAWIiVMEYCLGSASDLLEVKKKPLQ 
EVEIAAITHGALHGLAYLHSHALIHRDI KAGNI LLTEPGQVKLADFGSASMAS PANSFVG 
TPYWMAPEVIIlA^lDEGQYDGKVDIWSLGITCIELAERKPPLFNMNAMSALYHIAQNDSPT 
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FIGURE 8D 

LQSNEWTDSFRRFVDYCLQKIPQERPTSAELLRHDFVRRDRPLRVLIDLIQRTKDAVREL 

DNIiQYRKMKKILFQSTRNGPLNESQEDEEDSEHGTSLNREMDSLGSNHSIPSMSVSTC 

SSSVNSMQEVMDESSSELVMMHDDESTINSSSSVVHKKDHVFTRDEAGHGDPRPEPRP 

SVQSQALHYRNRERFATIKSASLVTRQIHEHEQENELREQMSGYKRMRRQHQKQIilALEN 

KLKAEMDEHRLKLQKE VETHANNS S I ELEKLAKKQVAI I EKEAKVAAADEKKFQQQ I IxAQ 

QKKDLTTFLESQKKQYKICKEKIKEEMNEDHSTPKKEKQERISKHKENLQHTQAEEEAHL 

LTQQRLYYDKNCRFFKRKIMIKRHEVEQQNIREELNKKRTQKEMEHAMLIRHDESTRE 

YRQLHTLQKLRMDL I RLQHQTELENQLE YNKRRERELHRKHVMGLRQQPKNLKAMEMQ I K 

KQFQDTCKVQTKQYKALKNHQLEVTPKNEHKTILKTLKDEQTRKI^ 

ASQALRLDEAQEAECQALiRLQLQQEMEIiLNAYQSKIKMQTEAQHERELQKLEQRVSLRRA 
HLEQKIEEELAALQKERSERIKNLLERQEREIETFDMESLRMGFGNLVTLDFPKEDYR 

SEQ ID NO: 2 3 SULU3 human Nterm=missing kin part ial = l -66 
A=67-215 coiled-coill=216-425 B=426-539 coiIed-coil2=540.-786 
Ctail=687-786' 

lEIiAERKPPLFNMNAMSALYHIAQNESPTLQSNEWSDYFRNFVDSCLQKIPQDRPTSEEL 
IiKH I FVLRERPETVL IDLI QRTKDAWELDl^QYRKMKKLLFQEAHNGPAV^ 
DHGVGRTGTVNS VGSNQS I PSMS I S AS SQS SS VNSLPDVSDDKSELDMMEGDHTVMSNS S 
VIHLKPEEENYREEGDPRTRASDPQSPPQVSRHKSHYRNREHFATIRTASLVTRQMQEHE 
QDSELREQMSGYKRMRRQHQKQLMTLENKLKAEMDEHRLRLDKI)LETQRNNFAAEMEK^ 
KJOTQAAMEKEAKVMSNEEKKFQQHIQAQQKKELNSFLESQK^ 

TPKKEKQEVJLSKQKENIQHFQAEEEANLLRRQRQYLELECRRFKRRMLLGRHNLEQDLVR 
EELNKRQTQKDLEHAMLLRQHESMQELEFRHLNTIQKMRCELIRLQHQ 
RERELRRKHVMEWQQPKSLKSKELQIKKQFQDTCKIQTRQYKALRNHLLETTPKSEHKA 
VTiKRIiKEEQTRKLiAI LAEQYDHS INEMLSTQALRLDEAQEAECQVLKMQLQQELELLNAY 
Q S KI KMQ AEAQHDRELRELEQRVS LRRALLEQKI EE EMLALQNERTER I RS LLERQARE I 
EAFDSESMRLGFSNMVLSNLSPEAFSHSYPGASGWSHNPTGGPGPHWGHPMGGPPQAWGH 
PMQGGPQPWGHPSGPMQGVPRGSSMGVRNSPQALRRTASGGRTEQGMSRSTSVTSQISNG 
SHMSYT 



SEQ ID NO: 24 SULU3 murine Nterm=l-25 kin=26-273 A=274-422 
ccl=423-632 B=633-748 cc2=missing 

MPSTNRAGSLKIDPEIAELFFKEDPEKLFTDLREIGHGSFGAVYFARDWTNEVVAIKKMS 

YSGKQSTEKWQDI I KEVKFLQRI KHPNS I EYKGCYLREHTAWLVMEYCLGSASDLLEVHK 

KPLQEVEIAAITHGALQGLAYLHSHTMIHRDIKAGNILLTEPGQVKLADFGSASMAS 

S FVGTPYWMAPEVI IiAMDEGQYDGKVD WSLGI TC I ELAERKPPLFNl^ 

ESPTLQSNMNDSCLQKIPQDRPTSEELLKHMFVLRERPETVLIDLIQRTKDAVRELDNLQ 

YRKMKKLLFQEAHNGPAVEAQEEEEEQDHGVGRTGTVNSVGSNQS I PSMS I SAS SQSS S V 

NSLPDASDDKSELDMMEGDHTVMSNSSVIHLKPEEENYQEEGDPRTRASDPQSPPQVSRH 

KSHYRNREHFATIRTASLVTRQMQEHEQDSELREQMSGYKRMRRQHQKQLMTLENKLKAE 

l^EHRLRLDKDLETQRNNFAAEMEKLIKKHQAAMEKEAKVMA^ 

NSFLESQKREYKLRKEQLKEEIJsrENQSTPKKEKQEWLSKQKENIQHFQAEEE^^ 

QYLELECRRFKRRMLLGRHNLEQDLVREELNKRQTQKDLEHAMLLRQHESMQELEFRHIx^ 

TIQKMRCELIRLQKQTELTNQLEYNKRRERELRRKHVMEVRQQPKSLKSKELQIKKQFQD 

TCKIQTRQYKALRNHLIjETTPKNEHKAI 



SEQ ID NO: 26 GEK2 human N=l-33 kin=34-294 A=295-337 
B=338-472 215 coiled-coill=473 -724 215 coiled-coil2=725-912 
MAFANFRRILRLSTFEKRKSREYEHVRRDLDPNEWEIVGELGDGAFGKVYKAKNKETGA 
LAAAKVIETKSEEELEDYIVSIEILATCDHPYIVKLLGAYYHDGKLWIMIEFCPGGAVDA 
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FIGURE 8E 

IMLELDRGLTEPQ I Q WCRQMLEALNFLHSKRI I HRDLKAGNVLMTLEGD I RIADFGVS A 

KNLKTLQKRDS F I GTPYWMAPEWMCETMKDTP YDY KADI WSLG I TLX EMAQIE PPHHEL 

NPMRVIiliKIAKSDPPTLLTPSKWSVEFRDFLKIALDKNPETRPSAAQIiLEHPFVSSITSN 

KALREIjVAEAKAEVMEEIEDGRDEGEEEDAVDAASTLENHTQNSSEVSPPSL.NADKPLEE 

SPSTPLAPSQSQDSVNEPCSQPSGDRSLQTTSPPWAPGNENGLAVPVPLRKSRPVSMDA 

RIQVAQEKQVAEQGGDLSPAANRSQKASQSRPNSSALETLGGEKLANGSLEPPAQAAPGP 

SKRDSDCSSLCTSESMDYGTNLSTDLSLNKEMGSLSIKDPKLYKKTLKRTRKFWDGVEV 

S ITTSKI I SEDEKKDEEMRFLRRQELRELRLLQKEEHRNQTQLSNKHELQLEQMHKRFEQ 

EINAKKKFFDTELENLERQQKQQVEKMEQDHAVRRREEARRIRLEQDRDYTRFQEQLKLM 

KKEVKNEVEKLPRQQRKESMKQKMEEHTQKKQLLDRDFVAKQKEDLELAMKRLTTDNRRE 

ICDKERECLMKKQELLRDREAALWEMEEHQLQERHQLVKQQLKDQYFLQRHELLRKHEKE 

REQMQRYNQRMIEQLKVRQQQEKARLPKIQRSEGKTRMAMYKKSLHINGGGSAAEQREKI 

KQFSQQEEKRQKSERLQQQQKHENQMRDMLAQCESNMSELQQLQNEKCHLLVEHETQKLK 



SEQ ID NO: 29 PAK4 human Rac=l-51 A=52-224 Nterm=225-393 
kin=3 94-658 Ctail=659-681 residues 13-23 (SAPQNFQHRVH) = 
Cdc42 /Rac-binding motif 

MFRKKKKKRPEISAPQNFQHRVHTSFDPKEGKFVGLPPQWQNILDTLRRPKPWDPSRIT 

RVQLQPMKTWRGSAMPVDGYISGLLNDIQKLSVISSNTLRGRSPTSRRRAQSLGLLGDE 

HWATDPDMYLQSPQSERTDPHGLYLSCNGGTPAGHKQMPWPEPQSPRVLPNGLAAKAQSL 

GPAEFQGASQRCLQLGACLQSSPPGASPPTGTNRHGMKAAKHGSEEARPQSCLVGSATGR 

PGGEGSPSPKTRESSLKRRLFRSMFLSTAATAPPSSSKPGPPPQSKPNSSFRPPQKDNPP 

SLVAKAQ SLP SDQPVGTFS PLTTSDTS S PQKSLRTAPATGQLPGRS S PAGS PRTWHAQ I S 

TSNLYLPQDPTVAKGAIiAGEDTGWTHEQFKAALRMWDQGDPRLLLDSYVKIGEGSTGI 

VCIAREI<HSGRQVAVKMMDLRKQQRRELLFNEVVIMRDYQHFNVVEMYKSYLVGEELWVL 

MEFLQGGAIiTDI VSQVRLNEEQ I ATVCEAVLQALAYLHAQGVIHRD I KSDS I LLTLDGRV 

KLSDFGFCAQISKDVPKRKSLVGTPYWMAPEVISRSLYATEVDIWSLGIMVIEMVDGEPP 

YFSDS PVQAMKRLRDS PPPKLKNSHKVS PVLRDFLERMLVRDPQERATAQELLDHPFLLO 
TGL.PECLVPLIQLYRKQTSTC 



SEQ ID NO: 30 PAK5 human Rac A=missing Nterm partial = l-114 
kin=115-379 Ctail=380-398 

ASGAKLAAGRPFNTYPRADTDHPSRGAQGEPHDVAPNGPSAGGLAIPQSSSSSSRPPTRA 
RGAPSPGVLGPHASEPQIoAPPACTPAAPAVPGPPGPRSPQREPQRVSHEQFRAALQLWD 
PGDPRSYLDNFIKIGEGSTGIVCIATVRSSGKLVAVKKMDLRKQQRRELLFNEWIMRDY 
QHENVVEMYNSYLVGDELWVVMEFLEGGALTDIVTHTRMNEEQIAAVCLAVLQALSVLHA 
QGVIHRDIKSDSILLTHDGRVKLSDFGFCAQVSKEVPRRKSLVGTPYWMAPELISRLPYG 
PEVDIWSLGIMVIEMVDGEPPYFNEPPLKAMKMIRDNLPPRLKNLHKVSPSLKGFLDRLL 
VRDPAQRATAAELLKHPFLAKAGPPASIVPLMRQNRTR 
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FIGURE 9 A 

SEQ ID NO: 1 STLK2 human 

TAACAGCCCACCTCCTAGCCCCGGGCTACGCGCCGCCAGCCCAGTAACCCCACTTTTGTG 
TGTCCTCCCAGGCCCCGATCGAAAAGCCTGGGAGGGCCGCCGAACTACCCCCGGAGGGAG 
GAGCCAGTCCGAACCCAAGGCGCCACCGCCGCAGAAGCGGAGCGAGGCAGCATTCGCCTC 
CATGGCCCACTCGCCGGTGGCTGTCCAAGTGCCTGGGATGCAGAATAACATAGCTGATCC 
AGAAGAACTGTTCACAAAATTAGAGCGCATTGGGAAAGGCTCATTTGGGGAAGTTTTCAA 
AGGAATTGATAACCGTACCCAGCAAGTCGTTGCTATTAAAATCATAGACCTTGAGGAAGC 
CGAAGATGAAATAGAAGACATTCAGCAAGAAATAACTGTCTTGAGTCAATGTGACAGCTC 
ATATGTAACAAAATACTATGGGTCATATTTAAAGGGGTCTAAATTATGGATAATAATGGA 
ATACCTGGGCGGTGGTTCAGCACTGGATCTTCTTCGAGCTGGTCCATTTGATGAGTTCCA 
GATTGCTACCATGCTAAAGGAAATTTTAAAAGGTCTGGACTATCTGCATTCAGAAAAGAA 
AATTCACCGAGACATAAAAGCTGCCAATGTCTTGCTCTCAGAACAAGGAGATGTTAAACT 
TGCTGATTTTGGAGTTGCTGGTCAGCTGACAGATACACAGATTAAAAGAAATACCTTTGT 
GGGAACTCCATTTTGGATGGCTCCTGAAGTTATTCAACAGTCAGCTTATGACTCAATUSlGC 
TGACATTTGGTCATTGGGAATTACTGCTATTGAACTAGCCAAGGGAGAGCCACCTAACTC 

cgatatgcatccaatgagagttctgtttcttattcccaaaaacaatcctccaactcttgt 
tggagactttactaagtcttttaaggagtttattgatgcttgcctgaacaaagatccatc 
atttcgtcctacagcaaaagaacttctgaaacacaaattcattgtaaaaaattcaaagaa 

GACTTCTTATCTGACTGAACTGATAGATCGTTTTAAGAGATGGAAGGCAGAAGGACACAG 

tgatgatgaatctgattccgagggctctgattcggaatctaccagcagggaaaacaatac 
tcatcctgaatggagctttaccaccgtacgaaagaagcctgatccaaagaaagtacagaa 
tggggcagagcaagatcttgtgcaaaccctgagttgtttgtctatgataatcacacctgc 

ATTTGCTGAACTTAAACAGCAGGACGAGAATAACGCTAGCAGGAATCAGGCGATTGAAGA 

actcgagaaaagtattgctgtggctgaagccgcctgtcccggcatcacagataaaatggt 
gaagaaactaattgaaaaatttcaaaagtgttcagcagacgaatccccctaagaaactta 
ttattggcttctgtttcatatggacccagagagccccaccaaacctacgtcaagattaac 
aatgcttaacccatgagctccatgtgccttttggatctttgcaacactgaagatttggaa 
gaagctattaaactattttgtgatggcgtttatcattttatattttgaaaggattatttt 
gtaaggaataacttttaatactatagtttcacctgtattctagtaaatgttgagacaccg 
ttttgcttttaagtatccctatttcttaagttacgaggatgaatacctttcacattttga 
tctttagttgactctacagtcatgaaacatacaggtctttcaaagtcattctcaatattc 
agcttttgtaaattatcaagcttcaaaaagcttttttttaaaaaaaaaaacatgcatatt 
ctaaaaatgactattggtggggaggtgtaaataagtcataccttcttaaaacagaaaatt 
taagtaaagtcttttaaatgaaacctgtaaaagtattgactcttctaccaagttggtatg 

ATATTCCAGGCAGCTCAATGATTATCACATTTGAGACCCTGTGTTTGAAGCATTTACAGG 

caatgtacagcaacagaggtacctcttggtgtatagtatttacattctcttttaggtaga 
agaggcaattttacccttatttcacatggttagaaatttaaagcaagatcatttacccaa 
ggataggtgtttggtaatgttgaaggagttagtctggcttcatgttttacatcttcaact 
aaaatcccatactatctgcttggatttggagagccaaaaaataaagctgattgtcatgtg 
attaaatatctgatcaacaggtatgaatataacttaaatcagcatatttttgccatggta 
ataaattgtcctataaactatttatatatttttgttcttcataattatcactaataagca 
tcagtttgttgtttttaaaaggatatttaagtgagcattttctagttcatatgaaaataa 
ccatagtacaggatgatttctgtccacacaaaggttaaattagattgcacagttaatttt 
cacttatatttatggtactattatgtgggtgatgcctttttcttttaagcccagtacata 



wo 99/53036 



PCT/US99/08150 



15/43 



FIGURE 9B 

TATTATGCCTGCCTAAGTTCTGAACTGGGGCTGTATTTCAGTAGTTGTAGAATTATTGAT 

ATTTAGTTTTGATAGCTAATGTTTAATTGTTTGGATCTGCACAGTTTGGTTTTTGCACAA 

AAGTCATTTAAAAAAATCTGAGTAATTGTCAAATATTAAAAGAAAGATATTCTTCCTGTA 

AGGAATACAGTTTTTAGTCAAAGTGGCCATTACATCCTCTTTTTAATTTACATAATACAG 

ATACTTGAGAAAGTTGTTGTGGTGTTGTATGCCAAGAAAATTCTTTTTATTGGTGCCTAT 

ATTGTAACAATTATTTTTAATGCATTGTATTTTGTkAGTAACGGTTCAGTTAAATTTTTCA 

CCTGCTGTGTAACTGAAACACAATTACAGTTTATAATCATCTGTAGAAGTCTGGAGATAA 

TTTTGCAACTCATGTTATGGGTTAAATGAATATTTTTGTAAAAGTAAAAGCAACAAATTT 

ATAAATTGATTATTTGAAACTTTACAACACAATTGCATCCCAAATACAAATTGTATTGCT 

TATTCATTATAGCTATTCGTCCTGTAATCTGTTTCTAGGTGAAGCATACTCCAGTGTTTT 

AGGGGTTTTGAAAATAAATATTTAAATTTCACAGTCAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAT^AAAAAAA 

SEQ ID NO: 2 STLK3 human 

GACAGCAGCGCCGGCCCCGGCAGCTCCCGCGGCCCCGGCCCCGGCCCCGGCCCCGGCCCC 

GGCGGCACAGGCTGTCGGCTGGCCCATCTGCAGGGACGCGTACGAGCTGCAGGAGGTTAT 

CGGCAGTGGAGCTACTGCTGTGGTTCAGGCAGCCCTATGCAAACCCAGGCAAGAACGTGT 

AGCT^TAAAACGGATCAACTTGGAAAAATGCCAGACCAGTATGGATGAACTATTAAAAGA 

AATTCAAGCCATGAGTCAGTGCAGCCATCCCAACGTAGTGACCTATTACACCTCTTTTGT 

GGTCAAAGATGAACTTTGGCTGGTCATGAAATTACTAAGTGGAGGTTCAATGTTGGATAT 

CATAAAATACATTGTCAACCGAGGAGAACACAAGAATGGAGTTCTGGAAGAGGCAATAAT 

AGCAACAATTCTTAAAGAGGTTTTGGAAGGCTTAGACTATCTACACAGAAACGGTCAGAT 

TCACAGGGATTTGAAAGCTGGTAATATTCTTCTGGGTGAGGATGGTTCAGTACAAATAGC 

AGATTTTGGGGTAAGTGCGTTCCTAGCAACAGGGGGTGATGTTACCCGAAATAAAGTAAG 

AAAAACATTCGTTGGCACCCCATGTTGGATGGCTCCTGAAGTCATGGAACAGGTGAGAGG 

CTATGACTTCAAGGCTGACATGTGGAGTTTTGGAATAACTGCCATTGAATTAGCAACAGG 

AGCAGCGCCTTATCACAAATATCCTCCCATGAAAGTGTTAATGTTGACTTTGCAAAATGA 

TCCACCCACTTTGGAAACAGGGGTAGAGGATAAAGAAATGATGAAAAAGTACGGCAAGTC 

CTTTAGAAAATTACTTTCACTGTGTCTTCAGAAAGATCCTTCCAAAAGGCCCACAGCAGC 

AGAACTTTTAAAATGCAAATTCTTCCAGAAAGCCAAGAACAGAGAGTACCTGATTGAGAA 

GCTGCTTACAAGAACACCAGACATAGCCCAAAGAGCCAAAAAGGTAAGAAGAGTTCCTGG 

GTCAAGTGGTCACCTTCATAAAACCGAAGACGGGGACTGGGAGTGGAGTGACGACGAGAT 

GGATGAGAAGAGCGAAGAAGGGAAAGCAGCTTTTTCTCAGGAAAAGTCACGAAGAGTAAA 

AGAAGAAAATCCAGAGATTGCAGTGAGTGCCAGCACCATCCCCGAACAAATACAGTCCCT 

CTCTGTGCACGACTCTCAGGGCCCACCCAATGCTAATGAAGACTACAGAGAAGCTTCTTC 

T.TGTGCCGTGAACCTCGTTTTGAGATTAAGAAACTCCAGAAAGGAACTTAATGACATACG 

ATTTGAGTTTACTCCAGGAAGAGATACAGCAGATGGTGTATCTCAGGAGCTCTTCTCTGC 

TGGCTTGGTGGATGGTCACGATGTAGTTATAGTGGCTGCTAATTTACAGAAGATTGTAGA 

TGATCCCAAAGCTTTAAAAACATTGACATTTAAGTTGGCTTCTGGCTGTGATGGGTCGGA 

GATTCCTGATGAAGTGAAGCTGATTGGGTTTGCTCAGTTGAGTGTCAGCTGATGTATGTC 

CCTTGATGTCACCCTGATCTGTCATGCCCCACCGCCACCCCTACTCCCTTCAACCCTCCC 

TCTTTCTGCCCATTTCCTCCCACCCCCTCACTCCCATTTCCTAGCAAAATCAGAAGATTG 

TGAAGAGGCCGGCTTCAACAAAATGGGATAAAAAAATAATTTTTTAAAACTTACAACACT 

CCGAGTTCTGCTTTATTCTCTAGCAATCCACAGTACAAGAACAAGCAAATGCCACAGCTG 



3NSDOCID: <WO 9953036A2J_> 



wo 99/53036 



PCT/US99/08I50 



16/43 



FIGURE 9C 

CACGACTGTTGCTCATTTTTCCAAAAGCTATTTAATATTCTTAGCAATCAATTTGGATAT 

CCCTTAAGTGAAAAGAATCTGAAATACACTCAGGTGGTCTTATTTATTGGCAACAAAAGG 

AATTTTCTATCCAGAAGCCTATTTCTCCTTTCATTGTTGTTATTTCTGTTATAATACTTT 

AATTGTACATCTGACAATACTGCCTCTTTTATGTTGTATTTAGAAATTAATATACTTATA 

AAATTAAGATTTATTAGCCAAACTTGAATTCTAGTTTTAAAACTGACTGTGAATTTTATT 

TTTCATATATTTATGCATTACACACCTTAGCTATAAGAAAAAAAGGGTTTTGATTATATG 

CTTCTTGCAGTTAATCTCGTTATTTAAACAAAAAGTTTTGGGTCTATCTTTGGAGTATTT 

GTAACTTCTAAATTTTGAAATGACTGAATTAGGAATTTGGATGCTTATTCTTTTAGTCTG 

TTTGCCTAAAAACCAATTTACAATCTGACTGTCTCTTGGGAGAGGGAGGTGCCTTGCAAA 

CTTTCACATTAAGAATGTGCCTGAGGCTGCTTTACTCTGGAATAGTCTCAGATCTAAAAT 

TTCCTCTATATAAGGTGGCATATGTTAAGTTTTGCTTCATTGGACCGTTTAGAATGCTAT 

GTAAAATGTTGCCATTCTGTTAGATTGCTAACTATATACCCATCTCTGATTTGGCTCTCC 

TTAAGTGATAGGATTTGTTATTCTAAAGGTGATAAACTTGAAAATATCAGAATCTGAGTT 

TTACTTGAAATTTTGCAGAATACCCAGGTGGAGTGAAAATTGGAAGGOT^ 

actaaaaggtaaaacgctgttaaggttcaagaatcaatactttcaacccSg^^^ 

ISSJ??''^^°^^^^'^^A'^°°^CTAGTAAACCTTAGGATTTTG 

tttcaaggaggtgggctcccaggatggtaccattgctctttcciagctS^^ 

SEQ ID NO: 3 STLK4 human 

caaaagtggagtcctagatgantctaccattgctacgatactccgagaagtactggaagg 
gctggaatatctgcataaaantggacagatccacagagatgtgaaagctggaaacattct 
tnttggagaagatggctcagtacagatttcagactttggggttagtgcttttttagcaac 
tggtggtgatattacccgaaataaagtgagaaagacctttgttggcaccccttgttggat 
ggcacctgaagttatggaacaggtccgtggttatgatttcaaagctgatatttggagttt 
tggaattacagcaattgaattggctacaggggcggctccttatcataaatatccaccaat 
gaaggttttaatgctgacactgcagaacgatcctccttctttggaaactggtgttcaaga 
taaagaaatgctgaaaaaatatggaaaatcatttagaaaaatgatttcattgtgccttca 
aaaagatccagaaaaaagaccaacagcagcagaactattaaggcacaaatttttccagaa 

aagagcaaaaaaggttcggagagtaccaggttccagtgggcgtcttcataagacagagga 

tggaggctgggagtggagtgatgatgaatttgatgaagaaagtSgSI^gJSg^^^^ 

aatttcacaactcaggtctccccgagtgaaagaatcaatatcaaattc^gag^^ 

aacaactgatcctgtgggtactttgctccaagttccagaacagatctctgctcatctacc 

tcagccagctgggcagattgctacacagccaactcaagtctctctcccacccaccgcaga 

?^S^?S^^^^^°^'^^'^'^'='^^^°^'^^^^°°atcaggttcacaag 

aatcagtctagtactaagattaaggaattccaaaaaagaactaaatgatattcgatttga 
atttactcctgggagagatacagcagagggtgtctctcaggaactcatttctgci^^^^ 

S^r^S^^°°^°°GATTTAGTAATAGTGGCAGCTAATTTGCAGAAAATTGTGG^^ 

TCAGTCAAATCGATCTGTCACTTTCAAACTGGCATCTGGTGTCGAAGGCTC;^GATAI^^^ 

TGATGATGGTAAACTGATAGGATTTGCCCAGCTCAGCATCAGCTAAAcSS^CCC^^^^ 
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FIGURE 9D 

AGAGGCGGCCTAAGGAGATTCCACACATGeGTATCTCTGTTGCTTCTATTGGCCTAAACC 

CACTACTGCCAAAGAACCCAGCAACAAACCTCCCGGCTAGGAGCTTTAGAAGTCTTTATG 

TTCTTCCTGCCATCATTCCTCCTTTTCCCACAGGGAAAGAAAAGTTGGATCACTAGTGGC 

CAGCATCCCCAGAGTTCCGTTAGTAAACTTACTTCATATGTCCCCTGTCTTCCTCCATCT 

GAGAAGTGGCCCATGTGCTTCAAGGCCCAGGAGGGAGATCTGTCAGCTCATTCTTGCCTT 

ACTCCAATGATGGCCCAGGTGGAAAAGTAGCAGCTGTATCGGGCTTCCTCATCCTGCCTG 

TTCCCCCACACCTGCCAGGATATGGACATCTTGGGATATCTCTTTACCACTGAAGTAGAA 

TTGATTGTTCAGCTGGAGCCCAGAGAATTTAATTTAATGTTTTTTCTTTGTACCTGATGT 

GAATTCTAGCAACCTTTGTTAGGAAAAAGCACAGCCTCAGATGGAGGCAGCCTAAACTGT 

GTTCTTGTTTTGTTCATGGTGTTTCTAAGCGTTTTGCTGAAGCTGCTCTCAGGCACCCCC 

TTCTTCATTGCTCTCTCCAGAAAGGGTTGCTAGCCTTAACTTCAGCTGGTGCAAAACATC 

TGACTGTAGCCGAACTTCAGCCATCAGATCCTTCAAAGTGGAACTTTGGATTGTTTTTAC 

AGACAACATCGAGTAATGGCTTGTAAATGTGT^TTTTGCCAGAGGTGGTTTTTGAACAGG 

AAAATCATAATTCATATCATTGGAGAAGTATTTATTTTCAAATATCAAATTGAAGAAAAA 

CTCAATCCTCCCATGAAAATCAGTTCGCCTGGCCTCCAAGTCGTGAGGAAATGGGTATGC 

AAGGCTGAGATTTCTACAGCAATAAAGGAGACACACACTGGGCCAGAGAGGCCTGCCTTC 

TGCCTGCTCTCCTGCACTGACCCTTTGGAGGGGGTCTCTGTGTGCTGAAGCTAACTCAAG 

ATGGAAAGTGAAACCACATGTGCCGTGACCTTTAGGTTTTATGAGTAGACAGTGTTCATT 

TGATTTTCTACAGAAATAATATAAATTATTCTTTAGGTTTAAAAAAGAGCACTCATAATG 

CAATATGTGAATAATCAGTGAGGTTGATTTTTCTTTTTTCCTACCGTTTCATAGTCTTTG 

TCTAACTGCTAGTAACCCTACCGAGTTTTATATATGAGTGGGATACTCAATCTGGCCTTA 

AAAAGATACACAAAGATGGGCTGTGGGTCCCTGGAAAGGGGGAGAGTTGCCCTTTACAGA 

ATCACTCGAGCCCTTTCCAGCACTGTTGGTCTGATGAACAAGGTTGTTTTACCTTATTTT 

CTCTTGGAACATATCTGAAAACCTTCCCCACAAATAACTTGTCACACCTTTTGTTTCATT 

CTGAGTCTTTAGTTTTAGTCATGGGCTTTCTTCACCTGCTCTAGGTGCAAAGGCATGTTG 

GGAAAGAGATGGATGTTGGGGAGGAAGAGAGGAGATGGATTTCAGTTGGGAGTTAGGAGG 

AGAGTAGGTGAGATGATCAGACACCGGAGTTCAACGTCCCAGCAGTCTTGGTAAAAGGAG 

GGAGCCTGCTGAGCCAGGAGGGAGAAAAGAAGATTGACCAGCTTGCTAGAAAAATACTTA 

GCTTTTCTTTTTCTTTTTTTGTGGAGGGGGGACGGAGAGGAACAAGGATGGGGAGGTAGG 

AATGAGGTATAGAAAAGAGATAGCATCTTCTTTGGCACAAGACTAGTGGCTTACCGCTTA 

CCTTAGAGTTTTGTTTTTTTTTTTTCAAACCCATCAAAATCTACTTATTTATGAATCCAA 

GGGGTGGCAGCATCACTCTGTTCTAGCATTCTTTGTGGAGATGGTCTGGTGCCTAGCTGG 

GAGTGAGCAGCAGCCCATCCCCTGTTCACTTTCTCTAGCCCATCATTACCTGTGAACTGC 

AGTGGGGCAGTCATGGCAAATAGAATTGGGCTGGGGTTTCTCCTTCTTTTCAGTTCATTG 

TTTGCCCTGCTAGGAATTAGAAGACAGACACCATGTCCCAGGACAGTGTTACTTCTTCTG 

CATGATGTGTGGTAGACTCCCTTTGCTGGCTTGTGCAGTGATACTGAGAAAATACATGAA 

CAGAAACTGCCCAGGTGGAACAGCACGTAACCTAGTGAGTGACTGTACTCCTTTCTAGGA 

ATGCTGATTCAGAGTGCACCTCTTTGACTAGGTCCCAGGATCCCCTTGTCCCTGGAGTAG 

GGACTAACTATAGCACAAAGTAATATGTGCCAATGCTATTTGTGAAATGTTTGGTCTTTC 

TAAACGACTAAAGGATTTGTTGGGTTTTTGCTTAAGTTTTGAACCAAATCCTAGAGCCAG 

CTGATAATATTTAATAATCTGGAGGAGAGAATAATGATGTACCAATAAGTGGAGATTCCT 

CCTTATGATGTATGCTAGGTTATGGAAGATGTAAAATATTCAACTTTTTCCTCCTTTTTT 

TGGACTTTGTATTTTACTGCATGTTTTCTTCATTTTTAATCAATAAAGAGTAAATTGTCA 

AAAAAAAAAAAAAAAAA 
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FIGURE . 9E 

SEQ ID NO: 4 STLK5 human 

CTCATCTGTACACACTTCATGGATGGCATGAATGAGCTGGCGATTGCTTACATCCTGCAG 
GGGGTGCTGAAGGCCCTCGACTACATCCACCACATGGGATATGTACACAGGAGTGTrA^^ 

gccagccacatcctgatctctgtggatgggaaggtctacctgtctS^^tg^^ 

CTCAGCATGATAAGCCATGGGCAGCGGCAGCGAGTGGTCCACG^^^^ 

gtcaaggttctgccgtggctcagccccgaggtcgtccagcagaatctcSg^tS^^^ 

GCCAAGTCTGACATCTACAGTGTGGGAATCACAGCCTGTGAACTGGcS^^^ 
™Iii?^^''^^^^^^^^^^C^^ATGCTGCTAGAGAAACTGAA^ 



TGCCTGTTGGATACCAGCACCATCCCCGCTGAGGAGCTGACCATGAGCCCTTC^^^ 
GTGGCCAACTCTGGCCTGAGTGACAGCCTGACCACCAGCACCCCCC^^^^ 

GACTCGCCCTCCCACCCCTACCACCGAACCTTCTCCCCCCACTTcScSS??^^ 



3tctcaggaccacagtggj! 
:gattgggagttctgagcc 
:acccagaggcccttcctg 

gactgattgggaaagaaaatcctggagaga'?^c?tSc?S?c^^^ 



" - itctctgcaaact 



?5S?'^^''^^^^°^^^^°°a^^cagaggccacccagaggcccttcotS^^^ 



°I^5^??^°°^^a^ctggaggtggacgattgggagttctgagcctctgcaaactgt 

3GCCAC 
VCTGCT 
^GACAC 



AAGGGAATCTCAACAACCT^GGGATCASGAGGGTCC^Gj^aiJ^C^^^^ 



TCCA 



SEQ ID NO: 9 2C1 human 

jATGTC 
rCTCAT 

:atgat 
jtgaag 

lTCCTG 
AGAAT 
AGCTG 
CTGAG 

TGTGGCATTACAGCCATTGAGATGGCAGA^GGTOcic?™'^°'^°^'^''""=°^=^ 



S^^^S^S^!5°™!5FF!^????^=*°™°=*OCaTCAAAGTTA?G^^^ 

:tcat 

^TGAT 

:gaag 

'CCTG 
lGAAT 
.GCTG 

GTCa.TCGCCTGTGATGAGAACai*GATGCCACCTAT^^^ 



2^™?^?^™^^^TATTATGGTGCTTTCATCAAAAAGAGCCCTCCAGGACATGAT 

:CATTACAGAC( 

GTOTXGCTGAC.^AG^iG^;aGi;G^C^^^^^ 



GACCAACTCTGGCTTGTTATGGAGTTCTGTGGGGCTGGGTCCATO^^^^ 
AACACCAAAGGGAACACACTCAAAGAAGACTGGATCGCTTACATC?SAr»^ri?^S^^ 



TCGAAGAAGTTTTTTAGTTTTATAGAAGGGTGCc5SSS?^S?S 



iiiMi'i I mil "It I 
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FIGURE 9F 

TCTACAGAGCAGCTTTTGAAACATCCTTTTATAAGGGATCAGCCAAATGAAAGGCAAGTT 

AGAATCCAGCTTAAGGATCATATAGATCGTACCAGGAAGAAGAGAGGCGAGAAAGATGAA 

ACTGAGTATGAGTACAGTGGGAGTGAGGAAGAAGAGGAGGAAGTGCCTGAACAGGAAGGA 

GAGCCAAGTTCCATTGTGAACGTGCCTGGTGAGTCTACTCTTCGCCGAGATTTCCTGAGA 

CTGCAGCAGGAGAACAAGGAACGTTCCGAGGCTCTTCGGAGACAACAGTTACTACAGGAG 

CAACAGCTCCGGGAGCAGGAAGAATATAAAAGGCAACTGCTGGCAGAGAGACAGAAGCGG 

ATTGAGCAGCAGAAAGAACAGAGGCGACGGCTAGAAGAGCAACAAAGGAGAGAGCGGGAA 

GCTAGAAGGCAGCAGGAACGTGAACAGCGAAGGAGAGAACAAGAAGAAAAGAGGCGTCTA 

GAGGAGTTGGAGAGAAGGCGCAAAGAAGAAGAGGAGAGGAGACGGGCAGAAGAAGAAAAG 

AGGAGAGTTGAAAGAGAACAGGAGTATATCAGGCGACAGCTAGAAGAGGAGCAGCGGCAC 

TTGGAAGTCCTTCAGCAGCAGCTGCTCCAGGAGCAGGCCATGTTACTGGAGTGCCGATGG 

CGGGAGATGGAGGAGCACCGGCAGGCAGAGAGGCTCCAGAGGCAGTTGCAACAAGAACAA 

GCATATCTCCTGTCTCTACAGCATGACCATAGGAGGCCGCACCCGCAGCACTCGCAGCAG 

CCGCCACCACCGCAGCAGGAAAGGAGCAAGCCAAGCTTCCATGCTCCCGAGCCCAAAGCC 

CACTACGAGCCTGCTGACCGAGCGCGAGAGGTGGAAGATAGATTTAGGAAAACTAACCAC 

AGCTCCCCTGAAGCCCAGTCTAAGCAGACAGGCAGAGTATTGGAGCCACCAGTGCCTTCC 

CGATCAGAGTCTTTTTCCAATGGCAACTCCGAGTCTGTGCATCCCGCCCTGCAGAGACCA 

GCGGAGCCACAGGTTCCTGTGAGAACAACATCTCGCTCCCCTGTTCTGTCCCGTCGAGAT 

TCCCCACTGCAGGGCAGTGGGCAGCAGAATAGCCAGGCAGGACAGAGAAACTCCACC^ 

ATTGAGCCCAGGCTTCTGTGGGAGAGAGTGGAGAAGCTGGTGCCCAGACCTGGCAGTGGC 

AGCTCCTCAGGGTCCAGCAACTCAGGATCCCAGCCCGGGTCTCACCCTGGGTCTCAGAGT 

GGCTCCGGGGAACGCTTCAGAGTGAGATCATCATCCAAGTCTGAAGGCTCTCCATCTCAG 

CGCCTGGAAAATGCAGTGAAAAAACCTGAAGATAAAAAGGAAGTTTTCAGACCCCTCAAG 

CCTGCTGATCTGACCGCAGTGGCCAAAGAGCTTCGAGCAGTGGAAGATGTACGGCCACCT 

CACAAAGTAACGGACTACTCCTCATCCAGTGAGGAGTCGGGGACGACGGATGAGGAGGAC 

GACGATGTGGAGCAGGAAGGGGCTGACGAGTCCACCTCAGGACCAGAGGACACCAGAGCA 

GCGTCATCTCTGAATTTGAGCAATGGTGAAACGGAATCTGTGAAAACCATGATTGTCCAT 

GATGATGTAGAAAGTGAGCCGGCCATGACCCCATCCAAGGAGGGCACTCTAATCGTCCGC 

CGGACTCAGTCCGCTAGTAGCACACTCCAGAAACACAAATCTTCCTCCTCCTTTACACCT 

TTTATAGACCCCAGATTACTACAGATTTCTCCATCTAGCGGAACAACAGTGACATCTGTG 

GTGGGATTTTCCTGTGATGGGATGAGACCAGAAGCCATAAGGCAAGATCCTACCCGGAAA 

GGCTCAGTGGTCAATGTGAATCCTACCAACACTAGGCCACAGAGTGACACCCCGGAGATT 

CGTAAATACAAGAAGAGGTTTAACTCTGAGATTCTGTGTGCTGCCTTATGGGGAGTGAAT 

TTGCTAGTGGGTACAGAGAGTGGCCTGATGCTGCTGGACAGAAGTGGCCAAGGGAAGGTC 

TATCCTCTTATCAACCGAAGACGATTTCAACAAATGGACGTACTTGAGGGCTTGAATGTC 

TTGGTGACAATATCTGGCAAAAAGGATAAGTTACGTGTCTACTATTTGTCCTGGTTAAGA 

AATAAAATACTTCACAATGATCCAGAAGTTGAGAAGAAGCAGGGATGGACAACCGTAGGG 

GATTTGGAAGGATGTGTACATTATAAAGTTGTAAAATATGAAAGAATCAAATTTCTGGTG 

ATTGCTTTGAAGAGTTCTGTGGAAGTCTATGCGTGGGCACCAAAGCCATATCACAAATTT 

ATGGCCTTTAAGTCATTTGGAGAATTGGTACATAAGCCATTACTGGTGGATCTCACTGTT 

GAGGAAGGCCAGAGGTTGAAAGTGATCTATGGATCCTGTGCTGGATTCCATGCTGTTGAT 

GTGGATTCAGGATCAGTCTATGACATTTATCTACCAACACATATCCAGTGTAGCATCAAA 

CCCCATGCAATCATCATCCTCCCCAATACAGATGGAATGGAGCTTCTGGTGTGCTATGAA 

GATGAGGGGGTTTATGTAAACACATATGGAAGGATCACCAAGGATGTAGTTCTACAGTGG 

GGAGAGATGCCTACATCAGTAGCATATATTCGATCCAATCAGACAATGGGCTGGGGAGAG 



BNSOOCIO: <WO 9953036A2J_> 



wo 99/53036 



PCT/US99/08I50 



20/43 



FIGURE 9G 

CGGTCTGGTGGCAGCAGTCAGGTTTATTTCATGACCTTAGGCAGGacSSc???????? 
??iG^?^?g??S?IS^?='^^°°=^^^*"°===^==^°°°""^^ 

SEQ ID NO: 10 ZC2 human 

GCTTTCGGGGAGGTCTATGAGGGTCGTCATGTCAAAACGGGCCAGCTTGCAGCCATCAAG 

AAATATTCTCATCACCGGAATATTGCTACATACTATGGTGCTTTTATCAAAAAGAACCCA 
CCAGGCATGGATGACCAACTTTGGTTGGTGATGGAGTTTTGTGGTGCTGGCTCTGTCACC 
GACCTGATCAAGAACACAAAAGGTAACACGTTGAAAGAGGAGTGGATTGCATACATCTGC 
AGGGAAATCTTAGGGGGGCTGAGTCACCTGCACCAGCATAAAGTGATTCATCGAGATA^^ 
AAAGGGCAAAATGTCTTGCTGACTGAAAATGCAGAAGTTAAACTAGTGGACTTTGGAGTC 
AGTGCTCAGCTTGATCGAACAGTGGGCAGGAGGAATACTTTCATTGGAACTCCCTACTGG 

atggcaccagaagttattgcctgtgatgaaaacccagatgccacatatgatttS^g^^^ 

GACTTGTGGTCTTTGGGTATCACCGCCATTGAAATGGCAGAAGGTGCTCCCCCTCTCTGT 

gacatgcaccccatgagagctctcttcctcatcccccggaatccagcgcctcggctg^^ 

TCTAAGAAGTGGTCAAAAAAATTCCAGTCATTTATTGAGAGCTGCT^^^ 

'''''^''''''''^^''^''^^^^^^^^^^^^^GGGAGTCGACGCTGCGGAGGG^^^ 

ctgaggctgcagctggccaacaaggagcgttctgaggccctacggaggcagcaSSIg 

CAGCAGCAGCGGGAGAATGAGGAGCACAAGCGGCAGCTGCTGGCcSSS^^ 

atcgaggagcagaaagagcagaggcggcggctggaggagcaacaSggcSSg^g^^^ 
ctgcggaagcagcaggagagggagcagcgccggcactatgagSgSga^^^ 

cagcatcagcggcaggagcagaggcctgtggagaagaagccactgtaccattacS^gaI 
ggaatgagtcctagtgagaagccagcatgggccaaggaggtagaagaaSgt^S^ 

^^^°^S5^^^^<=^^^^°^^A^°CCTCACAAGG^ 

ctgcccccaaggtcggagtccttcagcattagtggagttcagcctgctcgaacacccc^^ 
^^^S^^°^^^^tcg^tccc^gatcccacatctggtagctgtaSI?c^^ 

gaggctctgaacgtgacctcccaccgcgtggagatgccacgccagaactcagatSSc^ 

caggaagaagacattccaccaaaggtgcctcaaagaacaacttctatatcccSg^^^ 

^S^^^'^^'^^^'^^^^^^^^^^^^T^^T^CTCTGGGACCCAGACTAG^^^^ 

atcagagcaagcaacgctgatctccggagaactgagcccatcttggagagcc^^^ 

aggaccagcagtggcagttcctccagctccagcacccctagctcccagccSgSSS^ 

ggaggctcccagcctggatcacaagcaggatccagtgaacgcaccSagttcgSc 

AGTAAGTCAGAAGGATCACCTGTGCTCCCCCATGAGCCTGCC^GG?SSIcC^^ 



tccagggacattacccggcccagtcgaccagct. 



'AGCTACAAAAAAGCTATAGATGAGGAT 
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FIGURE 9H 

CTGACGGCATTAGCCAAAGAACTAAGAGAACTCCGGATTGAAGAAACATACCGCCCAATG 

AAGAAGGTGACTGATTACTCCTCCTCCAGTGAGGAGTCAGAAAGTAGCGAGGAAGAGGAG 

GAAGATGGAGAGAGCGAGACCCATGATGGGACAGTGGCTGTCAGCGACATACCCAGACTG 

ATACCAACAGGAGCTCCAGGCAGCAACGAGCAGTACAA.TGTGGGAATGGTGGGGACGCAT 

GGGCTGGAGACCTCTCATGCGGACAGTTTCAGCGGCAGTATTTCAAGAGAAGGAACCTTG 

ATGATTAGAGAGACGTCTGGAGAGAAGAAGCGATCTGGCCACAGTGACAGCAATGGCTTT 

GCTGGCCACATCAACCTCCCTGACCTGGTGCAGCAGAGCCATTCTCCAGCTGGAACCCCG 

ACTGAGGGACTGGGGCGCGTCTCAACCCATTCCCAGGAGATGGACTCTGGGACTGAATAT 

GGCATGGGGAGCAGCACCAAAGCCTCCTTCACCCCCTTTGTGGACCCCAGAGTATACCAG 

ACGTCTCCCACTGATGAAGATGAAGAGGATGAGGAATCATCAGCCGCAGCTCTGTTTACT 

GGCGAACTTCTTAGGCAAGAACAGGCCAAACTCAATGAAGCAAGAAAGATTTCGGTGGTA 

AATGTAAACCCAACCAACATTCGGCCTCATAGCGACACACCAGAAATCAGAAAATACAAG 

AAACGATTCAACTCAGAAATACTTTGTGCAGCTCTGTGGGGTGTAAACCTTCTGGTGGGG 

ACTGAAAATGGCCTGATGCTTTTGGACCGAAGTGGGCAAGGCAAAGTCTATAATCTGATC 

AACCGGAGGCGATTTCAGCAGATGGATGTGCTAGAGGGACTGAATGTCCTTGTGACAATT 

TCAGGAAAGAAGAATAAGCTACGAGTTTACTATCTTTCATGGTTAAGAAACAGAATACTA 

CATAATGACCCAGAAGTAGAAAAGAAACAAGGCTGGATCACTGTTGGGGACTTGGAAGGC 

TGTATACATTATAAAGTTGTTAAATATGAAAGGATCAAATTTTTGGTGATTGCCTTAAAG 

AATGCTGTGGAAATATATGCTTGGGCTCCTAAACCGTATCATAAATTCATGGCATTTAAG 

TCTTTTGCAGATCTCCAGCACAAGCCTCTGCTAGTTGATCTCACGGTAGAAGAAGGTCAA 

AGATTAAAGGTTATTTTTGGTTCACACACTGGTTTCCATGTAATTGATGTTGATTCAGGA 

AACTCTTATGATATCTACACACCATCTCATATTCAGGGCAATATCACTCCTCATGCTATT 

GTCATCTTGCCTAAAACAGATGGAATGGAAATGCTTGTTTGCTATGAGGATGAGGGGGTG 

TATGTAAACACCTATGGCCGGATAACTAAGGATGTGGTGCTCCAATGGGGAGAAATGCCC 

ACGTCTGTGGCCTACATTCATTCCAATCAGATAATGGGCTGGGGCGAGAAAGCTATTGAG 

ATCCGGTCAGTGGAAACAGGACATTTGGATGGAGTATTTATGCATAAGCGAGCTCAAAGG 

TTAAAGTTTCTATGTGAAAGAAATGATAAGGTATTTTTTGCATCCGTGCGATCTGGAGGA 

AGTAGCCAAGTGTTTTTCATGACCCTCAACAGAAATTCCATGATGAACTGGTAACAGAAG 

AGCACTTGGCACTTATCTTCATGGCGTTATTTCTAATTTAAAAGAACATAACTCATGTGG 

ACTTATGCCAGTCTAGAGGCAGAATCAGAAGGCTTGGTTGAACATATCGCTTTCCCTTTT 

TCCTCTCCCTCCGCCCCTCCCAGTACAGTCCATCT 

SEQ ID NO: 11 2C3 human 

GCATTTGGGGAGGTGTATGAGGGTCGGCATGTCAAGACGGGGCAGCTGGCTGCCATCAAG 
GTCATGGATGTCACGGAGGACGAGGAGGAAGAGATCAAACAGGAGATCAACATGCTGAAA 
AAGTACTCTCACCACCGCAACATCGCCACCTACTACGGAGCCTTCATCAAGAAGAGCCCC 
CCGGGAAACGATGACCAGCTCTGGCTGGTGATGGAGTTCTGTGGTGCTGGTTCAGTGACT 
GACCTGGTAAAGAACACAAAAGGCAACGCCCTGAAGGAGGACTGTATCGCCTATATCTGC 
AGGGAGATCCTCAGGGGTCTGGCCCATCTCCATGCCCACAAGGTGATCCATCGAGACATC 
AAGGGGCAGAATGTGCTGCTGACAGAGAATGCTGAGGTCAAGCTAGTGGATTTTGGGGTG 
AGTGCTCAGCTGGACCGCACCGTGGGCAGACGGAACACTTTCATTGGGACTCCCTACTGG 
ATGGCTCCAGAGGTCATCGCCTGTGATGAGAACCCTGATGCCACCTATGATTACAGGAGT 
GATATTTGGTCTCTAGGAATCACAGCCATCGAGATGGCAGAGGGAGCCCCCCCTCTGTGT 
GACATGCACCCCATGCGAGCCCTCTTCCTCATTCCTCGGAACCCTCCGCCCAGGCTCAAG 
TCCAAGAAGTGGTCTAAGAAGTTCATTGACTTCATTGACACATGTCTCATCAAGACTTAC 
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CAGCGGCGCATAGAGGAGCAGAAGGAGGAGCGGCGCCGCGToSGGlrrl^^^^^^ 

GAGCGGGAGCAGCGGAAGCTGCAGGAGAAGGAGCRoSGC^GCTrS?^^2^^''°° 

GCTCTGCGGCGGGAGGAGGAGCGGCGaCAGGCGSGScSSrrr»^SfS^'^°"'° 
AGGCTAGAGGAGGAGCAGCGACAGCTCGAGATC^^T^.n^?Sf22'^^*^*"'^°'^<^^ 

gccctgctgctggaatacaagSgSgS^§SgSg™S?S^^^ 



GCACG 



cccactgccacgcccagtgcccgSSg??gtcSS^^ 
gaaggacctggccccagcccgaatccccSStggg?c?^^ 

GACAGCATCCcSTSSScC^CTGG-^r^^p?^^^^ 
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ATTAAGTTCCTGGTCATCGCCCTCAAGAGCTCCGTGGAGGTGTATGCCTGGGCCCCCAAA 
CCCTACCACAAATTCATGGCCTTCAAGTCCTTTGCCGACCTCCCCCACCGCCCTCTGCTG 
GTCGACCTGACAGTAGAGGAGGGGCAGCGGCTCAAGGTCATCTATGGCTCCT^GTGCTGGC 
TTCCATGCTGTGGATGTCGACTCGGGGAACAGCTATGACATCTACATCCCTGTGCACATC 
CAGAGCCAGATCACGCCCCATGCCATCATCTTCCTCCCCAACACCGACGGCATGGAGATG 
CTGCTGTGCTACGAGGACGAGGGTGTCTACGTCAACACGTACGGGCGCATCATTAAGGAT 
GTGGTGCTGCAGTGGGGGGAGATGCCTACTTCTGTGGCCTACATCTGCTCCAACCAGATA 
ATGGGCTGGGGTGAGAAAGCCATTGAGATCCGCTCTGTGGAGACGGGCCACCTCGACGGG 
GTCTTCATGCACAAACGAGCTCAGAGGCTCAAGTTCCTGTGTGAGCGGAATGACAAGGTG 
TTTTTTGCCTCAGTCCGCTCTGGGGGCAGCAGCCAAGTTTACTTCATGACTCTGAACCGT 
AACCGCATCATGAACTGGTGACGGGGCCCTGGGCTGGGGCTGTCCCACACTGGACCCAGC 
TCTCCCCCTGCAGCCAGGCTTCCCGGGCCGCCCCTCTTTCCCCTCCCTGGGCTTTTGCTT 
TTACTGGTTTGATTTCACTGGAGCCTGCTGGGAACGTGACCTCTGACCCCTGA • 

SEQ ID NO: 12 ZC4 human 

CAATGTTAACCCACTCTATGTCTCTCCTGCATGTAAAAAACCACTAATCCACATGTATGA 
AAAGGAGTTCACTTCTGAGATCTGCTGTGGTTCTTTGTGGGGAGTCAATTTGCTGTTGGG 
AACCCGATCTAATCTATATCTGATGGACAGAAGTGGAAAGGCTGACATTACTAAACTTAT 
AAGGCGAAGACCATTCCGCCAGATTCA?IGTCTTAGAGCCACTC!AATTTGCTGATTACCAT 
CTCAGGTCATAAGAACAGACTTCGGGTGTATCATCTGACCTGGTTGAGGAACAAGATTTT 
GAATAATGATCCAGAAAGTAAAAGAAGGCAAGAAGAAATGCTGAAGACAGAGGAAGCCTG 
CAAAGGTATTGATAAGTTAACAGGCTGTGAACACTTCAGTGTCCTCCAACATGAAGAAAC 
AACATATATTGCAATTGCTTTGAAATCATCAATTCACCTTTATGCATGGGCACCAAAGTC 
CTTTGATGAAAGCACTGCTATTAAAGTATTTCCAACACTTGATCATAAGCCAGTGACAGT 
TGACCTGGCTATTGGTTCTGAAAAAAGACTAAAGATTTTCTTCAGCTCAGCAGATGGATA 
TCACCTCATCGATGCAGAATCTGAGGTTATGTCTGATGTGACCCTGCCAAAGAATCCCCT 
GGAAATCATTATACCACAGAATATCATCATTTTACCTGATTGCTTGGGAATTGGCATGAT 
GCTCACCTTCAATGCTGAAGCCCTCTCTGTGGAAGCAAATGAACAACTCTTCAAGAAGAT 
CCTTGAAATGTGGAAAGACATACCATCTTCTATAGCTTTTGAATGTACACAGCGAACCAC 
AGGATGGGGCCAAAAGGCCATTGAAGTGCGCTCTTTGCAATCCAGGGTTCTGGAAAGTGA 
GCTGAAGCGCAGGTCAATTAAGAAGCTGAGATTCCTGTGCACCCGGGGTGACAAGCTGTT 
CTTTACCTCTACCCTGCGCAATCACCACAGCCGGGTTTACTTCATGACACTTGGAAAACT 
TGAAGAGCTCCAAAGCAATTATGATGTCTAAAAGTTTCCAGTGATTTATTACCACATTAT 
AAACATCATGTATAGGCAGTCTGCATCTTCAGATTTCAGAGATTAAATGAGTATTCAGTT 
TTATTTTTAGTAAAGATTAAATCCAAAACTTTACTTTTAATGTAGCACAGAATAGTTTTA 
ATGAGAAATGCAGCTTTATGTATAAAATTAACTATAGCAAGCTCTAGGTACTCCAATGGT 
GTACAATGTCTTTTGCACAAACTTTGTAACTTTTGTTACTGTGAATTCAAACATTACTCT 
TTGGACAGTTTGGACAGTATCTGTATTCAGATTTTACAACATGGAGTAAAGAAACCTGTT 
ATGAATTAGATTACAAGCAGCCTTCAA?^GAATTGGCACTGGGATAAGATTTTTCAGAAA 
AGAAAAACATCGGCAAACT 

SEQ ID NO: 17 KHS2 human 

CCGCCATGAACCCCGGCTTCGATTTGTCCCGCCGGAACCCGCAGGAGGACTTCGAGCTGA 
TTCAGCGCATCGGCAGCGGCACCTACGGCGACGTCTACAAGGCACGGAATGTTAACACTG 
GTGAATTAGCAGCAATTAAAGTAATAAAATTGGAACCAGGAGAAGACTTTGCAGTTGTGC 
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GCTATCtCAGGCGAGATAAGCTTTGGATTTGCATGGAGTTTTGTGGAGGTGGTTCTTTao 

aggatatttatcacgtaactggacctctgtcagaactgcSattgca?^^^ 
aaacactgcagggattatattatcttcacagtaaaggaSaatgSc^^^ 

GAGCTAACATTCTATTAACGGATAATGGTCAT^S^C^^^^ 

cacagataacagctacaattgccaaacggaagtctttcattggcacacStS?SIt^^ 

ctccagaagttgcagctgttgagaggaaggggggttacaatS^cStot^^^ 

cagtgggaatcactgccatagaacttgcagagcttcagcctcctatot^^ 

caatgagagcattatttctaatgacaaaaagcaattttcScScS 

aaatgaaatggtcaaatagttttcatcactttgtgaaaatgg^^^ 

aaaaaagacctactgctgaaaaattattacagcatccttttgtaaSc^ 

ggtctttggcaatcgagctgttggataaagtaaataatccaSiSStccS?^^^^ 
atttcgatgatgatgatcctgagcctcttgttgctgS 



GTAGAAACGTGAGAGAAGAAAAAACACGCTCAG^GAi^J^;;;^^ 

atccacccttaagaaaggagacagaaccacatcatgaacttcc?gac^ 



atgatgaatctaaacactcaactctgaaagcaaaaattcSc^cc^^^ 



aCCTAAAGTGCftTATGGGTGCATGTTTTTCAAAAG^TTTT^TGGGT^^^^ 
IIS^?!???5*^5*^=^^°°^TAAACCCAGATACAAGAGA^?^G?^??I^TS??^ 



CCGAAGAAGGGATTTATACCCTCAATCITAATGAACTTCRTGAAASTCAATGGA»r:>Pr. 

\ccagggctttttgattatgcaag/ 
:::aaactccctgacagaatactgcc? 

tagaatgggttgaaccaatgcagaaatttatgttaattaagcaca?I5aS:"c^^^ 



TGTAA 



acctatgtggagcacttcagactagcattgt 

^TTTATGTTAATTAAGCACATAGATTTTCC 
rGCTGGTAGTTCCTGAACAGGAGTACCCTTT. 
VCTTCAACCAAGTGGTTCGATTTGAGACGGTi 
TAACnraan-T.^^A.^n^. -^GAATCAGATACCCCACAGACAAATGTTAC 
TAACCCAACTGGAGAGAGATACCATCCTTGTATGCTTGGACTGTTGTATAAAAATAGT^ 



™^2!!^5^^°^^°CTATTCCAGCACACAAACTCCCTGACAGAAT^^^^ 

3AAACCAAATGGTG 

ztatgtggagcact 
tttatgttaattaa' 
:tggtagttcctga 

TV « .T^-T,^™ ' ^TCAACCAAGTGGT' 

caaattctacctcttcatggtttacagaatcagataccccacagacaaatgttactc^^^^ 



gaatcagataccccacagacaaatgttac] 
agattgaatcaatagtgtgcctacaagacagtgtgctagctttctggaaacatggaatgc 



^^??I?2^="™="=TAATGAGGTAAc£Si;^;^;n— :;^rc^^^ 

SgSS?S^^>?"''^'^'^°=°''''°^°^"^°<^°taggccaJctg^ 
aagctgtacaaaagctgcagtaacctgtcttcagttac???g?^t??^g?ggSSa 
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GATAAGATGGGGAAAATTTTGTTTTAAGTGGTATGGATATATTTAGCATATTGAACCACA 

CAAGTGCTTAATTCATTGTTATGTAATCTTTGTACATATAGGCAGTATTTTTTCTGTGAA 

ACTTCATATTGCTGAAGACATACACTAAGAATTTATGTAGATAATGTACTTTTATGAGAT 

GTACAAGTAAGTGTCTTATCTGTACAGATGTAAATGTTGATGAAAATGCAATTGGGGTTA 

ATATTTTAAGAATTCTTTAGTATATTCTTGGGTGTGGCTATATTACAAAATGGGATGCTG 

GCAATGAAACAATACATTTAACACTATTGTATTTTTATTATATGTAATTTAGTAATATGA 

ATATAAATCTTGTAACTTTTAAAATTGTAATGGAGGCTGTAATCATTTTATAATCTTTTT 

AATTTTAATGCAAGTACACTGGTGTTTATATTTGCACAAAGTATTGATATGTGATGTATT 

AAGTCACAAAAGTAAGCTGTGACATTGTCTATAAGCATTTGGCTCCACAAATGTATTTGG 

ATTGTTTTCTATGTGAAGCAAACCAATTATAATTAACCACATGTTGTAGTAACTGGTCTT 

TTTATATTTAAGCAGAATCCTGTAAGATTGCTTGTCTTTGCTTAAAAACAATACCTTTGA 

ACATTTTTGAATCACAGAATAGCGGTACCATGATAGAATACTGCAATTGTGGTCAGAATT 

ACAGTATGCACAAAGAATTAATTAGCATTATTAAAGAGTCCTCACTAAACATTTCATATG 

ATCACACTGAAGAACTGTAACATTCCATAGAGTGAAGTGGTTCAAATTTCTCTTGGAATT 

TTTACTTTTGTTGGCCTTATTTTATGATCCTTTTCATATTTCTTTTGACTTAGAGTATTA 

ATACATGGCCAAAATAATTTAGTTACTACCTCATACAAACAATATAATGGTTACTACACA 

TCACAGGAACTTAGTTTTGGTTTAAGTCATTTTTGATTGCTTTTTTCCAATGGAATATGT 

ATATACCAGGTTTTAGCAAAATGCACACTTTTGGCTCTTTTTGGTATATGTTCTTTATAT 

TTTAATGTGAGTATATACACTAAGAACAAACTAAATTGTGATTTATGATCTTCATTTATT 

TTAATGATAATGGTTTTAAAATATGTTCCTGATTGTACATATTGTAAAATAAACATGTTT 
TTT 



SEQ ID NO: 19 SULUl human 

GGGAGGGTCCTTGTGGCGCCGGGCGGCGGGGTCCTGCGTGGAGAGTGGGACGCAACGCCG 

AGACCGCGAGCAGAGGCTGCGCACAGCCGGATCCGGCACTCAGCGACCGGACCCAAGGAT 

CCGCCGGGGAACAAGCCACAGGAGAGCGACTCAGGAACAAGTGTGGGAGAGGAAGCGGCG 

GCGGCGGCGCCGGGCCCGGGGGTGGTGACAGCAGGTCTGAGGTTGCATCATAAATACAAA 

GGACTGAAGTTATAAAAGAGAAAAGAGAAGTTTGCTGCTAAAATGAATCTGAGCAATATG 

GAATATTTTGTGCCACACACATUUU^GGTACTGAAGATTTACCCCCCAAAAAAAATTGTCA 

ATGAGAAATAAAGCTAACTGATATCAAAAAGCAGAGCCTGCTCTACTGGCCATCATGCGT 

AAAGGGGTGCTGAAGGACCCAGAGATTGACGATCTATTCTACAAAGATGATCCTGAGGAA 

CTTTTTATTGGTTTGCATGAAATTGGACATGGAAGTTTTGGAGCAGTTTATTTTGCTACA 

AATGCTCACACCAATGAGGTGGTGGCAATTAAGAAGATGTCCTATAGTGGGAAGCAGACC 

CATGAGAAATGGCAAGATATTCTTAAGGAAGTTAAATTTTTACGACAATTGAAGCATCCT 

AATACTATTGAGTACAAAGGCTGTTACTTGAAAGAACACACTGCTTGGTTGGTGATGGAA 

TATTGCTTAGGCTCAGCCTCTGATTTATTAGAAGTTCATAAAAAACCACTTCAGGAAGTG 

GAGATCGCTGCCATTACTCATGGAGCCTTGCATGGACTAGCCTACCTACATTCTCATGCA 

TTGATTCATAGGGATATTAAAGCAGGAAATATTCTTCTAACAGAGCCAGGTCAGGTAAAA 

CTAGCTGATTTTGGATCTGCTTCAATGGCTTCTCCTGCCAACTCCTTCGTGGGCACACCT 

TACTGGATGGCTCCAGAGGTGATCTTAGCTATGGATGAAGGACAGTATGATGGGAAAGTT 

GATATTTGGTCACTTGGCATCACTTGTATTGAATTGGCGGAACGGAAGCCGCCCCTTTTC 

AACATGAATGCAATGAGTGCCTTATATCACATTGCCCAGAATGACTCCCCAACGTTACAG 

TCTAATGAATGGACAGACTCCTTTAGGAGATTTGTTGATTACTGCTTGCAGAAAATACCT 

CAGGAAAGGCCAACATCAGCAGAACTATTAAGGCATGACTTTGTTCGACGAGACCGGCCA 

CTACGTGTCCTCATTGACCTCATACAGAGGACAAAAGATGCAGTTCGTGAGCTAGATAAC 
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GAGTCACAGGAGGATGAGGAAGACAGTGAACATGGAACCAGCCTGAACAGGGAAATG^r 
^^J^^^^^^^^^^^^^^C^TTCCAAGC^^^^ 

agtgtgaacagcatgcaggaagtcatggacgagagcagttccgaacttgtcatgatgS^c 

GATGACGAAAGCACAATCAATTCCAGCTCCTCCGTCGTGCATAAGAAAGATCATGTAT^^ 

acaagggatgaggcgggccacggcgatcccaggcctgagccgcggcctacccagtcagtt 

CAGAGCCAGGCCCTCCACTACCGGAACAGAGAGCGCTTTGCCACGATCAAATCAGCATCT 

ttggttacacgacagatccatgagcatgagcaggagaacgagttgcgggaacagatgtca 
aactcgtccatcgagctggagaagctggccaagaagcaagtggctatcatagaaaaggag 

^^^^^^''^''-^^^'^^''''^^^^^^^^^^cccaagaaagagaagc^^ 

^^^^'''^''''^^'^^^^^^^^^^^^ctgaagaggaagcccacSt^^^ 

^JJ??^^°^^^°t^ctacgacaaaaattgtcgtttcttcaagcggaaaataatga^ 

gagatggagcatgccatgctaatccggcacgacgagtccacccgagagctagStaS^^ 

cagctgcacacgttacagaagctacgcatggatctgatccgSaSgScSSc^ 

ctggaaaaccagctggagtacaataagaggcgagaaagagaac?gScS 

^^J^S^^^^^^'^-^^^^^^tacagaccaaacagtataaagcactcaagaatcaccag^^ 

?^^Sr°''^^^^^''°''''^''°^°^^^^^CAGAGTATAAATGAAATGATGG^^^ 

^^^tSS'^^^^'^'^'^^^^^^^'^^^^A^^^AGCAAAATCAAGATGCAAAS^ 
?^??J?^''''^^''°'^^^^°^°^'^^°^^^°AGAGTGTCTCTGCGc5^ 

AACCACATTCCCCATTTTAACGGGCGTTGCTCTCACTCTCI^TC^ 
^?^JJ°f^^°°ACTAGTGCCTGTTTATTCTTACTCCATCAGGG^^^^ 

cgtgtcaactttcagtgctggccaaaacctggccgtctcttctattcacagtaS^^ 

acggggggtgggtggggtgaagagatcttgtgatttttaagaacctStgc^^^ 
aacttgtcttcagcatattctgattgtatcataatS?tt?^^^ 
gaatacacttaaggagctcacaga^tcccagtagcacSJ?^^^ 
tattttgtgtatagaaggaatttaaggagaggtatta^I??^^ 

CTGTTTCTCTGATCAAATTTTTTTACTTCCTCCTC?TG?ic????^^^ 
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TCCAGTTCAGTATTTGGAGTTCAACACTGTCTCTCAATCAGATCATCTTGATCTTTTTCT 
TTATCTCCCTTCCCCTTCCTAAGTCCCATTTCTTGGTCATAAATATTGCATTATTCACAC 
TTTCAAACTGTGTATTTTCTTACAATAAAAAATGATGAAAAAAAAAAAAAAAAAAA 

SEQ ID NO: 2 0 SULU3 human 

TATTGAATTGGCGGAACGGAAGCCTCCTTTATTTAATATGAATGCAATGAGTGCCTTATA 

TCACATAGCCCAAAATGAATCCCCTACACTACAGTCTAATGAATGGTCTGATTATTTTCG 

CAACTTTGTAGATTCTTGCCTCCAGAAAATCCCTCAAGATCGACCTACATCAGAGGAACT 

TTTAAAGCACATATTTGTTCTTCGGGAGCGCCCTGAAACCGTGTTAATAGATCTCATTCA 

QAGGACAAAGGATGCAGTAAGAGAGCTGGACAATCTGCAGTATCGAAAGATGAAGAAACT 

CCTTTTGCAGGAGGCACATAATGGACCAGCAGTAGAAGCACAGGAAGAAGAAGAGGAACA 

AGATCATGGTGTTGGCCGGACAGGAACAGTTAATAGTGTTGGAAGTAATCAATCCATTCC 

CAGCATGTCCATCAGTGCCAGCAGCCAAAGCAGTAGTGTTAACAGTCTTCCAGATGTCTC 

AGATGACAAGAGTGAGCTAGACATGATGGAGGGAGACCACACAGTGATGTCTAACAGTTC 

TGTTATCCATTTAAAACCAGAGGAAGAAAATTACAGAGAAGAGGGAGATCCTAGAACAAG 

AGCATCAGATCCACAATCTCCACCCCAAGTATCTCGTCACAAATCACACTATCGTAATCG 

AGAACACTTTGCTACTATACGGACAGCATCACTGGTTACGAGGCAAATGCAAGAACATGA- 

GCAGGACTCTGAGCTTAGAGAACAAATGTCTGGCTATAAGCGAATGAGGCGACAACATCA 

AAAGCAACTGATGACTCTGGAAAACAAGCTAAAGGCTGAGATGGATGAACATCGCCTCAG 

ATTAGACAAAGATCTTGAAACTCAGCGTAACAATTTTGCTGCAGAAATGGAGAAACTTAT 

CAAGAAACACCAGGCTGCCATGGAGAAAGAGGCTAAAGTGATGTCCAATGAAGAGAAAAA 

ATTTCAGCAACATATTCAGGCCCAACAGAAGAAAGAACTGAATAGTTTTCTCGAGTCCCA 

GAAAAGAGAGTATAAACTTCGAAAAGAGCAGCTTAAAGAGGAGCTAAATGAAAACCAGAG 

TACCCCCAAAAAAGAAAAACAGGAGTGGCTTTCAAAGCAGAAGGAGAATATACAGCATTT 

CCAAGCAGAAGAAGAAGCTAACCTTCTTCGACGTCAAAGACAATACCTAGAGCTGGAATG 

CCGTCGCTTCAAGAGAAGAATGTTACTTGGGCGTCATAACTTAGAGCAGGACCTTGTCAG 

GGAGGAGTTAAACAAAAGACAGACTCAGAAGGACTTAGAGCATGCCATGCTACTCCGACA 

GCATGAATCTATGCAAGAACTGGAGTTCCGCCACCTCAACACAATTCAGAAGATGCGCTG 

TGAGTTGATCAGATTACAGCATCAAACTGAGCTCACTAACCAGCTGGAATATAATAAGCG 

AAGAGAACGAGAACTAAGACGAAAGCATGTCATGGAAGTTCGACAACAGCCTAAGAGTTT 

GAAGTCTAAAGAACTCCAAATAAAAAAGCAGTTTCAGGATACCTGCAAAATCCAAACCAG 

ACAGTACAAAGCATTAAGAAATCACCTGCTGGAGACTACACCAAAGAGTGAGCACAAAGC 

TGTTCTGAAACGGCTCAAGGAGGAACAGACCCGGAAATTAGCTATCTTGGCTGAGCAGTA 

TGATCACAGCATTAATGAAATGCTCTCCACACAAGCCCTGCGTTTGGATGAAGCACAGGA 

AGCAGAGTGCCAGGTTTTGAAGATGCAGCTGCAGCAGGAACTGGAGCTGTTGAATGCGTA 

TCAGAGCAAAATCAAGATGCAAGCTGAGGCACAACATGATCGAGAGCTTCGCGAGCTTGA 

ACAGAGGGTCTCCCTCCGGAGGGCACTCTTAGAACAAAAGATTGAAGAAGAGATGTTGGC 

TTTGCAGAATGAGCGCACAGAACGAATACGAAGCCTGTTGGAACGTCAAGCCAGAGAGAT 

TGAAGCTTTTGACTCTGAAAGCATGAGACTAGGTTTTAGTAATATGGTCCTTTCTAATCT 

CTCCCCTGAGGCATTCAGCCACAGCTACCCGGGAGCTTCTGGTTGGTCACACAACCCTAC 

TGGGGGTCCAGGACCTCACTGGGGTCATCCCATGGGTGGCCCAGCACAAGCTTGGGGCCA 

TCCAATGCAAGGTGGACCCCAGCCATGGGGTCACCCTTCAGGGCCAATGCAAGGGGTACC 

TCGAGGTAGCAGTATGGGAGTCCGCAATAGCCCCCAGGCTCTGAGGCGGACAGCTTCTGG 

GGGACGGACGGAGCAGGGCATGAGCAGAAGCACGAGTGTCACTTCACAAATATCCAATGG 
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GTCACACATGTCTTATACATAACTTAATAATTGAGAGTGGCAATTCCGCTGGAGCTGTCT 

GCCAAAAGAAACTGCCTACAGACATCATCACAGCAGCCTCCTCACTTGGGTACTACAGTG 

TGGAAGCTGAGTGCATATGGTATATTTTATTCATTTTTGTAAAGCGTTCTGTTTTGTGTT 

TACTAATTGGGATGTCATAGTACTTGGCTGCCGGGTTTGTTTGTTTTTGGGGAAATTTTG 

AAAAGTGGAGTTGATATTAAAAATAAATGTGTATGTGTGTACATATATATACACACACAT 

ACACATATATTATGCATGTGGTGAAAAGAATTGGCTAGATAGGGGATTTTTCTGAACACT 

GCAAAAATAGAACGTAGCAAAATGGCTTCAGTTATCACTTTTGGGTGTCTGTATCCTAAG 

AAGTTTCTGAAAAGATCTAAAGCCTTTTTATCCCATATCCCAAATTCTTATGAGCCACTC 

ACAGCAGGCAGCATATGTTGAAATAAGTTATTACTGGTACACACCTGCATTGCCTCACCA 

QTGTATTTATTTGTTATTAAATTGATCTGACTTCTCAGCCTCATTTGGACTAAAAAAAGA 

AAGCAGAAATCCATGAACACATTGCTTCTCGGCCTTTTGGCTAAGATCAAGTGTAGAAAT 

CCATGAACACTAAAGGACTTCATTGATTTTTTCAGAGAGTAGAAAACAACTTAGTTTTTC 

TTTTTTCCTGAATGCGTCATAGGCTTGTGAGTGATTTTTGTCCATTCAATTGTGCCTTCT 

TTGTATTATGATAAGATGGGGGTACTTAAGGAGATCACAAGTTGTGTGAGGATTGCATTA 

ACAAACCTATGAGCCTTCAATGGGGAAGACCAGAAGGGTGAGAGGGGCCCTGAAAGTTCA 

TATGGTGGGTATGTCCCGCAGCAGAGTGAGGAGATGAAGCTTACGTGTCCTGACGTTTTG 

atttaaatgtttagttgccacagtgaaccaggagtcactSg5caa?ga^^ 

gctgactaatcttcatcaccactgtagattttgctgcatgtgSggt^^^ 

JISSJS'^'^ttcgttgctgcagtactttacaaacttctagttcSttSg^^^ 

tttggcatcaagttaacatcacacaataggaaacaccacttccacaagtctcaagcot 

gtgctaaagtactactgaaaaggaactaggaagtttggccaatt 

■SEQ ID NO: 21 SULU3 murine 

gcaggatgccatcaactaacagagcaggcagtctaaaggaccctgaaattgcagagctct 
tcttcaaagaagatccggaaaagctcttcacagatctcagagaaatcggccatgggagct 
ttggagcagtatattttgcacgagatgtgcgtactaatgaagtggtggccatcaagaaaa 
tgtcttatagtggaaagcagtctactgagaaatggcaggatattattaaggaagtcaagt 
ttctacaaagaataaaacatcccaacagtatagaatacaaaggctgctatttacgtgaac 
acacagcatggcttgtaatggaatattgtttaggatctgcttcagatttattagaagttc 
ataaaaagccattacaagaagtggaaatagcagcaattacacatggtgctctccagggac 
tagcttatttacattctcataccatgatccatagagatatcaaagcaggaaatatccttc 
tgacagaaccaggccaagtgaaacttgctgactttggatctgcttccatggcttcccctg 

aaggacagtatgatggcaaagttgatgtatggtctcttggaataacgtgtattgaattag 
^^^^^^^'''^'''^^'^^^^^^''^^^'^'^^'^^^^^^tgagtgccttatat^^^ 

^^^!^^^'^'^^'^^■^^^^T^^^'^CTAatATGAATGATTCTTGCCTCCAGAAAATCCCTC 

aagatcgccctacatcagaggaacttttaaagcacatgtttgttcttcgagagcgccctg 

AAACAGTGTTAATAGATCTTATTCAAAGGACAAAGGATGCAGTAAGAGAGCTGGACAATC 

tgcagtatcgaaagatgaagaaactccttttccaggaggcacataatgggccagcggtag 

AAGCACAGGAAGAAGAGGAGGAGCAAGATCATGGTGTTGGCCGAACAGGAACAGTGAATA 

gtgttggaagcaatcagtctatccctagtatgtctatcagtgccagcagtcaaagcagca 
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GTGTTAATAGTCTTCCAGATGCATCAGATGACAAGAGTGAGCTAGACATGATGGAGGGAG 

ACCATACAGTGATGTCTAACAGTTCTGTCATCCACTTAAAACCTGAGGAGGAAAATTACC 

AGGAAGAAGGAGATCCTAGAACAAGAGCATCAGACCCACAGTCTCCCCCTCAGGTGTCTC 

GTCACAAGTCACATTATCGTAATAGAGAACACTTTGCAACCATACGAACAGCATCACTGG 

TTACAAGACAGATGCAAGAACATGAGCAGGACTCTGAACTTAGAGAACAGATGTCTGGTT 

ATAAGCGGATGAGGCGACAGCATCAAAAGCAGCTGATGACGCTGGAAAATAAACTGAAGG 

CAGAGATGGACGAACATCGGCTCAGATTAGACAAAGATCTTGAAACTCAGCGTAACAATT 

TCGCTGCAGAAATGGAGAAACTTATTAAGAAACACCAAGCTGCTATGGAAAAAGAGGCTA 

AAGTGATGGCCAATGAGGAGAAAAAATTCCAGCAACACATTCAGGCTCAACAGAAAAAAG 

AACTGAATAGCTTTTTGGAGTCTCAAAAAAGAGAATATAAACTTCGCAAAGAGCAGCTTA 

AGGAGGAGCTGAATGAAAACCAGAGCACACCTAAAAAAGAAAAGCAGGAATGGCTTTCAA 

AGCAGAAGGAGAATATACAGCATa;TTCAGGCAGAAGAAGAAGCTAATCTTCTTCGACGTC 

AAAGGCAGTATCTAGAGCTAGAATGTCGTCGCTTCAAAAGAAGAATGTTACTTGGGCGAC 

ATAACTTGGAACAGGACCTTGTCAGGGAGGAGTTAAACAAAAGGCAGACTCAAAAGGACT 

TGGAACATGCAATGCTATTGCGACAGCATGAATCAATGCAAGAACTGGAGTTTCGCCATC 

TCAACACTATTCAGAAGATGCGCTGTGAGTTGATCAGACTGCAGCATCAAACTGAGCTCA 

CTAACCAGCTAGAGTACAATAAGAGAAGGGAACGGGAACTGAGGCGAAAACATGTCATGG 

AAGTTCGACAACAACCTAAGAGTCTGAAGTCTAAAGAACTCCAAATAAAAAAGCAGTTTC- 

AGGATACCTGCAAAATTCAAACCAGACAGTACAAAGCATTAAGGAATCACCTACTGGAGA 

CTACACCAAAGAATGAGCACAAAGCAATC 

SEQ ID NO: 25 GEK2 human 

CGAAGCCACAGCCCGAGCCCGAGCCCGAGCCCGAGCCGGCGCCACCGCGCCCCCGGCCAT 
GGCTTTTGCCAATTTCCGCCGCATCCTGCGCCTGTCTACCTTCGAGAAGAGAAAGTCCCG 
CGAATATGAGCACGTCCGCCGCGACCTGGACCCCAACGAGGTGTGGGAGATCGTGGGCGA 
GCTGGGCGACGGCGCCTTCGGCAAGGTTTACAAGGCCAAGAATAAGGAGACGGGTGCTTT 
GGCTGCGGCCAAAGTCATTGAAACCAAGAGTGAGGAGGAGCTGGAGGACTACATCGTGGA 
GATTGAGATCCTGGCCACCTGCGACCACCCCTACATTGTGAAGCTCCTGGGAGCCTACTA 
TCACGACGGGAAGCTGTGGATCATGATTGAGTTCTGTCCAGGGGGAGCCGTGGACGCCAT 
CATGCTGGAGCTGGACAGAGGCCTCACGGAGCCCCAGATACAGGTGGTTTGCCGCCAGAT 
GCTAGAAGCCCTCAACTTCCTGCACAGCAAGAGGATCATCCACCGAGATCTGAAAGCTGG 
CAACGTGCTGATGACCCTCGAGGGAGACATCAGGCTGGCTGACTTTGGTGTGTCTGCCAA 
GAATCTGAAGACTCTACAGAAACGAGATTCCTTCATCGGCACGCCTTACTGGATGGCCCC 
CGAGGTGGTCATGTGTGAGACCATGAAAGACACGCCCTACGACTACAAAGCCGACATCTG 
GTCCCTGGGCATCACGCTGATTGAGATGGCCCAGATCGAGCCGCCACACCACGAGCTCAA 
CCCCATGCGGGTCCTGCTAAAGATCGCCAAGTCGGACCCTCCCACGCTGCTCACGCCCTC 
CAAGTGGTCTGTAGAGTTCCGTGACTTCCTGAAGATAGCCCTGGATAAGAACCCAGAAAC 
CCGACCCAGTGCCGCGCAGCTGCTGGAGCATCCCTTCGTCAGCAGCATCACCAGTAACAA 
GGCTCTGCGGGAGCTGGTGGCTGAGGCCAAGGCCGAGGTGATGGAAGAGATCGAAGACGG 
CCGGGATGAGGGGGAAGAGGAGGACGCCGTGGATGCCGCCTCCACCCTGGAGAACCATAC 
TCAGAACTCCTCTGAGGTGAGTCCGCCAAGCCTCAATGCTGACAAGCCTCTCGAGGAGTC 
ACCTTCCACCCCGCTGGCACCCAGCCAGTCTCAGGACAGTGTGAATGAGCCCTGCAGCCA 
GCCCTCTGGGGACAGATCCCTCCAAACCACCAGTCCCCCAGTCGTGGCCCCTGGAAATGA 
GAACGGCCTGGCAGTGCCTGTGCCCCTGCGGAAGTCCCGACCCGTGTCAATGGATGCCAG 
AATTCAGGTAGCCCAGGAGAAGCAAGTTGCTGAGCAGGGTGGGGACCTCAGCCCAGCAGC 
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CAACAGATCTCAAAAGGCCAGCCAGAGCCGGCCCAACAGO^GCGCCCTGGAGACOT^ 

caagagggactcggactgcagcagcctctgcacctctgagagcatggSt^^ 

TCTCTCCACTGACCTGTCGCTGAACAAAGAGATGGGCTCTOT^SScI^^^^ 

actgtacaaaaaaaccctcaagcggacacgcaaatttgtggtggSSgt^g^^^^ 

?i^^^?^^^^^^^^^^°AGAGCTTCGGCTGCTCCAGAAAGAAGAGCATCG^ 

CCAGCTGAGTAACAAGCATGAGCTGCAGCTGGAGCAAATGCATAAACGTTTT^ 

AATCAACGCCAAGAAGAAGTTCTTTGACACGGAATTAGAGAACCTGGAGC^^^ 
^S°?J^^^^°^^^^^TGGAGCAAGACCATGCCG^^^ 

gatccgcctggagcaggatcgggactacaccaggttccaagagcagSSaact^^ 

GAAAGAGGTGAAGAACGAGGTGGAGAAGCTCCCCCGACAGSGCGGS^Ar^^^ 

gcagaagatggaggagcacacgcagaaaaagcagcttS?ScSg^^^ 

GCAGAAGGAGGACCTGGAGCTGGCCATGAAGAGGCTCAcScCGA^S^^^ 

^^^!SJ^°^^°c<^c°agtggctcatgaagaagcaggagc^^^^ 




CAAGAAGAGCCTCCACATcXACGaaGGGGG^^GCG^Sci^^ 



CAGCGTGAGAAGATCAA 



GTC 
AG 



AGCGCCACAGAACTTCCAGCACCGTG 



TCCACACCTCCTTCGACCCCAAAGAAGGCAA( 



ATCTC 
GTT 



SEQ ID NO: 27 PAK4 human 
CGTTCCTGGGCTTCCCGCTCCG^GGC^^^ 

AGGAAi 
GAGAT< 
GGCAA( 
AAGCC( 
GTGCGC 
ftAGTTC 
SCACAC 
IIAGAGC 
\CACCAGC 
\ATGGGCT 
:GCTGa 

;gcacc 

GGTGGGCTCAGCCACAGGCAGGCCAGGTGGGG^SGcISJcCTr"""'"""''''^^^^^ 



GCCCGT 
TGCGGGG 
GTTGTC 
GTC 



GAGCAGCCTGAAGCGCAGGCTATTCCGAAGCATGTTCCTGTCCACTGCTrrrar^ 

tccaagcagcagcaagccaggccctccaccacagagSIgccS^c^^^^ 

^^^^^f^^^^^^^^CCCAAGCCTGGTGGCCAAGGCcS^^^ 
GCCGGTGGGGACCTTCAGCCCTCTGACCACTTCGGATACCAGCAGCC^^^^ 
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CCGCACAGCCCCGGCCACAGGCCAGCTTCCAGGCCGGTCTTCCCCAGCGGGATCCCCCCG 

CACCTGGCACGCCCAGATCAGCACCAGCAACCTGTACCTGCCCCAGGACCCCACGGTTGC 

CAAGGGTGCCCTGGCTGGTGAGGACACAGGTGTTGTGACACATGAGCAGTTCAAGGCTGC 

GCTCAGGATGGTGGTGGACCAGGGTGACCCCCGGCTGCTGCTGGACAGCTACGTGAAGAT 

TGGCGAGGGCTCCACCGGCATCGTCTGCTTGGCCCGGGAGAAGCACTCGGGCCGCCAGGT 

GGCCGTCAAGATGATGGACCTCAGGAAGCAGCAGCGCAGGGAGCTGCTCTTCAACGAGGT 

GGTGATCATGCGGGACTACCAGCACTTCAACGTGGTGGAGATGTACAAGAGCTACCTGGT 

GGGCGAGGAGCTGTGGGTGCTCATGGAGTTCCTGCAGGGAGGAGCCCTCACAGACATCGT 

CTCCCAAGTCAGGCTGAATGAGGAGCAGATTGCCACTGTGTGTGAGGCTGTGCTGCAGGC 

gCTGGCCTACCTGCATGCTCAGGGTGTCATCCACCGGGACATCAAGAGTGACTCCATCCT 

GCTGACCCTCGATGGCAGGGTGAAGCTCTCGGACTTCGGATTCTGTGCTCAGATCAGCAA 

AGACGTCCCTAAGAGGAAGTCCCTGGTGGGAACCCCCTACTGGATGGCTCCTGAAGTGAT 

CTCCAGGTCTTTGTATGCCACTGAGGTGGATATCTGGTCTCTGGGCATCATGGTGATTGA 

GATGGTAGATGGGGAGCCACCGTACTTCAGTGACTCCCCAGTGCAAGCCATGAAGAGGCT 

CCGGGACAGCCCCCCACCCAAGCTGAAAAACTCTCACAAGGTCTCCCCAGTGCTGCGAGA 

CTTCCTGGAGCGGATGCTGGTGCGGGACCCCCAAGAGAGAGCCACAGCCCAGGAGCTCCT 

AGACCACCCCTTCCTGCTGCAGACAGGGCTACCTGAGTGCCTGGTGCCCCTGATCCAGCT 

CTACCGAAAGCAGACCTCCACCTGCTGAGCCCACCCCAAGTATGCCTGCCACCTACGCCC 

ACAGGCAGGGCACACTGGGCAGCCAGCCTGCCGGCAGGACTTGCCTGCCTCCTCCTCTCA 

GTATTCTCTCCAAAGATTGAAATGTGAAGCCCCAGCCCCACCCTCTGCCCTTCAGCCTAC 

TGGGCCAGGCCGGACCTGCCCCCTCAGTGTCTCTCCCTCCCGAGTCCCCAGATGGAGACC 

CCTTTCTACAGGATGACCCCTTGATATTTGCACAGGGATATTTCTAAGAAACGCAGAGGC 

CAGCGTTCCTGGCCTCTGCAGCCAACACAGTAGAAAAGGCTGCTGTGGTTTTTTAAAGGC 

AGTTGTCCACTAGTGTCCTAGGCCACTGCAGAGGGCAGACTGCTGGTCTCCACAGATACC 

TGCTGTTCTCAGCTCCAGCTTCAAACCTCGAGTCTCGAGAGGGCCACGGGGTGGTTTTTA 

TGACCGGAATCCCGCTTCCTCCCTCACGTCTGATGTCCTGAAGGTGCAGTCCCACCTGTA 

CAGCCCCTCCCCGCCAAGAACTGTGAATGGCCTGCTCCAGGCCATGGCTGGGGGCAGGGA 

GTGAGGGGACAATTTCTGAGTGAAAGAGAAAGAATGGGGTCGGTGGTGAAGGTGCTCTCA 

CTTTACAGAATGGAGAGAACATCGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 

TGTGTGTGTGTGTGTGTGTGTGTGTGTAAGGGGAGGAAAGCCACCTTGACAGCCCAGGTC 

CCTCCAGGTCACCCACAGCCAGTTTCAGGAAGGCTGCCCCTCTCTCCCACTAAGTTCTGG 

CCTGAAGGGACCTGCTTTCTTGGCCTGGCTTCCACCTCTCCACTCCTGTGTCTACCTGGC 

CAGTGGAGTGGTCCATGCTAAGTCTAACACTCCTGGGAGCTCAGGAGGCTTCTGAGCTTC 

TCCTGTACTGTGCATCGTGAGGGCCAGAGACAGGAATGTAAGGATTGGCAACTGTGTTAC 

CTTTCAAGTTTATCTCAATAACCAGGTCATCAGGGACCCATTGTTCTCTTCAGAACCCTA 

TCTGGGAGAGAAGGCGAACCACCTCCGGGTTTCCATCATGTCAAGGTCACAGGCATCCAT 

GTGTGCAAACCATCTGCCCCAGCTGCCTCCACAGACTGCTGTCTCCTTGTCCTCCTCGGC 

CCTGCCCCACTTCAGGGCTGCTGTGAGATGGAATTCCAGGAAAGAACTTCAGGTGTCTGG 

ACCCTTTCTATCTAGATAATATTTTTAGATTCTTCTGCTCCCTAGTGACCTACCTGGGGG 

CAAAGAAATTGCAAGGACTTTTTTTTAAGGGTCAGAGTTTTCAAAACAAAAGCATCTTCC 

CTAGAAATTTTTGTGAATTGTTTGCACTTGTGCCTGTTTTAAATTAAATTGAGTGTTCAA 
AGCC 

SEQ ID NO: 2 8 PAK5 human 

GGCCAGTGGGGCGAAACTGGCAGCTGGCCGGCCCTTTAACACCTACCCGAGGGCTGACAC 
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:acccgagc 
sctggcccc 



S?S^2°?2^E^?^^^^^^CTA^CTGGACAACTTCATCAAGATTGGCGAGGGCTCCAC^^^ 

3gtggccgtcaagaagatg 
;gtggtaatcatgagggac 
jgtgggggacgagctctgg 



^2^°°?5°'^'=*^'='^='==°°ACATCAASAGCGACTCGATCCTOCTGAC^^ 

ggtgaagctgtcagactttgggt-x-ctgcgcccaggtgagcaaggSS^gSSgS^ 



AGAGCC 
GCCACC 
CCTGCI 
CCTGGC 
hTGAGQ 
3AGGCC 
:CACCC 
ICTTTT 
rCGGAC 

:ggaag 



CGCCTGCl 
TTCCTGGC 
?VGATGAGG 
ZTGAGGCC 
IIACCACCC 
TGACTTTT 

?r^^S^J2°°'^'^'^°''''°°=°°^°=*<''^°'*°^'^'^GAGGTGAASS 



^f°i2^?^?^S?!=°^"*°=TGTCGCCATCCCTGAAGGGCTTCCTGGACCGCCTGCT 

AGCCGAGCTGCT 

CCTCATGCGCC;! 
CCCCCGGGTCAC 

TCCTTGCTGGGGGTAGA^G^C;;^C^;A^^;^^^ 



S??°f???^^^^T^^C^GCGGGCCACGGCAGCCGAGCTGCTGAAGCACCCATTCCT^^^ 

CTCATC 

-* *^'»-*^"^^<->'^'^\jN-.\«\_CCCCGG 
AGTAGGGGGCCAGGCCTCCCACTCCTCCCAGCCCGGGAGATGCTCCGCGTGGCAC^^^^ 



CTGTGGCTGGAT55^^gG§?^g§=J°=§gJJ5==M^^^ 
GACTACTGCACCTGGACAGCCTCCTVTT^Tj^TaSrvSSSIF'^™^ 

-CCTGCCTCGAGTTA^?^^^™??^??^^^^^^ 
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>STLK5.h 

MSSPLPEQGCVELLTVIGKGFEDLMTVNLARYKPTGEWTVRRINLEACSNEMVTFLQGELHVSKL^ 

ADNELWWTSFMAYGSAKDLICTHPMIXSMNEIJVIAyiLQGVLKALDYIHHMGYVHRSV^ 

SMISHGQRQRVVHDFPKYSVKVLPWLSPEVLQQm-QGYDAKaDIYSVGITACELANGHVPPmMP 

LLDTSTIPAEELTMSPSRSVANSGLSDSLTTSTPRPSNGDSPSHPYHRTFSPHFHHFVEQCLQRNPDARPSASTLLNHSF 
PKQIXRRASEALPELLRPVTPITNFEGSOSQDHSQIFGLVTNLEELEVDDWEF 

>STLK6_h 

MSLLDCFCTSRTQVESLRPEKQSETSIHQYLVDEPTLSWSRPSTRASEVLCSTWSHYSLQVSIGRGFDNLTSVHLARHT 

PTGTLVTrKITm-ENCNEERLKALQKAVILSHPPRHPNITTYWTWTVOSWLWVISPFMAYGSASQLLRTYFPEGM^ 

IRNILFGAVRGLNYLHQNGC2HRSIKASHILISGDGLVTLSGLSHLHSLVKHGQRHRAVYDPPQF 

DLKQYNVKSDIYSVGITACEIASGQVPFQDMHRTQMLLQKLKGPPYSPLDISIFPQSESRMKNSOSGVDSGIGESVLVSS 
GTHTVNSDRLHTPSSKTFSPAFFSLVQLCLQQDPEKRPSASSXiLSHVFFKQMKEESQDSILSLLPPAYNKPSISLPPVLP 
WTEPSCDFPDEKDSYWEF 

>STLK7_h 

NiyDDYELQEVIGSGATAWQAAYCAPKKEKVAIKRINLEKCQTSMDELLKElQAMSQCHHPNIVSYYTSFVV^ 
KLLSGG9VLDIIKHIVAKGEHKS . 

>2C4_h 

MAGPGGWRDREVTDLGHI-POPTGIFSLDKTIGLGTYGRIYLGLHEKTOAFTAVKVMNARKDEEEDLRTELN^ 

NIVSFYGAFFKLSPPGQRHQLWMVMELCAAGSVTDVVimTSNQSLKEDWIAYICREILQGIJUiLHAHRVIHR^ 

LTHNAEVKLVDFGVSAQVSRTNGRRNSFIGTPYViMAPEVIDCDEDPRRSYPYRSDVWSVGITAIEMAEGAPPLC^ 

ALFVXLRESAPTVKSSGWSRKFHNFMEKCTIKNPLFRPTSANMLQHPFVRDIKNERHVVESL^ 

EKKSKVSTLRQiUJUCRLSPKRFRAKSSWRPEKLELSDLEARRQRRQRRWEDIFNQHEEELRQVDKDKm 

IQAEVQIEPLKPYISNPKKlEVQERSPSVPNNQDHAHHVKFSSSVPQRSljLEQAQKPIDIRQRSSQmQNWL;^ 

KIIAGXTQSYCLTIYISEVKKEEFQEGMNQKCQGAQVGLGPEGHCIWQLGESSSEEESPVTGRRSQSSPPYSTIDQKLLV 

DIHVPDGFKVGKISPPVYLTNEWVGYNALSEIPRNDWLTPAPVIQPPEEDGDYVELYDASADTDGDDDDESN^ 

HANG^nDDLDNQTOOA^roVCKDHDDDNNK^VDDVNNlrlrYEAPSCPRASYGRDGSCK 

•HGGSAASEDNAAIGDQEEHAANIGSERRaSEGDGGKGWRTSEESGAWSLNaEENCSETDGPGLKRPASQDFEYLOEEPG 
GGNEASNAIDSGAAPSAPDHESDNKDISESSTQSDFSA^mSSPSKGSGMSADAK^PASAILYAGPVEVPEES 
NPLYVSPACKKPLIHMYEKEFTSEICCGSLWGVNLLLGTRSNLYLMDRSGKADITKilRRRPFRQIOVLEPLNLL 
HK^fRLRVYKLTWLRNKIIJJlroPESKRRQEEMLKTEEACKAlDKLTGCEHFSVLQHEEa^ 
ESTAIKVFPTLDHKPVTVDLAIGSEKRLKIFFSSAMYHLIDAESEVMSDVTLPKNPL^^ 
FNAEALSVEANEQLFKKILEMWKDIPSSIAFECTQRTTGWGQKAIEVRSLQSRVLESELKRRSIKKLRFLC 
STLRNHHSRVYFMTLQKLEELQSNYDV 

>PAK5.h 

MFQKRKKRVEISAP3NFEHRVHTGFDQHEQKFTGLPRQWQSLIEESARRPKPLVDPACITSIQPGAPKTIVRGSKGAKDG 

ALTLLLDEFENMSVTRSNSUUU3SPPPPARARQENGMPEEPATTARGGPGKAGSRGRFAGHSEAG0GSGDRRRAGPEK^ 

KSSREGSGGPQESSRDKRPLSGPDVGTPQPAGLASGAKLAAGRPPNTYPRADTDHPSRGAQGEPHDVAPNGPSAGGLAIP 

QSSSSSSRPPTRARGAPSPGVLGPHASEPQLAPPACTPAAPAVPGPPGPRSPQREPQRVSHEQFRAALQLWDPGDPRSY 

LDNFIKIGEGSTGIVCIATVRSSGKLVAWKMDLRKQQRRELLFNEVVIMRDYQHENVVEMYNSYLVGDEL 

GALrDIVTHTRMNEEQIAAVCIA\^QALSVLHAOGVIHRDIKSDSILLTHDGRVKLSDFGFCAQVSKEVPF^ 

WMAPSLISRLPYGPEVDIWSLGIMVIEMV1X3EPPYFNEPPLKAMKMIRD^^ 

ATAAELLKHPFLAKAGPPASIVPLMRQNRTR 

>GEK2,h 

MAFANFRRILRLSTFEKRKSRE:YEHVRRDLDPNEVVrt:iVGEt^DGAFGKVYKAKNKETGA3^^ 

SIEIIJSiTCDHPYIVKLLGAYYHDGKLWIMIEFCPGGAVDAIMLELDRGLTEPQIQVVCROMLEALNFLHSKRIIHRDLKA 

QNVLMTLEGDIRI^FGVSAKNLKTLQKRDSFIGTPYWMAPEVVMCETMKDTPYDYKADIVJSLGITL 

NPMRVLLKIAKSDPPTLLTPSKWSVEFRDFLKIALDKNPETRPSAAQLLEHPFVSSITS^^CALRELVAEAKA£Vl^ 

GRDEGEEEDAVDAASTLENHTQNSSEVSPPSLNADKPLEESPSTPLAPSOSQDSVNEPCSQPSGDRSLQTTSPPWAPGN 

ENGLAVPVPL•RKSRPVS^©ARIQVAOEKQVAEQGGDLSPAANRSQKASQSRPNSSALETLGGEKLANGSLEPPAQAAPGP 

SXRDSDCSSLCTSESMDYGTNLSTDLSLNKEKGSLSIKDPKLYKKTLKRTRKFVVDGVEVSITTSKIISEDEKKDEEMRF 

LRRQELRELRLLOKEEHRNOTQI-SI«CHELQLEOMHKRFE0EI^LAKKKFFIy^ELE^^LERQOK0OVEKMEODHAV^ 
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RIRL£QD:OYTRFQEQLKLMKKEVKNEVEKLPRQORKESMKQKMEEHTQKKQLU5RDPVAKQICEDLEU^ 

ICDKERECL>JKKQELLRDREAALWEMEEHQLQERHQLVKQQLKDQYFLQRHELLRKHEKEREQMQRYNQRMIEOl-KVRQC! 

QEKARLPKIQRSEGKTRMAMYKKSLHINQQOSAAEQREKIKQPSQQEEKRQKSERLQQQQKHENQMRDMLAQCESNMSEL 

QQLQNEKCHLLVEHETQKLKALDESHNQNLKEWRDKLRPRKKALEEDLNQKKREQEMFFKLSEEAECPNPSTPSK^ 

PYSSGPAS 
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>STLK5.h 

GGCCAAGACGGTCGGGGCTGCTTGCTAACTCCAQGAACAGGTTTAAGTTTTTGAAACTGAAGTAGGTCTACACAGTAGGA 
ACTCATGTCATTTCTTGTAAGTAAACCAGAGCGAATCAGGCGGTGGGTCTCGGAAAAGTTCATTGTTGAGGGCTTAAGAG 

atttggaactatttggagaccaatgatgcgagctcagagt£:aatagcatccttctctaaacaggaggtcatgagtagctt 

TCTGCCAGAGGGAGGGTGTTACGAGCTGCTCACTGTGATAGGCAAAGGATTTGAGGACCTGATGACTGTGAATCTAGCAA 

ggtacaaaccaacaggagagtacgtgactgtacggaggattaacctagaagcttgttccaatgagatggtaacattcttg 

CAGGGCGAGCTGCATGTCTCCAAACTCTTCAACCATCCCAATATCGTGCCATATCGAGCCIACTTTTATTGCAGACAATGA 
GCTGTGGGTTGTCACATCATTCATGGCATACGGTTCTGCAAAAGATCTCATCTGTACACACTTCATGGATGGCATGAA'rG 
AGCTGGCGAT7GCTTACATCCTGCAGGGGGTGCTGAAGGCCCTCGACTACATCCACCACATGGGATATGTACACAGGAGT 
GTCAAAGCCAGCCACATCCTGATCTCTGTGGATOGGAAGGTCTACCTGTCTGGTTTGCGCAGCAACCTCAGCATOATAAG 
CCATGGGCAGCGGCAGCGAGTGGTCCACGATTTTCCCAAGTACAGTGTCAAGGTTCTGCCGTGGCTCAGCCCCGAGGTCC 
TCCAGCAGAATCTCCAGGGTTATGATGCCAAGTCTGACATCTACAGTGTGGGAATCACAGCCTGTGAACTGGCCAACGGC 
CATGTCCCCTTTAAGGATATGCCTGCCACCCAGATGCTGCTAGAGAAACTGAACGGCACAGTGCCCTGCCTGTTGGATAC 
CAGCACCATCCCCGCTGAGGAGCTGACCATGAGCCCTTCGCGCTCAGTGGCCAACTCTGGCCTGAGTGACAGCCTGACCA 
CCAGCACCCCCCGGCCCTCCAACGGTOACTCGCCCTCCCACCCCTACCACCGAACCTTCTCCCCCCACTTCCACCACTTT 
GTGGAGCAGTGCCTTCAGCGCAACCCGGATGCCAGGCCCAGTGCCAGCACCCTCCTGAACCACTCTTTCTTCAAGCAGAT 
• CAAGCGACGTGCCTCAGAGGCTTTGCCCGAATTGCTTCGTCCTGTCACCCCCATCACCAATTTTGAGGGCAGCCAGTCTC 
AGGACCACAGTGGAATCTTTGGCCTGGTAACAAACCTGGAAGAGCTGGAGGTGGACGATTGGGAGTTCTGAGCCTCTGCA 
AACTGTGCGCATTCTCCAGCCAGGGATGCAGAGGCCACCCAGAGGCCCTTCCTGAGGGCCGGCCACATTCCCQCCCTCCT 
GGGCAGATTGGGTAGAAAGGACArrCTTCCAGGAAAGTTGACTGCTGACTGATTGGGAAAGAAAATCCTGGAGAGATACT 
TCACTCCTCCAAGGCTTTTGAGACACAAGGGAATCTCAACAACCAGGGATCAGGAGGGTCCAAAGCCGACATTCCCAOTC 
CTGTGAGCTCAGGTGACCTCCTCCGCAGAAGAGAGATGCTGCTCTGGCCCTQQQAGCTQAATTCCAAGCCCAGGGTTTGG 
CTCCTTAAACCCGAGGACCGCCACCTCTTCCCAGTGCTTGCGACCAGCCTCATTCTATTTAACTTTGCTCTCAQATGCCT 
CAGATGCTATAGGTCAGTGAAAGGGCAAGTAGTAAGCTGCCTGCCTCCCTTCCCTCAGACCTCTCCCTCATAATTCCAGA 
GAAGGGCATTTCTGTCTTTTTAAGCACAGACTAAGGCTGGAACAGTCCATCCTTATCCCTCTTCTGGCTTGGGCCCTGAC 
ACCTAAGTCTTTCCCACGGTTTATGTGTGTGCCTCATTCCTrrCCCACCAAGAATCCATCTTAGCGCCTCCTQCCAGCTG 
CCCTGGTGCTTTCTCCAAGGGCCATCAGTGTCTTGCCTAGCTTGAGGGCTTAAGTCCTTATGCTGTGTTAGTTTCG 
CAGAACAAATTAAAATTTTCAGAGACGCTG 

>STLK6_h 

AAGGAAGATAAAACAAAAGCCTTCTTTGGAATAGATGGATTTTTGTCACTTTCTGTGTCAACTAAAOTC^ 

CTTTTGGATTGCTTCTGCACTTCAAGAACACAAGTTGAATCACTCAGACCTGAAAAACAGTCTGAAACCAGTATCCATCA 

ATACTTGGTTGATGAGCCAACCCTTTCCTaGTCACGTCCATC»CTAGAGCCAGTGAAGTACTATOTTCC7^CC^ 

CTCACTATCAGCTCCAAGTAGAAATAGGAAGAGGATTTGACAACTTGACTTCTGTCCATCTTGCACGGCATACTCCCACG 

GGAACACTGGTAACTATAAAAATTACAAATCTGGAAAACTGCAATGAAGAACGCCTGAAAGCTTTACAGAAAGCCGTQAT 

TCTATCCCACTTTTTCCGGCATCCCAATATTACAACTTATTGGACAGTTTTCACTGTTGGCAGCTGGCTTTGGGTTA 

CTCCATTTATGGCCTATGGTTCAGCAAGTCAACTCTTGAGGACCTATTTTCCTGAAGGAATGAGTGAAACTTTAATA^ 

AACATTCTCTTTGGAGCCGTGAGAGGGTTGAACTATCTGCACCAAAATGGCTGTATTCACAGGAGTATTAAAGCCAGCCA 

TATCCTCATTTCTGGTGATGGCCTAGTGACCCTCTCTGQCCTTTCCCATCTGCATAGTTTGGTTAAGCATGGACAGAGGC 

ATAGGGCTGTGTATGATTTCCCACAGTTCAGCACATCAGTCCAGCCGTGGCTGAGTCCAGAACTACTOAGACAGGATTTA 

CATGGGTATAATGTGAAGTCAGATATTTACAGTGTTGGGATTACAGCATGTGAATTAGCCAGTGGGCAGGTGCCTTTCCA 

GGACATGCATAGAACTCAGATGCTGTTACAGAAACTGAAAGGTCCTCCTTATAGCCCATTGGATATCAGTATTTTCCCTC 

AATCAGAATCCAGAATGAAAAATTCCCAGTCAGGTGTAGACTCTGGGATTGGAGAAAGTGTGCTTGTCTCCAGTGGAACT 

CACACAGTAAATAGTGACCGATTACACACACCATCCTCAAAAACTTTCTCTCCTGCCTTCTTTAGCTTGGTACAGCTCTG 

TrrQCAACAAGATCCTGAGAAAAGGCCATCAGCAAGCAGTTTATTGTCCCATGTTTTCTTCAAACAGATGAAAGAAGAAA 

GCCAGGArrCAATACTTTCACTGTTGCCTCCTGCTTATAACAAGCCATCAATATCATTGCCTCCAGTGTTACCTTGQACT 

GAGCCAGAATGTGATTTTCCTGATGAAAAAGACTCATACTGGGAATTCTAGGGCTGCCAAATCATTTTATGTCCTATATA 

CTTGACACTTTCTCCTTGCTGCTTTTTCTTCTGTATTTCTAGGTACAAATACCAQAATTATACTTGAAAATACAGTTGGT 

GCACTGGAGAATCTATTATTTAAAACCACTCTGTTCAAAGGGGCACCAGTTTGTAGTCCCTCTGTTTCGCACAQAGTACT 

ATQACAAGGAAACATCAGAATTACTAATCTAGCTAGTGTCATTTATTCTGGAATTTTTTTCTAAGCTGTGACT/kACTCTT 

TTTATCTCTCAATATAATTTTTGAGCCAGTTAATTTTTTTCAGTATTTTGCTGTCCCTTGGGAATGGGCCCTCAOAGGAC 

AGTGCTTCCAAGTACATCTTCTCCCAGATTCTCTGGCCTTTTTAATGAGCTATTGTTAAACCAACAGGCTAGTTTATCTT 

ACATCAGACCCTTTTCTGGTAGAGGGAAAATGTTTGTGCTTTCCCTTTTTCTTCTGTTAATACTTATGGTAACACCTAAC 

TGAGCCTCACTCACATTAAATGATTCACTTGAAATATATACAGAAATTGTAATTTGCTTTTTTTTAAAAAAGGGGGCTAA 

AGTAACACTTTCCTACTTATGTAAATTATAGATCCTAAATTCACGCACCCCGTGGGAGCTCAATAAAGATTTACTGAATT 
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>STLK7_h 

ATGQCGGGACCTGGGGGCTGGAGGGACAGGGAGGTCACGGATCTGGGCCACCTGCCGGATCCAACTGGAATATTCTCACI. 
AGATAAAACCATTGGCCTTGGTACTTATGGCAGAATCTATTTGGGACTTCATGAGAAGACTGGTGCATTTAoI^^^^ 




TTGJUCTCTATGATGCCAGTGCTGATACTGATGGTGATGATGATGATGAGTCTAATGATACTTTT^^ 

catgccaatggcaatgatgactosgataaccaggttsatcaggctaatgatgtttgtaaagaSat^^ 

TAAGTTTGTTGATGATGTAAATAATAATTA-I-rATGAGGCGCCTAGTTGTCCAAGGGCAAGC^^^ 

^^^^^°''^°°'^'''''*^=°'^°''='^^°*aaagag6aagcctacagaggctatggaaSc^^^ 
catggaggaagtgcagccagtgaggacaatgcagccattggagatcaggaagaacatgcagcSatS 

^°^^S^'^f°°^°AT°°A'^GTAAGGOAGTCGTTCGAACCAGTGAAG^^ 
AA^ATTGCTCAGAGACAGA'i^TCCAGGATTGAAGAGACCTGCGTCTCAGGACTTTGAATA 

^^^^f^!"I^^"'=**^=AG*'^TTCTGCCAATCACTCATCTCCTTCCAAAGaTTCTQG 
^*!!?^;^S?=^'^A*='^^^A'=2CTGGATTCGTAGAAGTACCTCAGGAATCACCTA^ 

Af^^^^^^'^^'''^°'^°''''*°*'^^°'='''^°^*^°^'AlTX3ATAAGTTAACA0GCTGTGAAC^^ 
^!°^-^^f^=^TATATTGCAATTGCTTTGAAATCATCAArrCACCTTTATGCATGGScC^^ 

"fI^?^r"^'^'^'^^"°'^'^^^°'^*'^''-°°*'^*'^^A^CTCATCGATGCAGAATCTGAGGTTATGTCTGSOT^ 
^;;^°^^;;=^^^°°^T=ATTATACCACAGAATATCATCATTTTACCTGATTGCTTGGG^^^^ 



>PAK5_h 
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CGGGAGTGTCCGCGGTGGTGGCGGTGCAAGAGAGCTGAAGGAGGCGCGAGGGCGCGGAGTTCCAGGCCQAGCAGTTAGGC 

CGCGAGCGACTGCGGCGCCGAGCCGATGAGTAACCCGAAGCCCCTAGAGGAGTGGTCACCTGCCTGAGGGCACTTCTGTC 

CCACCAGCATCAGACCAGGCCGCACCGAGTCCCCGGCACCATGTTTGGGAAGAGGAAGAAGCGGGTGGAGATCTCCGCGC 

CGTCCAACTTCGAGCACCGCGTGCACACGGGCTTCGACCAGCACGAGCAGAAGTTCACGGGGCTGCCCCGCCAGTGGCAG 

AGCCTGATCGAGGAGTCGGCTCGCCGGCCCAAGCCCCTCGTCGACCCCGCCTGCATCACCTCCATCCAGCCCGGGGCCCC 

CAAGACCATCGTGCGGGGCAGCAAAGGTGCCAAAGATGGGGCCCTCACGCTGCTGCTGGACGAGTTTGAGAACATGTCGG 

TGACACGCTCCAACTCCCTGCGGAGAGACAGCCCGCCGCCGCCCGCCCGTGCCCGCCAGGAAAATGGGATGCCAGAGGAG 

CCGGCCACCACGGCCAGAGGGGGCCCAGGGAAGGCAGGCAGCCGAGGCCGGTTCGCCGGTCACAGCGAGGCGGGTGGCGG 

CAGTGGTGACAGGCGACGGGCGGGGCCAGAGAAGAGGCCCAAGTCTTCCAGGQAGGGCTCAGGGGGTCCCCAGGAGTCCT 

CCCGCGACAAACGCCCCCTCTCCGGGCCTGATQTCGGCACCCCCCAGCCTGCTGQTCTGGCCAGTGGGGCGAAACTGGCA 

GC'TGGCCGGCCCTTTAACACCTACCCGAGGGCTGACACGCACCACCCATCCCGGGGTGCCCAGGGGGAGCCTCATGACGT 

QGCCCCTAACGGGCCATCAGCGGGGGGCCTGGCCATCCCCCAGTCCTCCTCCTCCTCCTCCGGGCCTCCCACCCGAGCCC 

GAGGTGCCCCCAGCCCTGGAGTGCTGGGACCCCACGCCTCAGAGCCCCAGCTGGCCCCTCCAGCCTGCACCCCCGCCGCC 

CCTGCTGTTCCTGGGCCCCCTGQCCCCCGCTCACCACAGCGGGAGCCACAGCOAGTATCCCATGAGCAGTTCCGGGCTGC 

CCTGCAGCTGGTGGTGGACCCAGGCGACCCCCGCTCCTACCTGGACAACTTCATCAAGATTGGCGAQGGCTCCACGGQCA 

TCQTGTGCATCGCCACCGTGCGCAGCTCGGGCAAGCTGGTGOCCQTCAAGAAGATGGACCTGCGCAAGCAGCAGAGGCGC 

GAGCTGCTCTTCAACGAGGTGGTAATCATGAGGGACTACCAGCACGAGAATGTGGTGGAGATGTACAACAGCTACCTGGT 

GGGGGACGAGCTCTGGGTGGTCATGGAGTTCCTGGAAGGAGGCGCCCTCACCGACATCGTCACCCACACCAGGATGAACG 

AGGAGCAGATCGCGGCCGTGTGCCTTGCAQTGCTQCAGGCCCTGTCGOTGCTCCACGCCCAGGGCGTCATCCACCGGGAC 

ATCAAGAGCGACTCGATCCTGCTGACCCATGATGGCAGGGTGAAGCTGTCAGACTTTGGGTTCTGCGCCCAGGTGAGCAA 

GGAAGTGCCCCGAAGGAAGTCGCTGGTCGGCACGCCCTACTGGATGGCCCCAGAGCTCATCTCCCGCCTTCCCTACGGGC 

CAGAGGTAGACATCTGGTCGCTGGGGATAATGGTGATTGAGATGGTGGACGGAGAGCCCCCCfACTTCAACGAGCCACCC 

CTCAAAGCCATGAAGATGATTCGGGACAACCTGCCACCCCGACTGAAGAACCTGCACAAGGTGTCGCCATCCCTGAAGGG 

CTTCCTGGACCGCCTGCTGGTGCGAGACCCTGCCCAGCGGGCCACGGCAGCCGAGCTGCTGAAGCACCCATTCCTGGCCA 

AGGCAGGGCCGCCTGCCAGCATCGTCCCCCTCATGCGCCAGAACCGCACCAGATGAGGCCCAGCGCCCTTCCCCTCAACC 

AAAGAGCCCCCCCGGGTCACCCCCGCCCCACTGAGGCCAGTAGGGGGCCAGGCCTCCCACTCCTCCCAGCCCGGGAGATG 

CTCCGCGTGGCACCACCCTCCTTGCTGGGGGTAGATGAGACCCTACTACTGAACTCCAGTTTTGATCTCGTOACTTTTAG 

iLAAAACACAGGGACTCGTGGGAGCAAGCGAGGCTCCCAGGACCcCCAcCCTcTGGGACAGGCCCTCCcCCATGTTCTTCT 

GTCTCCAGGAAGGGCAGCGGCCCTCCCATCACTGGAAGTCTGCAGTGGGGGTCGCTGGGGGTGGAGAGAACACTAAGAGG 

TGAACATGTATGAGTGTGTGCACGCGTGTGAGTGTGCATGTGTGTGTGTGTGCAAAGGTCCAGCCACCCCGTCCTCCAGC 

CCGCAAGGGGTGTCTGGCGCCTTGCCTGACACCCAGCCCCCTCTCCCCCTGAGCCATTGTGGGGGTCGATCATGAATGTC 

CGAAGAGTGGCCTTTTCCCGTAGCCCTGCGCCCCCTTTCTGTGGCTGGATGGGGAGACAGGTCAGGGCCCCCCACCCTCT 

CCAGCCCCTGCAGCAAATGACTACTGCACCTGGACAGCCTCCTCr-TTTCTAGAAGTCTATTTATATTQTC^ 

ACTCTAGCCCCTGCCCTTATTGGGGGACAGATGGTCCCTGTCCTGCGGGGTGGCCCTGGCAGAACCACTGCCTGAAGAAC 

CAGGTTCCTGCCCGGTCAGCGCAGCCCCAGCCCGCCCACCCCTGCCTCGAGTTAGTTTTACAATTAAAACATTGTCTTGT 

TTTQTG 

>GEK2_h 

CGAAGCCACAGCCCGAGCCCGAGCCCGAGCCCGAGCCGGCGCCACCGCGCCCCCGGCCATGGCTTTTGCCAATTTCCGCC 
GCATCCTGCGCCTGTCTACCTTCGAGAAGAGAAAGTCCCGCGAATATGAGCACGTCCGCCGCGACCTGGACCCCAACQAG 
QTGTQGGAGATCGTGGGCGAGCTGGGCGACGGCGCCTTCGGCAAGGTTTACAAGGCCAAGAATAAGQAGACGGGTGCTTT 
GGCTGCGGCCAAAGTCATTGAAACCAAGAGTGAGGAGGAGCTGGAGGACTACATCGTGGAGATTGAGATCCTGGCCACCT 
GCGACCACCCCTACATTGTGAAGCTCCTGGGAGCCTACTATCACGACGGGAAGCTGTGGATCATGATTGAGTTCTGTCCA 
GGGGGAGCCGTGGACGCCATCATGCTGGAGCTGGACAGAGGCCTCACGQAGCCCCAQATACAGGTGGTTTGCCGCCiiGAT 
GCTAGAAGCCCTCAACTTCCTGCACAGCAAOAGGATCATCCACCQAGATCTGAAAGCTGGCAACGTGCTGATGACCCTCG 
AGGGAOACATCAGQCTGGCTGACTTTGGTGTGTCTGCCAAGAATCTGAAGACTCTACAGAAACGAGATTCCTTCATCGGC 
ACGCCTTACTGGATGGCCCCCGAGGTGGTCATGTGTGAGACCATGAAAGACACGCCCTACGACTACAAAGCCGACATCTG 
GTCCCTGGGCATCACGCTGATTGAGATGGCCCAGATCGAGCCGCCACACCACGAGCTCAACCCCATGCGQQTCCTGCTAA 
AGATCGCCAAGTCGGACCCTCCCACGCTGCTCACGCCCTCCAAGTGGTCTGTAGAGTTCCGTGACTTCCTGAAGATAGCC 
CTGQATAAGAACCCAGAAACCCGACCCAOTGCCGCOCAGCTGCTGGAGCATCCCTTCGTCAGCAGCATCACCAGTAACAA 
GGCTCTGCGGGAGCTGGTGGCTGAGGCCAAGGCCGAGGTGATGGAAGAGATCGAAGACGGCCGGGATGAGGGGGAAGAGG 
AGGACGCCGTGGATGCCGCCTCCACCCTGGAGAACCATACTCAGAACTCCTCTGAGGTGAGTCCGCCAAGCCTCAATGCT 
GACAAGCCTCTCGAGGAGTCACCTTCCACCCCGCTGGCACCCAGCCAGTCTCAGGACAGTGTGAATGAGCCCTGCAGCCA 
GCCCTCTGGGGACAGATCCCTCCAAACCACCAGTCCCCCAGTCGTGGCCCCTGGAAATGAOAACGGCCTGGCAGTGCCTG 
TGCCCCTGCGGAAGTCCCGACCCGTGTCAATGGATGCCAGAATTCAGGTAGCCCAGGAGAAGCAAGTTGCTGAGCAGGGT 
GGGGACCTCAGCCCAGCAGCCAACAGATCTC-^AAAGGCCAGCCAGAGCCGGCCCAACAGCAGCGCCCTGQAOACCTTGGG 
TCCGCAGAAGCTGGCCAATGGCAGCCTGGAGCCACCTGCCCAGGCAQCTCCAGGGCCTTCCAAGAGGGACTCQGACTGCA 
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CATA^ACCrTTTGAACAGGAAATCAACGCCAAGAAGAAGTTCTTTGACACGGAATTAGAGAA^^ 

J^*°^^'^^°^*'=**<^'^^=^A'S'=AAGACCATGCCGTGCGCCGCCGQGAGGAGGCCAGGCG^TC^^ 

°f°^f^^^^^'^*°°^^^'^'^SAGCAGCTCAAACTGATGAAGAAAGAGGTGAAGAACGAGGTCGAC^^ 

GCAGAAGGAGGACCTGGAGCTGGCCATGAAGAGGCTCACCACCGACAACAGGCGGGAGATCTGTGACAAGG^^^ 
GCCTCATGAAGAAGCAGGAGCTCCTTCGAGACCGGGAAGCAGCCCTGT^G^^^^^ 
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Db«LOKl m 
Qy = GEK2lh 



Db 1 t^FANFRRILRLSTFEKRKSREVEHVRJU)lJ3PNDVWEIVGEIiGDGAFGKVYKAKNKETGA 60 

Qy 1 MAFANFRRILJILSTFEKRKSREYEHVRRDLDPNEWEIVGELGDGAFGKVYKAKNICETGA 60 

Db 61 LAAAKVIETKSEEELEDYIVEIEILATCDHPYIVKH/GAYYYDGKLWIMIEFCPGGAVDA 12 0 

Qy 61 LAAAKVIETKSEEELEDYIVElEILATCDHPYIVKLIiGAYYHDGKLWIMIEFCPGGAVDA 12 0 

Db 121 IMLELDRGLTEPQIQ\rvCRQMLEALNFIiHGKRIIHRDLKAGNVLMTLEGDIRI*ADFGVSA 180 

Qy 121 IMI,ELDRGLTEPQIQ\r7CRQMLEALNFLHSKRIIHRDLKAGMVI*MTLEGDIRLADFGVSA 180 

Db 181 KNLKTLQKRDSFIGTPYW^mPEWLCETMKDAPYDYKADIMSIiGITIiIEMAQIEPPHHEL 24 0 

Qy 131 KNLKTLQKRDSFIGTPYW^WPE^VVMCETMKDTPYDYKADIWSLGITLIEMAQIEPPHHEL 24 0 

Db- 241 . NPMRVLLKIAKSDPPTLLTPSKWSVEFRDFLKIAIiDKNPETRPSAAQLLQHPrVSR^/TSN 300 

Qy 241 NPMRVLLKIAKSDPPTLLTPSKWSVEFRDFLKIAIiDKNPETRPSAAQLLEHPFVSSITSN 300 



Db 301 KAIJlELVAEAKAEVMEEIEDGREDGEEEDAVDAVPPLVNHTQDSANVTQPSLDSNia.LQD 36 0 
Qy 301 KALRELVAEAKAEVMEEIEDGRDEGEEEDAVDAASTLENHTQNSSBVSPPSLNADKPLEE 360 



Db 361 S-STPLPPSQPQEPVNGPCSQPSGDGPLQTTSPADGLSKNDNDLKVPVPLRKSRPLSMDA 419 
Qy 361 SPSTPLAPSQSQDSVNEPCSQPSGDRSLQTTSPPWAPGNENGLAVPVPLRKSRPVSMDA 42 0 



♦ * ♦ 



Db 4 20 RIQMDEEKQIPDQDENPSPAASKSQKANQSRPNSSALETIjGGEALTNGGLELPSSVTPSH 479 

Qy 421 RIQVAQEKQVAEQGGDLSPAANRSQKASQSRPNSSALETIiGGEKIiANGSLEPPAQAAPGP 4 80 

Db 4 80 SKRASDCSNLSTSESMDYGTSLSADLSLNKETGSLSLKGSKLHNKTLKRTRRFVVDGVEV 539 

Qy 4 81 SKRDSDCSSLCTSESMDYGTNLSTDLSLNKEKGSLSIKDPKLYKKTL.KRTRKFVVDGVEV 54 0 

Db 54 0 SITTSKIISEDEKKDEEMRFIJIRQELREUU.LQKEEHRNQTQLSSKHELQLEQMHKRFEQ 599 

Qy 541 SITTSKIISEDEKKDEEMRFI*RRQELRELRIJ-QKEEHRNQTQl*SNKHELQLEQMHKRFEQ 600 

Db 600 EINAKKKFYDVELENLERQQKQQVEKMEQDHSVRRKEEAKRIRI^EQDRDYAKFQEQLKQM 65 9 

Qy 501 EINAKKKFFDTELENIjERQQKQQVEKMEQDHAVRRREEARRIRLEQDRDYTRFQEQLKLM 660 

Db 66 0 ICKEVKSEVEKI.PRQQRKESMKQKMEEHSQKKQRLDRDFVAKQKEDLEIAMRKLTTENRRE 719 

Qy 651 KKEVKNEVEKLPRQQRKESMKQKMEEHTQKKQLLDRDFVAKQKEDLELAMKRLTTDNRRE 720 

Db 72 0 ICDKERDCLSKKQELI*RDREAAI*WEMEEHQLQERHQDVKQQLKDQYFLQRHDLLRKHEKE 779 

Qy 721 ICDKERECUiKKQELLRDREAALWEMEEHQLQERHQLVKQQLKDQYFLQRHELLRKHEKE 7 80 

Db 780 REQMQRYNQRMMEQLKVRQCQEKARL.PKIQRSDGETRMAMYKKSLHINGAGSASEQREKI 83 9 

Qy 781 REQMQRYITQRMIEQLKVRQQQEKARLPKIQRSEGKTRMAMYKKSLHINGGGSAAEQREKI 840 

Db 84 0 KQFSQQEEKRQKAERLQQQQKHEHQMRDMVAQCESNMSELOQLQNEKCYLLVEHETQKLK 8 99 

Qy 341 KQFSQQEEKRQKSERLQQQQKHENQMRDMLAQCESNMSELQQLQNEKCHLLVEHETQKLK 900 

Db 900 AXJ3ESHNQSLKEWRDKLRPRKKALEEDLNQKKREQEMFFKLSEEAE-PRPTTPSKASNFF 958 

Qy 901 AIJ}ESHNQNLKEWRDKLRPRKKALEEDLNQKKREQEMFFKLSEEAECPNPSTPSKAAKFF 960 



Db 959 PYSSGDAS 9 66 
Qy 961 PYSSGDAS 968 
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SEQUENCE LISTING 

<110> SUGEN, INC. 

<12C> STE2C-RELATED PROTEIN KINASES 

<130> 240/300-PCT 

<140> TO 3E ASSIGNED 
<141> Herewith 

<150> US 60/081, 784 
<i51> 1993-04-14 

<160> 147 



<170> FastSEQ for Windows Version 3.0 

<210> 1 
<21i> 3268 
<212> DNA 

<213> Marrjnalian (Human) STLK2 
<400> 1 

taacagccca cctccragQc ccgggctacg cgccgccagc ccagtaaccc cacttttgtg 60 

tgtcctccca ggccccgatc gaaaagcctg ggagggccgc cgaactaccc ccggagggag 120 

gagccagtcc gaacccaagg cgccaccgcc gcagaagcgg agcgaggcag cattcgcctc 180 

catggcccac rcgccggtgg ctgcccaagt gcctgggatg cagaataaca tagctgatcc 240 

agaagaacrg ttcacaaaat tagagcgcat tgggaaaggc tcatttgggg aagttttcaa 300 

aggaattgat aaccgtaccc agcaagtcgt rgctattaaa atcatagacc ttgaggaagc 360 

cgaagargaa atagaagaca tucagcaaga aaizaactgtc ttgagtcaat gtgacagctc 420 

aratgraaca r -^atactatg ggtcatattt aaaggggtcr aaattatgga taataatgga 480 

atacctgggc -jrggttcag cactggatct tcttcgagct ggtccatttg atgagttcca 540 

gattgctacc atgctaaagg aaattttaaa aggtctggac ratctgcatt cagaaaagaa 600 

aattcaccga gacataaaag ctgccaatgt ctrgctctca gaacaaggag atgttaaact 660 

tgccgatttt ggagttgctg gtcagctgac agatacacag attaaaagaa atacctttgt 720 

gggaactcca ttttggargg ctccrgaagt tattcaacag tcagcttatg actcaaaagc 780 

tgacatttgg tcattgggaa ttactgctat tgaactagcc aagggagagc cacctaactc 840 

cgaracgcat ccaatgagag ttctgtttct tattcccaaa aacaatcctc caactcttgt 900 

tggagacttt actaagtctt ttaaggagtt tattgatgct tgcctgaaca aagatccatc 960 

atttcgtcct acagcaaaag aactrctgaa acacaaattc artgtaaaaa attcaaagaa 1020 

gacttcttat ctgaccgaac rgatagatcg ttttaagaga tggaaggcag aaggacacag 1080 

tgatgatgaa tctgatcccg agggctctga ttcggaatct accagcaggg aaaacaatac 1140 

ccatcctgaa tggagcttta ccaccgracg aaagaagcct gatccaaaga aagtacagaa 1200 

tggggcagag caagatcttg tgcaaaccct gagt-gtttg tcratgataa tcacacctgc 1260 

atttgctgaa cttaaacagc aggacgagaa taacgctagc aggaatcagg cgatrgaaga 1320 

acEcgagaaa agtattgcrg tggctgaagc cgcctgtccc ggcatcacag ataaaatggt 1380 

gaagaaacta atrgaaaaat ttcaaaagrg rtcagcagac gaatccccct aagaaactta 1440 
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ttattggctt ctgtttcata tggacccaga gagccccacc aaacctacgt "^^^^"^^ ° 
aatac??aac ccatgagctc catgtgcctt t.ggatcttt gcaacactga \lf 
gaagctatta aactartttg tgatggcgtt tatcatttta cattttgaaa ^^^ttatttc 1620 
gtaaggaata acttttaata cratagtttc acctgtattc tagtaaatgt tgagacaccg 1680 
?tt?gctttt aagtatccct atttcttaag ttacgaggat gaataccttt cacattttga 
tctt^ag tg acLtacagt catgaaacat acaggtcttt caaagtcatt ct-atat c 
aactt'Jgta aattarcaag cttcaaaaag ctttttttta aaaaaaaaaa cacgcatat. 
craalaatqa ctattggtgg ggaggtgtaa ataagtcata ccrtcttaaa acagaaaat. 
taaq?aaagJ ctJtraaa^g aaac^tgtaa aagtatrgac tcttctacca agttggtatg 
a?a?'ccagg cagctcaatg attatcacat ttgagaccct: gtgtttgaag catttacagg 
caatg^acag caLagagg. acctcttggt gtatagtart tacattctct -taggtaga 
aaa4caatt tracccttat ttcacatggt tagaaattta aagcaagatc attcacccaa 
aoataqg?;t ttggtaatgt tgaaggagtt agtctggctt cacgttttac atcttcaact 
aaaarcccft ^ctatctgct tggatttgga gagccaaaaa ataaagctga -.tcatgtg 
attaaatatc tgatcaacag gtatgaatat aacttaaatc agcatatt.. ^gccarggta 
a.aaattgtc crataaacta tttatatatt tttgttcttc ataattatca ^^^^^^^^^^ 



ra^ratgcc. gccJaagttc tgaactgggg ctgtatttca ^tagttgtag -t.attgat 
atttaq^tt^ gatagctaat gtttaattgt ttggatctgc acagtttggt ttttgcacaa 
fa^r^a^tt; aaaaaatctg agtaattgtc aaatattaaa agaaagatat tcttcctgta 
;,nLatacaq tttttaqtca aagtggccat tacatcctct ttttaattta cataa.acag 
llllllllll a'ag^gttgt gg^g^tgtat gccaagaaaa --ttttt .ggtgcctat 
attgtaacaa ttatttttaa tgcattgtat ttrgaagtaa cggttcagtt aaatttttca 
cctgcrgtqc aactgaaaca caattacagt ttataatcat ctgtagaagt ^^g^^gataa 
^tttgcaact ca.gttatgg gttaaatgaa tattttcg.a aaagtaaaag caacaaattt 
ataaactqat tatttgaaac tttacaacac aattgcatcc caaatacaaa ttgtattgct 
taJtJattat agctattcgt cctgtaatct gtttctaggt gaagcatact ccagrgtttt 
IggggJtttg aaaataaata tttaaatttc acagtcaaaa aaaaaaaaaa aaaaaaaaaa 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



cag^t g ; l^^^^ ^^Itatttaa gtgagcattt tctagttcat atgaaaa a 2 0 

^catlgtLa-qgatgatttc tgtccacaca aaggttaaat tagattgcac agttaatttt 2520 

^actJatatt tatggtacta ttatgrgggt gargcctttc rcttttaagc ccagtacata 2580 

^^^^^^^ i-rr^^ntnnaa ctatatttca gtagttgtag 



aaaaaaaaaa aaaaaaaaaa aaaaaaaa 



2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3268 



<210>' 2 
<2il> 3030 
<212> DNA 

<213> Mammalian (Human) STLK3 



<400> 2 



gacagcagcg 

ggcggcacag 

cggcagrgga 

agcaataaaa 

aattcaagcc 

ggtcaaagat 

cataaaatac 

agcaacaatt 

tcacagggat 

agatt trggg 

aaaaacattc 

ctargact tc 

agcagcgcct 

tccacccact 

Cl. 1 1 agaaaa 



ccggccccgg 

gctgtcggct 

gctactgctg 

cggatcaact 

atgagtcagt 

gaactttggc 

attgtcaacc 

ctnaaagagg 

ttgaaagctg 

gnaagtgcgt 

gttggcaccc 

aaggctgaca 

tat cacaaat 

ttggaaacag 

ttactttcac 



cagct cccgc 

ggcccatctg 

tggttcaggc 

tggaaaaatg 

gcagccat cc 

tggtcatgaa 

gaggagaaca 

ttttggaagg 

gtaatattct 

tcctagcaac 

catgttggat 

tgtggagt tt 

atcctcccat 

gggtagagga 

tgtgtctt ca 



ggccccggcc 
cagggacgcg 
agccctatgc 
ccagaccagt 
caacgtagtg 
atcactaagt 
caagaatgga 
cttagactat 
tctgggtgag 

agggggtgat 

ggctcctgaa 
tggaataact 
gaaagtgt ta 
taaagaaatg 
gaaagatcct 



ccggccccgg 

tacgagctgc 

aaacccaggc 

atggatgaac 

acctattaca 

ggaggttcaa 

gttctggaag 

ctacacagaa 

gatggttcag 

gttacccgaa 

gtcatggaac 

gccat tgaat 

atgtcgactt 

atgaaaaagt 

r ccaaaaggc 



ccccggcccc 

aggaggttat 

aagaacgtgt 

tattaaaaga 

cctcttttgt 

tgttggatat 

aggcaataat 

acggtcagat 

tacaaatagc 

acaaagtaag 

aggtgagagg 

tagcaacagg 

tgcaaaatga 

acggcaagtc 

ccacagcagc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
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agaacttt ta 

gctgcttaca 

gtcaagtggt 

ggatgagaag 

agaagaaaa t 

CTictgngcac 

ttgtgccgtg 

atttgagr ct 

tggctuggrg 

tgatcccaaa 

gattcctgat 

ccttgatgrc 

tctttctgcc 

tgaagaggcc 

ccgagttctg 

cacgacrgr t 

cccti:aagtg 

aat uttctat 

aattgracat 

aaattaagat 

tttcatacat 

ctitcttgcag 

gtaacttcta 

tttgcctaaa 

ct ttcaca t t 

ttcctctata 

gtaaaatgtt 

ttaagtgata 

ctacttgaaa 

actaaaaggt 

tgcttgactg 

trrcaaggag 

ggcagctctt 

ataartgcct 

atttcattcc 

gaaaanaaaa 



aaatgcaaat 

agaacaccag 

caccttcata 

agcgaagaag 

ccagagattg 

gactctcagg 

aacctcgtt t 

actccaggaa 

gatggtcacg 

gctttaaaaa 

gaagtgaagc 

acGcrgatct 

catttcctcc 

ggcttcaaca 

ctttattctc 

gcccartttt 

aaaagaarct 

ccagaagccr 

ctgacaatac 

ttarragcca 

tratgcatta 

ttaatctcgt 

aattttgaaa 

aaccaattta 

aagaatgtgc 

taaggtggca 

gccattctgt 

ggatttgtta 

ttttgcagaa 

aaaacgcrgt 

tatatt atgg 

grgggctccc 

taatgtactt 

tgtcat aat t 

attctattgt 

gcgtattgca 



tcttccagaa 

acatagccca 

aaaccgaaga 

ggaaagcagc 

cagtgagtgc 

gcccacccaa 

ugagat taag 

gagatacagc 

atgtagttat 

catrgacatt 

tgattgggtt 

gtcatgcccc 

caccccctca 

aaatgggata 

tagcaarcca 

ccaaaagcra 

gaaatacact 

atttctcctt 

tgcctctt tt 

aacttgaat t 

cacaccttag 

cattcaaaca 

tgactgaatt 

caatctgacc 

ctgaggctgc 

tatgttaagt 

tagattgcta 

ttcraaaggt 

tacccaggtg 

taaggttcaa 

aacragtaaa 

aggarggtac 

caaaaagcaa 

gttaaggtgt 

t tacacaacg 

caatgaaaaa 



agccaagaac 
aagagccaaa 

tttttctcag 
cagcaccatc 
tgctaatgaa 
aaactccaga 
agatggtgta 
agtggctgct 
taagttggct 
tgctcagttg 
accgccaccc 
ctcccatttc 
aaaaaataat 
cagtacaaga 
tttaatattc 
caggtggtct 
tcattgttgt 
atgttgtatt 
ctagttttaa 
ctataagaaa 
aaaagttttg 
aggaatttgg 
gtctcttggg 
tttactctgg 
tttgcttcat 
actatatacc 
gataaacttg 
gagtgaaaat 
gaatcaatac 
ccttaggatt 
cattgctct t 
aratatatta 
tctagagcca 
atractcgaa 



agagagtacc 
aaggtaagaa 
gagrggagtg 
gaaaagtcac 
cccgaacaaa 
gacracagag 
aaggaacrta 
tctcaggagc 
aatttacaga 
tctggctgtg 
agtgtcagct 
ctactccctt 
ctagcaaaat 
tttttaaaac 
acaagcaaat 
ttagcaatca 
tatttattgg 
tatttctgtt 
tagaaattaa 
aactgactgt 
aaaagggttt 
ggtctatctt 
atgcttattc 
agagggaggt 
aatagtctca 
tggaccgttt 
catctctgat 
aaaatatcag 
tggaagggtt 
tttcaaccca 
ttgaaaattg 
tcctagctaa 
craaggaaaa 
tttgcataca 
gargactgca 



tgatngagaa 
gagttcctgg 
acgacgagat 
gaagagtaaa 
tacagtccct 
aagcrtcttc 
atgacatacg 
tctrctctgc 
agattgtaga 
atgggtcgga 
gatgtatgtc 
caaccctccc 
cagaagattg 
tr acaacact 
gccacagctg 
atttggatat 
caacaaaagg 
ataatacttt 
tatacttata 
gaattttatt 
tgattatatg 
tggagtattt 
ttttagtctg 
gccrtgcaaa 
gatctaaaat 
agaatgctat 
t tggctctcc 
aatctgagtt 
ttgtgcaatg 
agtagccctc 
gagtctaatc 
cccr agatat 
aaagttattt 
at tt aatgta 
aaggtaaaag 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1600 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3030 



<210> 3 
<211> 3857 
<212> DNA 

<213> Mammalian (Human) STLK4 
<400> 3 

caaaagrgga gtcctagatg antctaccat tgctacgata ctccgagaag tactggaagg 60 

gctggaatat ctgcataaaa ntggacagat ccacagagat gtgaaagctg gaaacattct 120 

tnttggagaa gatggctcag tacaganrtc agactttggg gttagtgctt ttttagcaac 180 

tgg-ggtgat attacccgaa acaaagtgag aaagaccttt gttggcaccc crtgttggat 240 

ggcacctgaa gtratggaac aggtccgtgg nratgatttc aaagctgata ttrggagttt 300 

tggaattaca gcaattgaat cggctacagg ggcggctccr tatcataaat atccaccaat 360 

gaaggrttta atgcrgacac tgcagaacga ucctccttct ttggaaactg gtgtrcaaga 420 

taaagaaatg ctgaaaaaat atggaaaatc atrtagaaaa atgatttcat tgtgccttca 480 

aaaagarcca gaaaaaagac caacagcagc agaactatta aggcacaaat trtrccagaa 540 

agcaaagaat aaagaatttc ttcaagaaaa aacattgcag agagcaccaa ccatttcrga 600 
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aagagcaaaa 
tggaggcrgg 
aattccacaa 
aacaactgat 
tcagccagct 
gccagcaaaa 
aatcagtcta 
atttactcct 
ggtcgacgga 
tcagtcaaat 
tgacgatggt 
agaggcggcc 
cactacrgcc 
ttcttcctgc 
cagcat cccc 
gagaagt ggc 
actccaatga 
ttcccccaca- 
ttgattgttc 
gaattct age 
gttcttgttt 
ttcutcattg 
tgactgtagc 
agacaacatc 
aaaatcacaa 
ctcaatcctc 
aaggctgaga 
tgcctgcrct 
atggaaagtg 
ugattttcta 
caatatgtga 
tcTiaactgct 
aaaagataca 
atcactcgag 
ctct tggaac 
ctgagrctt t 
ggaaagagat 
agagtaggtg 
ggagcctgct 
gcttttcttr 
aatgaggtat 
ccttagagtt 
ggggtggcag 
gagtgagcag 
agtggggcag 
tttgccctgc 
ca::gatgtgt 
cagaaactgc 
atgctgattc 
gqacuaacta 
taaacgacta 
ccgataatat 
ccttatgatg 
tggactttgt 
aaaaaaaaaa 



aaggttcgga 
gagtggagtg 
ctcaggtctc 
cctgtgggta 
gggcagattg 
acagct cagg 
gtactaagat 
gggagagata 
agggatiitag 
cgatctgt ca 
aaactgatag 
taaggagatr 
aaagaaccca 
catcattcct 
agagttccgt 
ccatgtgctr 
tggcccaggt 
•cctgccag.ga 
agctggagcc 
aacctttgtt 
tgttcatggt 
ctctctccag 
cgaacttcag 
gagtaatggc 
ttcatatcat 
ccatgaaaat 
tttctacagc 
cctgcactga 
aaaccacatg 
cagaaataat 
ataatcagtg 
agtaacccta 
caaagatggg 
ccctttccag 
atatctgaaa 
agttttagtc 
ggatgttggg 
agatgatcag 
gagccaggag 
ttcttttttt 
agaaaagaga 
ttgttttttt 
catcactctg 
cagcccarcc 
tcatggcaaa 
taggaattag 
ggtagactcc 
ccaggtggaa 
agagtgcacc 
cagcacaaag 
aaggarttgt 
ttaataatcr 
tatgctaggt 
attttactgc 
aaaaaaa 



gagraccagg 
a tgatgaatt 
cccgagrgaa 
ctttgctcca 
ctacacagcc 
ctttgtcttc 
raaggaattc 
cagcagaggg 
taatagtggc 
ctttcaaact 
gatttgccca 
ccacacatgc 
gcaacaaacc 
ccttttCQca 
tagtaaactt 
caaggcccag 
ggaaaagtag 
tatggacacc 
cagagaattt 
aggaaaaagc 
gtttctaagc 
aaagggttgc 
ccatcagatc 
ttgtaaatgt 
tggagaagta 
cagttcgcct 
aataaaggag 
ccctttggag 
tgccgtgacc 
ataaattatt 
aggttgattt 
ccgagtttta 
ctgtgggrcc 
cactgttggt 
accttcccca 
atgggctttc 
gaggaagaga 
acaccggagt 
ggagaaaaga 
gtggaggggg 
tagcatcttc 
tttttcaaac 
t tctagcatt 
cctgttcact 
tagaattggg 
aagacagaca 
ctttgcrggc 
cagcacgtaa 
tctttgacta 
taatatgtgc 
tgggtttrtg 
ggaggagaga 
• tatggaagat 
atgttttctr 



ttccagcggg 
tgatgaagaa 
agaarcaata 
agttccagaa 
aacticaagtc 
aggatcaggt 
caaaaaagaa 
tgtctctcag 
agctaatttg 
ggcatctggt 
gctcagcatc 
gtatctctgt 
tcccggctag 
cagggaaaga 
acttcatatg 
gagggagatc 
cagctgtatc 
ttgggatatc 
aatttaatgt 
acagcctcag 
gttttgctga 
tagcctraac 
cttcaaagtg 
gaattttgcc 
tttattttca 
ggcctccaag 
acacacactg 
ggggtctctg 
tttaggtttt 
ctttaggttt 
ttcttttttc 
tatatgagtg 
ctggaaaggg 
ctgatgaaca 
caaataactt 
ttcacctgct 
ggagatggat 
tcaacgtccc 
agattgacca 
gacggagagg 
tttggcacaa 
ccatcaaaat 
ctttgtggag 
ttctctagcc 
ctggggtttc 
ccatgtccca 
ttgtgcagrg 
cctagtgagt 
ggucccagga 
caatgcratt 
cttaagtttt 
ataatgatgr 
graaaatat t 
catrttraat 



cgtctrcata 
agtgaggaag 
tcaaattctg 
cagacctctg 
tctctcccac 
tcacaagaaa 
ccaaatgata 
gaactcattt 
cagaaaattg 
gtcgaaggct 
agctaaacca 
tgcttctatt 
gagctttaga 
aaagttggat 
tcccctgtct 
tgrcagctca 
gggct tcctc 
tctttaccac 
tttrtctttg 
atggaggcag 
agctgctctc 
ttcagctggt 
gaactttgga 
agaggtggtt 
aatatcaaat 
tcgtgaggaa 
ggccagagag 
tgtgctgaag 
atgagtagac 
aaaaaagagc 
ctaccgtttc 
ggatactcaa 
ggagagttgc 
aggrtgtttt 
gt cacacctt 
ctaggtgcaa 
ttcagttggg 
agcagtcttg 
gcttgctaga 
aacaaggatg 
gactagtggc 
ctacttattt 
atggtctggt 
catcattacc 
tccttctttt 
ggacagtgtt 
atactgagaa 
gactgtacrc 
cccccttgtc 
tgrgaaatgt 
gaaccaaatc 
accaataagt 
caactttfcc 
caa caaagag 



agacagagga 
ggaaagcagc 
agctct t tec 
ctcatctacc 
ccaccgcaga 
ccaagatccc 
ttcgatttga 
ctgctggcct 
tggaagaacc 
cagatattcc 
caaccctgga 
ggcctaaacc 
agtctttatg 
cacragtggc 
tcctccatct 
ttcttgcctt 
atcctgcctg 
tgaagtagaa 
tacctgatgt 
cctaaactgt 
aggcaccccc 
gcaaaacatc 
ttgtttttac 
tttgaacagg 
tgaagaaaaa 
atgggtatgc 
gcctgccttc 
ctaact caag 
agtgttcatt 
actcataatg 
atagtctttg 
tctggcctta 
cctttacaga . 
accttatttt , 
ttgtttcatt . 
aggcatgttg 
agttaggagg 
gtaaaaggag 
aaaatactta 
gggaggtagg 
ttaccgctta 
atgaatccaa 
gcctagctgg 
tgtgaactgc 
cagttcattg 
acttcttctg 
aatacatgaa 
ctttctagga 
cccggagtag 
ttggtctttc 
ctagagccag 
ggagattcct 
ctcct t. ttt c 
taaattgtca 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2230 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 . 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3857 
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<21C> 4 
<21i> 1584 
<212> DNA 

<213> Mammaiiar. (Human) STLK5 
<400> 4 



ctcatctgta cacacttcat ggatggcatg aatgagctgg cgattgctta catcctgcag 50 

ggggtgcrga aggccctcga ctacatccac cacargggat atgtacacag gagtgtcaaa 120 

gccagccaca tcctgatctc tgtggatggg aaggtctacc tgtctggttt gcgcagcaac 180 

ctcagcatga taagccatgg gcagcggcag cgagtggtcc acgattttcc caagcacagt 240 

gtcaaggttc tgccgtggct cagccccgag gtcctccagc agaatctcca gggttatgat 300 

gccaagtctg acatctacag tgtgggaatc acagcctgtg aactggccaa cggccatgtc 360 

ccctttaagg atatgcctgc cacccagatg ctgctagaga aactgaacgg cacagtgccc 420 

tgccrgttgg ataccagcac catccccgct gaggagctga ccatgagccc ttcgcgctca 480 

gt-ggccaact orggcctgag tgacagcctg accaccagca ccccccggcc ctccaacggt 540 

gactcgcccc cccaccccta ccaccgaacc ttctcccccc acttccacca ctrrgtggag 600 

cagtgccttc agcgcaaccc ggatgccagg cccagtgcca gcaccctcct gaaccactct 660 

ttcttcaagc agatcaagcg acgtgcctca gaggctttgc ccgaattgct tcgtcctgtc 720 

acccccarca ccaatrttga gggcagccag tctcaggacc acagtggaat ctttggcctg 780 

gtaacaaacc tggaagagct ggaggtggac gattgggagt tctgagcctc tgcaaactgt 840 

gcgcattctc cagccaggga tgcagaggcc acccagaggc ccttcctgag ggccggccac 900 

attcccgccc tcctgggcag attgggtaga aaggacattc ttccaggaaa gttgactgct 960 

gactgattgg gaaagaaaat cctggagaga tacttcactg ctccaaggct tttgagacac 1020 

aagggaatct caacaaccag ggatcaggag ggtccaaagc cgacattccc agtcctgtga 1080 

gctcaggtga cctcctccgc agaagagaga . tgctgctctg gccctgggag ctgaattcca 1140 

agcccagggt ttggctcctt: aaacccgagg accgccacct cttcccagtg cttgcgacca 1200 

gcctcatrct atttaacttt gcrctcagat gcctcagatg ctataggtca gtgaaagggc 1260 

aagtagtaag ctgcctgcct cccttccctc agacctctcc ctcataattc cagagaaggg 1320 

cattrctguc tttttaagca cagactaagg crggaacagt ccatccttat cccrcttctg 1380 

gcttgggccc tgacacctaa gtcrttccca cggtttatgt gtgtgcctca ttcctttccc 1440 

accaagaatc catcttagcg cctcctgcca gctgccctgg tgctttctcc aagggccatc 1500 

agtgtctizgc ctagcttgag ggcttaagtc cttatgctgt gttagtttcg ttgtcagaac 1560 

aaattaaaat tutcagagac gctg 1584 



<210> 5 
<211> 416 
<212> PRT 

<213> Mammalian (Human) STLK2 
<400> 5 

Met Ala His Ser Pro Val Ala Val Gin Val Pro Gly Met Gin Asn Asn 
15 10 15 

lie Ala Asp Pro Glu Glu Leu Phe Thr Lys Leu Glu Arg lie Gly Lys 
20 25 30 



Gly Ser Phe Gly Glu Val Phe Lys Gly lie Asp Asn Arg Thr Gin Gin 
35 40 45 
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Val Val Ala He Lys He He Asp Leu Glu Glu Ala Glu Asp Glu He 



50 



55 60 



Glu Asp He Gin Gin Glu He Thr Val Leu Ser Gin Cys Asp Ser Ser 
65 

Tyr val Thr Lys Tyr Tyr Giy Ser Tyr Leu Lys Giy Ser Lys Leu Trp 
85 90 95 

He He Met Glu Tyr Leu Giy Giy Giy Ser Ala Leu Asp Leu Leu Arg 
100 

Ala Giy pro Phe Asp Glu Phe Gin He Ala Thr Met Leu Lys Glu He 

120 123 



115 



Leu Lys Giy Leu Asp Tyr Leu Hxs Ser Glu Lys Lys He His Arg Asp 



130 



135 



He Lys Ala Ala Asn Val Leu Leu Ser Glu Gin Giy Asp Val Lys Leu 
145 150 155 



Ala ASP Phe Giy Val Ala Giy Gin Leu Thr Asp Thr Gin He Lys Arg 

165 1"^° 

Asn rnr Phe Val Giy Thr Pro Phe Trp Met Ala Pro Glu Val He Gin 
180 185 150 

Gin ser Ala Tyr Asp Ser Lys Ala Asp He Trp Ser Leu Giy He Thr 
195 200 205 

Ala He Glu Leu Ala Lys Giy Glu Pro Pro Asn Ser Asp Met His Pro 
210 215 220 

Met A^g Val Leu Phe Leu He Pro Lys Asn Asn Pro Pro Thr Leu Val 
225 230 235 

Giy ASP Phe Thr Lys Ser Phe Lys Glu Phe He Asp Ala Cys Leu Asn 



245 



250 



Lys ASP Pro Ser Phe Arg Pro Thr Ala Lys Glu Leu Leu Lys His Lys 
260 265 270 

Phe He val Lys Asn Ser Lys Lys Thr Ser Tyr Leu Thr Glu Leu He 
275 280 28. 

ASP Arg Phe Lys Arg Trp Lys Ala Glu Giy His Ser Asp Asp Glu Ser 



290 



295 300 



Aso ser Glu Giy Ser Asp Ser Glu Ser Thr Ser Arg Glu Asn Asn Thr 
305 310 •^■'•^ 

His °ro Glu Trp Ser Phe Thr Thr Val Arg Lys Lys Pro Asp Pro Lys 
325 330 33D 

Lys val Gin Asn Giy Ala Glu Gin Asp Leu Val Gin Thr Leu Ser Cys 



340 



345 
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Leu Ser Met lie lie Thr Pro Ala Phe Ala Glu Leu Lys Gin Gin Asp 
355 360 365 

Glu Asn Asn Ala Ser Arg Asn Gin Ala lie Giu Glu Leu Glu Lys Ser 
370 375 380 

lie Ala Val Ala Glu Ala Ala Cys Pro Gly lie Thr Asp Lys Met Val 
385 -390 395 400 

Lys Lys Leu lie Giu Lys Phe Gin Lys Cys Ser Ala Asp Glu Ser Pro 
405 410 415 



<210> 6 
. <211>- -516 
<212> PRT 

<213> Mammalian (Human) STLK3 
<400> 6 

Thr Ala Ala Pro Ala Pro Ala Ala Pro Ala Ala Pro Ala Pro Ala Pro 
1 5 10 15 



Ala Pro Ala Pro Ala Ala Gin Ala Val Gly Trp Pro lie Cys Arg Asp 
20 25 30 

Ala Tyr Giu Leu Gin Glu Val lie Gly Ser Gly Ala Thr Ala Val Val 
35 40 45 

Gin Aia Ala Leu Cys Lys Pro Arg Gin Giu Arg Val Aia lie Lys Arg 
50 ' 55 60 

lie Asn Leu Giu Lys Cys Gin Thr Ser Met Asp Giu Leu Leu Lys Glu 
65 * 70 75 80 

lie Gin Ala Met Ser Gin Cys Ser His Pro Asn Val Val Thr Tyr Tyr 
85 90 95 

Thr Ser Phe Val Val Lys Asp Giu Leu Trp Leu Val Met Lys Leu Leu 
100 105 lie 

Ser Gly Giv Ser Met Leu Asp lie lie Lys Tyr lie Val Asn Arg Gly 
115 120 125 

Giu His Lys Asn Gly Val Leu Glu Giu Aia lie lie Ala Thr lie Leu 
130 ' 135 140 

Lys Giu Vai Leu Giu Gly Leu Asp Tyr Leu His Arg Asn Gly Gin lie 
145 150 155 160 

His Arg Asd Leu Lys Aia Gly Asn lie Leu Leu Gly Glu Asp Gly Ser 
165 170 175 
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Val Gin lie Ala Asd Phe.Gly Vai Ser Ala Phe Leu Ala Thr Gly Giy 
18C ' 185 190 

ASD Vai Thr Ara Asn Lys Val Arg Lys Thr Phe Val Gly Thr Pro Cys 
195 ' 200 205 

Trp Met Ala Pro Glu Val Met Glu Gin Val Arg Gly Tyr Asp Phe Lys 
210 ■ 215 220 

Ala Asp Met Trp Ser Phe Gly He Thr Ala He Glu Leu Ala Thr Giy 
225 230 235 240 

Ala Ala Pro Tvr His Lys Tyr Pro Pro Met Lys Val Leu Met Leu Thr 
- 245 250 255 

Leu Gin Asn Aso Pro Pro Thr Leu Glu Thr Gly Val Glu Asp Lys Glu 
260 265 270 

Met Met Lys Lvs Tyr Gly Lys Ser Phe Arg Lys Leu Leu Ser Leu Cys 
275 ' 280 285 

Leu Gin Lys Asp Pro Ser Lys Arg Pro Thr Ala Ala Glu Leu Leu Lys 
290 295 300 

Cys Lys Phe Phe Gin Lys Ala Lys Asn Arg Glu Tyr Leu lie Glu Lys 
305 310 315 320 

Leu Leu Thr Arg Thr Pro Asp He Ala Gin Arg Ala Lys Lys Val Arg 
325 330 335 

A-g Val Pro Gly Ser Ser Giy His Leu His Lys Thr Glu Asp Gly Asp 
340 345 350 

Trp Glu Trp Ser Asn Asp Glu Met Asp Glu Lys Ser Glu Glu Giy Lys 
. 355 360 365 

Ala Ala Phe Ser Gin Glu Lys Ser Arg Arg Val Lys Glu Glu Asn Pro 
370 375 380 

Glu He Ala Val Ser Ala Ser Thr He Pro Glu Gin He Gin Ser Leu 
385 390 395 400 

S°r Val His Asp Ser Gin Gly Pro Pro Asn Ala Asn Glu Asp Tyr Arg 
405 410 415 

Glu Ala Ser Ser Cys Ala Vai Asn Leu Val Leu Arg Leu Arg Asn Ser 
420 425 430 

Ara Lys Glu Leu Asn Asp He Arg Phe Glu Phe Thr Pro Giy Arg Asp 
435 440 445 

Thr Ala Asd Gly Val Ser Gin Glu Leu Phe Ser Ala Gly Leu Val Asp 
450 " 455 460 

Gly His Asp Val Vai He Val Ala Ala Asn Leu Gin Lys He Val Asp 



BNSDOCID: <WO 9953036A2_I_> 



wo 99/53036 



PCT/US99/08150 



465 470 475 480 

Asp Pro Lys Aia Leu Lys Thr Leu Thr Phe Lys Leu Ala Ser Giy Cys 
485 490 495 

Asp Giy Ser Giu lie Pro Asp Glu Val Lys Leu lie Giy Phe Aia Gin 
500 505 510 

Leu Ser Vai Ser 
515 



<210> 7 

<211> 414 

<212> PRT 

<213> Mammaiian (Human) STLK4 
<220> 

<223> "Xaa*' stands for any amino acid. 

<400> 7 

Lys Ser Giy Val Leu Asp Xaa Ser Thr lie Aia Thr lie Leu Arg Glu 
15 10 15 

Val Leu Giu Giy Leu Giu Tyr Leu His Lys Xaa Giy Gin lie His Arg 
20 25 30 

A.sp Vai Lys Aia Giy Asn lie Leu Xaa Giy Giu Asp Giy Ser Vai Gin 
35 40 45 

lie Aia Asp Phe Giy Vai Ser Aia Phe Leu Aia Thr Giy Giy Asp lie 
50 55 60 

Thr Arg Asn Lys Vai Arg Lys Thr Phe Val Giy Thr Pro Cys Trp Met 
65 70 75 80 

Ala Pro Giu Vai Met Giu Gin Vai Arg Giy Tyr Asp Phe Lys Aia Asp 
85 90 95 ' 

lie Trp Ser Phe Giy lie Thr Ala lie Giu Leu Ala Thr Giy Ala Aia 
100 105 110 

Pro Tyr His Lys Tyr Pro Pro Met Lys Val Leu Met Leu Thr Leu Gin 
115 ' 120 125 

Asn Asp Pro Pro Ser Leu Giu Thr Giy Val Gin Asp Lys Giu Met Leu 
130 135 140 

Lys Lys Tyr Giy Lys Ser Phe Arg Lys Met lie Ser Leu Cys Leu Gin 
145 150 155 160 

Lys Asp Pro Giu Lys Arg Pro Thr Ala Aia Glu Leu Leu Arg His Lys 
165 170 175 
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Phe Phe Gin Lys Ala Lys Asn Lys Glu Phe Leu Gin Glu Lys Thr Leu 
180 185 190 

Gin Arg Ala Pro Thr lie Ser Glu Arg Ala Lys Lys Val Arg Arg Val 
195 200 205 

Pro Glv Ser Ser Gly Arg Leu His Lys Thr Glu Asp Giy Gly Trp Glu 
210 215 220 

Trp ser Asp Asp Glu Phe Asp Glu Glu Ser Glu Glu Gly Lys Ala Ala 
225 230 235 240 

lie Ser Gin Leu Arg Ser Pro Arg Val Lys Glu Ser He Ser Asn Ser 
245 250 255 

Glu Leu Phe Pro Thr Thr Asp Pro Val Gly Thr Leu Leu Gin Val Pro 
260 265 270 

Glu Gin He Ser Ala His Leu Pro Gin Pro Ala Gly Gin He Ala Thr 
275 280 285 

G'.n Pro Thr Gin Val Ser Leu Pro Pro Thr Ala Glu Pro Ala Lys Thr 
290 295 300 

Ala Gin Ala Leu Ser Ser Gly Ser Gly Ser Gin Glu Thr Lys He Pro 
305 310 315 320 

He Ser Leu Val Leu Arg Leu Arg Asn Ser Lys Lys Glu Leu Asn Asp 
325 330 333 

He A-g Phe Glu Phe Thr Pro Gly Arg Asp Thr Ala Glu Gly Val Ser 
340 345 350 

Gin Glu Leu He Ser Ala Gly Leu Val Asp Gly Arg Asp Leu Val He 
355 360 365 

Val Ala Ala Asn Leu Gin Lys He Val Glu Glu Pro Gin Ser Asn Arg 
370 375 . 380 

Ser Val Th- Phe Lys Leu Ala Ser Gly Val Glu Gly Ser Asp He Pro 
385 * 390 395 400 

Asp Asp Gly Lys Leu He Gly Phe Ala Gin Leu Ser He Ser 
405 410 



<210> 8 
<211> 274 
<212> PRT 

<213> Mammalian (Human) STLK5 
<400> 8 

Leu He rys Thr His Phe Met Asp Gly Met Asn Glu Leu Ala He 
15 10 15 
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Tyr He Leu Gin Giy Val Leu Lys Ala Leu 
20 



Asp Tyr He His His Met 
25 30 



Giy Tyr Val His Arg Ser Val Lys Ala Ser His He Leu Tie Se- Va ^ 

35 40 45 * 

Asp Giy Lys Val Tyr Leu Ser Giy Leu Arg Ser Asn Leu Ser Met He 

55 60 



Ser His Giy Gin Arg Gin Arc Val Val Hi 
65 



s Asp Phe Pro Lvs Tyr Ser 
70 75 



80 



Val Lys Val Leu Pro Trp Leu Se 



p Leu Ser Pro Glu Vai Leu Gin Gin Asn Leu 
85 90 93 



Gin Giy Tyr Asp Ala Lys Ser Asp He Tyr Ser Val Giy He Th- Ala 
.1°° 105 no 

Cys Glu Leu Ala Asn Giy His Val Pro Phe Lys Asp Met Pro Ala Thr 
115 120 125 

Gin Met Leu Leu Glu Lys Leu Asn Giy Thr Val Pro Cys Leu Leu Asp 
130 135 



Thr Ser Thr He Pro Ala Glu Glu Leu Thr Met Ser Pro Ser Arg Ser 

■''^ 155 160 

Val Ala Asn Ser Giy Leu Ser Asp Ser Leu Thr Thr Ser Thr Pro Arq 

165 170 

Pro Ser Asn Giy Asp Ser Pro Ser His Pro Tyr His Arg Thr Phe Ser 

180 185 



190 



Pro His Phe His His Phe Val Glu G^n 
195 



Cys Leu Gin Arq Asn Pro Asp 
200 205 



Ala Arg Pro Ser Ala Ser Thr Leu" Leu Asn His Ser 



210 



215 



220 



Phe Phe Lys Gin 



lie Lys Arg Arg Ala Ser Glu Ala Leu Pro Glu Leu Leu Arg Pro 



230 



235 



Val 
240 



Thr Pro He Thr Asn Phe Glu Gl 



y Ser Gin Ser Gin Asp His Ser Glv 
245 250 



255 



He Phe Giy Leu Vai Thr Asn Leu Glu Glu 



!60 



265 



Leu Glu Val Asp Asp Trp 



no 



Glu Phe 



<210> 9 
<211> 3798 
<212> DNA 
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<213> Mammalian (Human) ZCl 
<400> 9 



gagaccatgg cgaacgactc rcccgcgaaa agtcrggtgg acatcgacct ctcctccctg 6C 

cgggatcctg ctgggatttt tgagcrggtg gaagtggttg gaaatggcac ctatggacaa 120 

gtctataagg gtcgacatgt taaaacgggt cagttggcag ccatcaaagr targgargtc 180 

actgaggatg aagaggaaga aatcaaactg gagataaata tgctaaagaa atactctcat 240 

cacagaaaca ttgcaacata ttatggtgct ttcatcaaaa agagccctcc aggacatgat 300 

gaccaactct ggcttgttat ggagttcrgr ggggctgggt ccattacaga ccttgtgaag 360 

aacaccaaag ggaacacact caaagaagac tggatcgctt acatctccag agaaarcctg 420 

aggggactgg cacatcttca cartcatcat gtgattcacc gggatatcaa gggccagaat 480 

gtgttgctga ctgagaatgc agaggtgaaa cttgttgact ttggtgtgag tgctcagctg 540 

gacaggactg tggggcggag aaatacgttc ataggcactc cctactggat ggctcctgag 600 

gtcatcgcct gtgatgagaa cccagatgcc acctatgatt acagaagtga tctttggtct 660 

tgtggcacca cagccattga gatggcagaa ggtgctcccc ctctctgtga catgcatcca 720 

atgagagcac tgtttctcat tcccagaaac cctcctcccc ggctgaagtc aaaaaaatgg 780 

tcgaagaagt tttttagttt tatagaaggg tgcctggtga agaattacat gcagcggccc 840 

tctacagagc agcttttgaa acatcctttt ataagggatc agccaaatga aaggcaagtt 900 

agaatccagc ttaaggatca tatagarcgt accaggaaga agagaggcga gaaagatgaa 960 

actgagtatg agtacagtgg gagtgaggaa gaagaggagg aagtgcctga acaggaagga 1020 

gagccaagtt ccattgtgaa cgtgcctggt gagtctactc ttcgccgaga ttrcctgaga 1080 

ctgcagcagg agaacaagga acgttccgag gctcttcgga gacaacagtt actacaggag 1140 

caacagctcc gggagcagga agaatataaa aggcaactgc tggcagagag acagaagcgg 1200 

attgagcagc agaaagaaca gaggcgacgg ctagaagagc aacaaaggag agagcgggaa 1260 

gctagaaggc agcaggaacg tgaacagcga aggagagaac aagaagaaaa gaggcgtcta 1320 

gaggagctgg agagaaggcg caaagaagaa gaggagagga gacgggcaga agaagaaaag 1380 

aggagagttg aaagagaaca ggagtatatc aggcgacagc tagaagagga gcagcggcac 1440 

ttggaagrcc ttcagcagca gctgctccag gagcaggcca tgttactgga gtgccgatgg 1500 

cgggagatgg aggagcaccg gcaggcagag aggctccaga ggcagttgca acaagaacaa 1560 

gcatatctcc tgtctctaca gcatgaccat aggaggccgc acccgcagca ctcgcagcag 1620 

ccgccaccac cgcagcagga aaggagcaag ccaagcttcc atgctcccga gcccaaagcc 1680 

caccacgagc ctgctgaccg agcgcgagag gtggaagata gatttaggaa aactaaccac 1740 

agctcccctg aagcccagtc taagcagaca ggcagagtat tggagccacc agtgccttcc 1800 

cgatcagagt ctttttccaa ;:ggcaactcc gagtctgtgc atcccgccct gcagagacca 1860 

gcggagccac aggttcctgt gagaacaaca tctcgctccc ctgttctgtc ccgrcgagat 1920 

tccccactgc agggcagtgg gcagcagaat agccaggcag gacagagaaa ctccaccagt 1980 

attgagccca ggctcctgtg ggagagagtg gagaagctgg cgcccagacc tggcagtggc 2040 

agctcctcag ggrccagcaa ctcaggatcc cagcccgggt ctcaccctgg gtctcagagt 2100 

ggctccgggg aacgcttcag agtgagatca tcatccaagt ctgaaggctc tccatctcag 2160 

. cgcctggaaa atgcagtgaa aaaacctgaa gataaaaagg aagttttcag acccctcaag 2220 

cctgccgatc tgaccgcacr ggccaaagag cttcgagcag tggaagatgt acggccacct 2280 

cacaaagtaa cggactactc ctcatccagt gaggagtcgg ggacgacgga tgaggaggac 2340 

gacgatgtgg agcaggaagg ggctgacgag tccacctcag gaccagagga caccagagca 2400 

gcgtcatctc tgaatttgag caatggtgaa acggaatctg rgaaaaccat gattgtccat 2460 

gatgatgtag aaagrgagcc ggccatgacc ccatccaagg agggcactct aatcgtccgc 2520 

cggactcagt ccgctagtag cacacrccag aaacacaaat crtcctcctc ctttacacct 2580 

tttatagacc ccagattact acagatttct ccatcragcg gaacaacagt gacatctgtg 2640 

gtgggatttt cctgtgatgg gatgagacca gaagccataa ggcaagatcc tacccggaaa 2700 

ggcrcagtgg tcaatgtgaa rccraccaac actaggccac agagrgacac cccggagatt 2760 

cgtaaataca agaagaggtt taactctgag attcrgtgtg ctgccttatg gggagtgaat 2820 

ttgctagtgg gracagagag tggcctgatg ctgctggaca gaagrggcca agggaaggtc 2880 

tarccrctta ccaaccgaag acgatttcaa caaarggacg tacttgaggg CTitgaatgtc 2940 

ttggtgacaa tatccggcaa aaaggataag ttacgtgrcT: acratttgtc ctggttaaga 3000 

aaraaaatac ttcacaarga tccagaagtt gagaagaagc agggatggac aaccgtaggg 3060 

gatutggaag gatgtgtaca tcataaactt graaaatatg aaagaatcaa atttctggrg 3120 
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artgctLtga agagttcrct ggaagtctat gcgtgggcac caaagccaca tcacaaattt 318C 

atggccttta agtcatttgg agaattggta cataagccat tactggtgga tctcactgtt* 3240 

gaggaaggcc agaggttgaa agtgatctat ggatcctgtg ctggattcca tgctgrtgat 3300 

gtggattcag gatcagtcta rgacatttat ctaccaacac atacccagtg tagcatcaaa 3360 

ccccatgcaa rcatcarcct ccccaataca gatggaatgg agcttctggr gtgctatgaa 3420 

gatgaggggg tttatgtaaa cacatatgga aggatcacca aggatgtagt tctacagtgg 3480 

ggagagatgc ctacatcagt agcatatatt cgatccaatc agacaatggg ctggggagag 3540 

aaggccarag agatccgatc tgrggaaact ggtcacttgg atggtgtgtr catgcacaaa 3600 

agggcrcaaa gactaaaatt cttgtgtgaa cgcaatgaca aggrgttctt tgcctctgtt 3660 

cggtcrggtg gcagcagtca ggtttatttc atgaccttag gcaggacttc tcttctgagc 3720 

tggtagaagc agtgtgatcc agggattact ggcctccaga gtcttcaaga tcctgagaac 3780 
ttggaattcc ttgtaact 



3798 



<210> 10 
<211> 4055 
<212> DNA 

<213> Manunalian (Human) ZC2 
<400> 10 

gcttrcgggg aggtcratga gggtcgtcat gtcaaaacgg gccagcttgc agccatcaag 60 
gttatggacg tcacagggga tgaagaggaa gaaatcaaac aagaaattaa catgttgaag 120 
aaatattctc atcaccggaa tattgctaca tactatggtg cttttatcaa aaagaaccca 
ccaggcatgg atgaccaact ttggttggtg atggagtttt gtggtgctga ctctgtcacc 
gacctgacca agaacacaaa aggtaacacg ttgaaagagg agtggattgc atacatctgc 
agggaaatct tacgggggct gagtcacctg caccagcata aagtgattca tcgagatatt 
aaagggcaaa atgtcttgct gactgaaaat gcagaagtta aactagtgga ctttggagtc 
agtgctcagc ttgatcgaac agtgggcagg aggaatactt tcattggaac tccctacrgg 
atggcaccag aagttattgc ctgtgatgaa aacccagatg ccacatatga tttcaagagt 
gacrtgtggt ct'tgggtat caccgccatt gaaatggcag aaggtgctcc ccctctctgt 
gacatgcacc ccatgagagc tctcttcctc atcccccgga atccagcgcc tcggctgaag 
tc-aagaagt ggtcaaaaaa attccagtca tttattgaga gctgc-ctggt aaagaatcac 
agccagcgac cagcaacaga acaattgatg aagcatccat ttatacgaga ccaacctaat 
gagcgacagg tccgcattca acrcaaggac catattgata gaacaaaaaa gaagcgagga 
gaaaaagatg agacagagta tgagtacagt ggaagtgagg aagaagagga ggagaatgac 
tcaggagagc ccagctccat cctgaatctg ccaagggagt cgacgctgcg gagggacttt 
ctgaggctgc agctggccaa caaggagcgt tctgaggccc tacggaggca gcagctggag 1020 
cagcagcagc gggagaatga ggagcacaag cggcagctgc tggccgagcg tcagaagcgc 1080 
atcgaggagc agaaagagca gaggcggcgg ctggaggagc aacaaaggcg agagaaggag 1140 
ctgcggaagc agcaggagag ggagcagcgc cggcactatg aggagcagat gcgccgggag 1200 
gaggagagga ggcgtgcgga gcatgaacag. gaatataagc gcaaacaatt ggaagaacag 1260 
agacaagcag aaagactgca gaggcagcra aagcaagaaa gagactactt agtttccctt 1320 
cagcatcagc ggcaggagca gaggcctgtg gagaagaagc cactgtacca ttacaaagaa 1380 
ggaatgagrc ctagtgagaa gccagcatgg gccaaggagg tagaagaacg gtcaagactc 1440 
aaccggcaaa gttcccctgc catgcctcac aaggttgcca acagaatacc tgaccccaac 1500 
ctgcccccaa ggtcggagtc cttcagcatt agtggagttc agcctgctcg aacacccccc 1560 
atgctcagac cagtcgarcc ccagatccca catctggtag ctgtaaaatc ccagggacct 1620 
gccttgaccg cctcccagtc agrgcacgag cagcccacaa agggcctctc tgggtttcag 1680 
gaggcrcrga acgtgacctc ccaccgcgtg gagatgccac gccagaactc agatcccacc 1740 
tcggaaaatc ctcctctccc cactcgcatt gaaaagtttg accgaagccc ttggttacga 1800 
caggaagaag acatrccacc aaaggrgcct caaagaacaa cttctatatc cccaccatta I860 
gccagaaaga attctcctgg gaatggtagt gctcrgggac ccagactagg atctcaaccc 1920 
atcagaqcaa gcaaccctga rctccggaga actgagccca tcttggagag ccccttgcac 1980 
aggaccagca gtggcagttc ctccagc-cc agcaccccta gctcccagcc cagctcccaa 2040 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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ggaggctccc 

agraagtcag 

tccagggaca 

ctgacggcat 

aagaaggtga 

gaagatggag 

ataccaacag 

gggctggaga 

atgattagag 

gctggccaca 

actgagggac 

ggcatgggga 

acgtctccca 

ggcgaacttc 

aatgtaaacc 

aaacgatt ca 

actgaaaatg 

aaccggaggc 

tcaggaae ga 

ca taar gacc 

tgtatacat t 

aargctgtgg 

tcttttgcag 

agattaaagg 

aactcttatg 

gtcatcttgc 

tatgtaaaca 

acgtctgtgg 

atccggtcag 

ttaaagtttc 

agtagccaag 

agcacttggc 

actt atgcca 

tcct crccct 



agccrggatc 

aaggatcacc 

ttacccggcc 

ragccaaaga 

ctgatractc 

agagcgagac 

gagctccagg 

cctctcatgc 

agacgtctgg 

rcaacctccc 

tggggcgcgt 

gcagcaccaa 

ctgatgaaga 

ttaggcaaga 

caaccaacat 

actcagaaat 

gcctgatgct 

gatttcagca 

agaataagct 

cagaagtaga 

ataaagttgt 

aaatatargc 

atctccagca 

ttatttttgg 

atatctacac 

ctaaaacaga 

cctatggccg 

cctacattca 

tggaaacagg 

tatgtgaaag 

tgtttttcat 

acttatcttc 

gtctagaggc 

ccgcccct cc 



acaagcagga 
tgtgcrcccc 
cagtcgacca 
actaagagaa 
ctcczcoaqt 
ccargatggg 
cagcaacgag 
ggacagt t tc 
agagaagaag 
tgacctggtg 
ctcaacccat 
agcctccttc 
tgaagaggat 
acaggccaaa 
t cggcczcat 
actttgtgca 
tttggaccga 
gatggatgrg 
acgagtttac 
aaagaaacaa 
raaatatgaa 
ttgggctcct 
caagcctctg 
r tcacacact 
accatctcat 
tggaatggaa 
gataactaag 
ttccaatcag 
acatttggat 
aaatgataag 
gaccctcaac 
atggcgttat 
agaatcagaa 
cagr acagt c 



tccagtgaac 
cacgagcctg 
gctagctaca 
ctccggattg 
gaggagtcag 
acagtggctg 
cagtacaatg 
agcggcagta 
cgatctggcc 
cagcagagcc 
tcccaggaga 
accccctttg 
gaggaatcat 
ctcaatgaag 
agcgacacac 
gctctgtggg 
agrgggcaag 
ctagagggac 
tatctttcac 
ggctggatca 
aggatcaaat 
aaaccgtatc 
ctagttgat c 
ggtttccatg 
attcagggca 
atgcttgttt 
gatgtggtgc 
ataatgggct 
ggagtat tta 
gtactttttg 
agaaattcca 
ttctaattta 
ggcttggttg 
cat ct 



gcaccagagt 
ccaaggtgaa 
aaaaagctat 
aagaaacaaa 
aaagtagcga 
tcagcgacat 
tgggaatggt 
tttcaagaga 
acagtgacag 
attctccagc 
tggactctgg 
tggaccccag 
cagccgcagc 
caagaaagat 
cagaaatcag 
gtgtaaacct 
gcaaagtcta 
tgaatgtcct 
ggttaagaaa 
ctgctgggga 
ttttggtgat 
ataaattcat 
tcacggtaga 
taattgatgt 
atatcactcc 
gctatgagga 
tccaatgggg 
ggggcgagaa 
tgcataagcg 
catccgtgcg 
tgatgaactg 
aaagaacata 
aacatatcgc 



tcgagccaac 
accagaagaa 
agatgaggat 
ccgcccaatg 
ggaagaggag 
acccagactg 
ggggacgcat 
aggaaccttg 
caatggcttt 
tggaaccccg 
gactgaatat 
agtataccag 
tctgtttact 
ttcggtggta 
aaaaracaag 
tctggtgggg 
taatctgatc 
tgtgacaatt 
cagaatacta 
cttggaaggc 
tgccttaaag 
ggcatttaag 
agaaggtcaa 
tgattcagga 
tcargctatt 
tgagggggtg 
agaaatgccc 
agctat tgag 
agctcaaagg 
atctggagga 
gtaacagaag 
actcatgtgg 
tttccctttt 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4055 



<210> 11 
<211> 4133 
<212> DNA 

<213> Mammalian (Human) 2C3 
<400> 11 



gcatttgggg 
gtcarggatg 
aagtacrctc 
ccgggaaacg 
gacctggtaa 
agggagarcc 
aaggggcaga 
agtgctcagc 
atggct ccag 
gatatttggt 
gacatgcacc 
tccaagaagt 



aggtgtatga 
tcacggagga 
accaccgcaa 
atgaccagct 
agaacacaaa 
tcaggggtcr 
atgtgctgct 
tggaccgcac 
aggtcatcgc 
ctct aggaat 
ccatgcgagc 
ggtcraagaa 



ggqrcggcat 
cgaggaggaa 
catcgccacc 
ctggctggtg 
aggcaacgcc 
ggcccatctc 
gacagagaac 
cgrgggcaga 
ctgtgatgag 
cacagcca tc 
cot cztcct c 
gtrcattgac 



gtcaagacgg 
gagatcaaac 
tactacggag 
atggagttct 
crgaaggagg 
catgcccaca 
gctgaggtca 
cggaacact t 
aacccrgatg 
gagarggcag 
attcctcgga 
t tcattgaca 



ggcagctggc 
aggagatcaa 
ccttcatcaa 
grggtgctgg 
actgtatcgc 
aggcgatcca 
agctagtgga 
tcattgggac 
ccacctatga 
agggagcccc 
accctccgcc 
catatctcat 



tgccarcaag 
catgctgaaa 
gaagagcccc 
rtcagtgact 
ctatatctgc 
tcgagacatc 
tttrggggtg 
tccctactgg 
ctacaggagt 
ccctctgtgt 
caggctcaag 
caagacttac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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ccgagccgcc cacccacgga gcagctactg aagtrtccct tcatccggga ccagcccacg 780 

gagcggcagg ticcgcatcca gctcaaggac cacatrgacc gatcccggaa gaagcggggt 840 

gagaaagagg agacagaara tgagtacagc ggcagcgagg aggaagatga cagccatgga 900 

gaggaaggag agccaagctc catcatgaac gtgcctggag agtcgactct acgccgggag 960 

ttrctccggc tccagcagga aaataagagc aactcagagg cttcaaaaca gcagcagcag 1020 

ctgcagcagc agcagcagcg agaccccgag gcacacatca aacacctgct gcaccagcgg 1080 

cagcggcgca tagaggagca gaaggaggag cggcgccgcg tggaggagca acagcggcgg 1140 

gagcgggagc agcggaagct gcaggagaag gagcagcagc ggcggctgga ggacargcag 1200 

gctctgcggc gggaggagga gcggcggcag gcggagcgcg agcaggaata tattcgtcac 1260 

aggctagagg aggagcagcg acagctcgag atccttcagc aacagctgct ccaggaacag 1320 

gccctgctgc tggaatacaa gcggaagcag ctggaggagc agcggcagtc agaacgtctc 1380 

cagaggcagc tgcagcagga gcatgcctac ctcaagtccc tgcagcagca gcaacagcag 1440 

cagcagcrtc agaaacaaca gcagcagcag ctcctgcctg gggacaggaa gcccctgtac 1500 

cattatggrc ggggcatgaa tcccgctgac aaaccagcct gggcccgaga ggtagaagag 1560 

agaacaagga tgaacaagca gcagaactct cccttggcca agagcaagcc aggcagcacg 1620 

gggccrgagc cccccatccc ccaggcctcc ccagggcccc caggacccct ttcccagact 1680 

cctcctatgc agaggccggt ggagccccag gagggaccgc acaagagcct ggtggcacac 1740 

cgggtcccac tgaagccata tgcagcacct gtaccccgat cccagtccct gcaggaccag 1800 

cccacccgaa acctggctgc ctrcccagcc tcccatgacc ccgaccctgc catccccgca 1860 

cccactgcca cgcccagtgc ccgaggagct gtcatccgcc agaattcaga ccccacctct 1920 

gaaggacctg gccccagccc gaatccccca gcctgggtcc gcccagataa cgaggcccca 1980 

cccaaggtgc cccagaggac ctcatctatc gccactgccc ttaacaccag tggggccgga 2040 

gggtcccggc cagcccaggc agtccgtgcc agacctcgca gcaactccgc ctggcaaatc 2100 

tatctgcaaa ggcgggcaga gcggggcacc ccaaagcctc cagggccccc tgctcagccc 2160 

cctggcccgc ccaacgcctc tagtaacccc gacctcagga ggagcgaccc tggctgggaa 2220 

cgctcggaca gcgtccttcc agcctctcac gggcacctcc cccaggctgg ctcactggag 2280 

cgg'aaccgcg tgggagtctc ctccaaaccg gacagctccc ctgtgctctc ccctgggaat 2340 

aaagccaagc ccgacgacca ccgctcacgg ccaggccggc ccgcagactt tgtgttgctg 2400 

aaagagcgga ctctggacga ggcccctcgg cctcccaaga aggccatgga ctacrcgtcg 2460 

tccagcgagg aggtggaaag cagtgaggac gacgaggagg aaggcgaagg cgggccagca 2520 

gaggggagca gagatacccc tgggggccgc gacggggata cagacagcgt cagcaccatg 2580 

gtggtccacg acgtcgagga gatcaccggg acccagcccc catacggggg cggcaccatg 2640 

gtggtccagc gcacccctga agaggagcgg aacctgctgc atgctgacag caatgggtac 2700 

acaaacctgc ctgacgtggt ccagcccagc cactcaccca ccgagaacag caaaggccaa 2760 

agcccaccct cgaaggatgg gagrggtgac taccagtctc gtgggctggt aaaggcccct 2820 

ggcaagagct cgrtcacgat gtttgtggar cragggatct accagcctgg aggcagtggg 2880 

gacagcatcc ccatcacagc cctagtgggt ggagagggca ctcggctcga ccagctgcag 2940 

cacgacgrga ggaagggttc tgtggtcaac gtgaatccca ccaacacccg ggcccacagt 3000 

gagacccctg agatccggaa gtacaagaag cgattcaact ccgagatcct ctgtgcagcc 3060 

ctttgggggg rcaacctgct ggrgggcacg .gagaacgggc tgatgttgct ggaccgaagt 3120 

gggcagggca aggtgtatgg actcattggg cggcgacgct tccagcagat ggatgtgctg 3180 

gaggqgcrca acctgctcat caccatctca gggaaaagga acaaactgcg ggtgtattac 3240 

ttgtcctggc cccggaacaa gattctgcac aatgacccag aagtggagaa gaagcagggc 3300 

tggaccaccg tgggggacar ggagggctgc gggcactacc gtgttgtgaa atacgagcgg 3360 

attaagttcc tggtcatcgc cctcaagagc tccgtggagg tgtatgcctg ggcccccaaa 3420 

ccctaccaca aattcatggc ctccaagrcc tttgccgacc tcccccaccg ccctctgctg 3480 

gtcgaccrga cagcagagga ggggcagcgg ctcaaggtca tctatggctc cagtgctggc 3540 

ttccargcrg rggatgtcga ctcggggaac agctatgaca tctacatccc tgtgcacatc 3600 

cagagccaga tcacgcccca tgccarcatc ttccrcccca acaccgacgg catggagatg 3660 

ctgctgtcct acgaggacga gggrgtctac gtcaacacgt acgggcgcat cattaaggat 3720 

atggtgccgc agtgggggga gatgcctact tctgtggcct acacctgctc caaccagata 3780 

argggc-cggg gtgagaaagc cattgagatc cgctctgtgg agacgggcca cctcgacggg 3840 

gtcttcatgc acaaacgagc tcagaggctc aagttcctgt gtgagcggaa tgacaaggtg 3900 

ttttttgcct cagtccgctc tgggggcagc agccaagttt acttcatgac tctgaaccgt 3960 

aaccqcatca tgaactggtg acggggcccc gggcrggggc tgtcccacac tggacccagc 4020 

tcrcccccrg cagccaggct tcccgggccg cccctctttc cccrccctgg gctitttgctt 4080 



»^^^^^-/ ^t;:.^- . .... . 



wo 99/53036 PCT/US99/08150 

16 

ttactggttc gatttcactg gagcctgctg ggaacgtgac ctcrgacccc tga 4133 



<210> 12 
<211> 1459 
<212> DNA 

<213> Mammalian (Human) ZC4 
<400> 12 



caatgttaac ccactctarg tctctcctgc atgtaaaaaa ccactaatcc acatgtatga 60 

aaaggagttc acttctgaga tctgctgtgg ttctttgtgg ggagtcaatt tgctgttggg 120 

aacccgatct aatctatatc tgatggacag aagtggaaag gctgacatta ctaaacttat 180 

aaggcgaaga ccattccgcc agattcaagt cttagagcca ctcaatttgc tgattaccat 240 

ctcaggtcat aagaacagac ttcgggtgta tcatctgacc tggttgagga acaagatttt 300 

gaataatgat ccagaaagta aaagaaggca agaagaaatg ctgaagacag aggaagcctg 360 

caaagctatt gataagtraa caggctgtga acacttcagt gtcctccaac atgaagaaac 420 

aacatatatt gcaattgctt tgaaatcatc aattcacctt tangcatggg caccaaagtc 480 

ctttgatgaa agcactgcta ttaaagtatt tccaacactt gatcataagc cagtgacagt 540 

tgacctggct attggttctg aaaaaagact aaagattttc ttcagctcag cagatggata 600 

tcacctcatc gatgcagaat ctgaggttat gtctrgatgtg accctgccaa agaatcccct 660 

ggaaatcatt ataccacaga atatcatcat tttacctgat tgcttgggaa ttggcatgat 720 

gctcaccttc aatgctgaag ccctctctgt ggaagcaaat gaacaactct tcaagaagat 780 

cctcgaaatg tggaaagaca taccatcttc tatagctttt gaatgtacac agcgaaccac 840 

aggatggggc caaaaggcca ttgaagtgcg ctctttgcaa tccagggttc tggaaagtga 900 

gctgaagcgc aggrcaatta agaagctgag attcctgtgc acccggggtg acaagctgtt 960 

.ctttacctct accctgcgca atcaccacag ccgggtttac ttcatgacac ttggaaaact 1020 

tgaagagctc caaagcaatt atgatgtcta aaagtttcca gtgatttatt accacattat 1080 

aaacatcatg tataggcagt ctgcatcttc agatttcaga gattaaatga gtattcagtt 1140 

ttattttrag taaagatcaa atccaaaact ttacttttaa tgtagcacag aatagtttta 1200 

atgagaaatg cagcttratg tataaaatta actatagcaa gctctaggta ctccaatggt 1260 

gtacaatgtc ttttgcacaa actttgtaac ttttgttact gtgaattcaa acattactct 1320 

trggacagtt tggacagtat ctgtattcag attttacaac atggagtaaa gaaacctgtt 1380 

atgaattaga n.cacaagcag .ccttcaaaag aattggcact gggataagar ttttcagaaa 1440 

agaaaaacat: cggcaaact 1459 



<210> 13 
<211> 1239 
<212> PRT 

<213> Mammalian (Human) ZCl 
<400> 13 

Met Ala Asn Asp Ser Pro Ala Lys Ser Leu Val Asp lie Asp Leu Ser 
1 ■ 5- 10 ' 15 

Ser Leu Arg Asp Pro Ala Giy lie Phe Giu Leu Vai Glu Vai Val Giy 
20 25 30 

Asn Giy Thr Tyr Giy Gin Vai Tyr Lys Giy Arg His Vai Lys Thr Giy 



BNSDOCID: <WO 9953036A2J_> 



VVO99/53036 

PCT/US99/08150 



17 



35 



40 



45 

Gin Leu Ala Ala lie Lys Val Met Asn Tn^ ri n 

60 

Glu lie Lys Leu Glu lie Asn Met Leu Lys Lys Ty. Ser His His Arg 

80 

Asn lie Ala Th. Tyr Tyr Gly Ala Phe lie Lys Lys Ser Pro Pro Gly 

90 95 ^ 

His ASP Asp Gin Leu Trp Leu Val Met Glu Phe Cys Gly Ala Gly Ser 

110 

lie Thr ASP Leu Val Lys Asn Thr Lys Gly Asn Thr Leu Lys Glu Aso 

1^0 125 

Trp lie Ala Tyr lie Ser Arg Glu lie Leu Arg Gly Leu Ala His Leu 

140 

His lie His His Val lie Fis Ar-^ aor^ n 

14 5 ^""^ Gly Gin Asn Val Leu 

160 

Leu T... =l„ 

170 3^75 

Gin Leu Asp Arg Thr Val Gly Arg Arg Asn Thr Phe He Gly Thr Pro 

185 

Tyr Trp Met Ala Pro Glu Val lie A^- - rwo a 

195 onn ^ ^^P Ala 

.iuu 205 

Thr Tyr Asp Tyr Arg Ser Asd Leu T-p Ser Cvc, riw ti 

210 ^-^^ Ala He 

^l^ 220 

Clu „.t M. oi„ CI, „, p„ 

235 240 

P., 

255 

Lys Trp ser Lys Lys Phe Phe Ser Phe He Glu Gly Cys Leu Val Lys 

265 270 

Asn Tyr Met Gin Arg Pro Se- Thr nn ri r 

275 ITr, ^^"^ ^^'^ His Pro Phe 

^80 285 

lie Arg Asp Gin Pro Asn Glu Arg Gin Val Arg tie Gin Leu Lys Aso 

300 

His lie ASP Arg Thr Arg Lys Lys Arg Gly Giu Lys Asp Glu Thr Glu 

320 

Tyr Glu Tyr Ser Gly Ser Glu Glu Glu Glu Glu Glu Val Pro Glu Gin 
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Giu Gly Giu Pro 
340 

Arg Arg Asp Phe 
355 

Ala Leu Arg Arg 
370 

Giu Giu Tyr Lys 
385 

Gin Gin Lys Giu 



Arg Giu Ala Arg 
420 

Giu Giu Lys Arg 
435 

Giu Giu Arg Arg 
450 

Gin Giu Tyr lie 
465 

Val Leu Gin Gin 



Arg Trp Arg Giu 
500 



Ser Ser lie Val 



Leu Arg Leu Gin 
360 

Gin Gin Leu Leu 
375 

Arg Gin Leu Leu 
390 

Gin Arg Arg Arg 
405 

Arg Gin Gin Giu 



Arg Leu Giu Giu 
440 

Arg Ala Giu Giu 
455 

Arg Arg Gin Leu 
470 

Gin Leu Leu Gin 
485 

Met Giu Giu His 



Asn Val Pre Gly 
345 

Gin Giu Asn Lys 



Gin Giu Gin Gin 
380 

Ala Giu Arg Gin 
395 

Leu Giu Giu Gin 
410 

Arg Giu Gin Arg 
425 

Leu Giu- Arg Arg 



Giu Lys Arg Arg 
460 

Giu Giu Giu Gin 
475 

Giu Gin Ala Met 
490 

Arg Gin Ala Giu 
505 



Giu Ser Thr Leu 
350 

Giu Arg Ser Giu 
365 

Leu Arg Giu Gin 



Lys Arg lie Giu 
400 

Gin Arg Arg Giu 
415 

Arg Arg Giu Gin 
430 

Arg Lys Giu Giu 
445 

Val Giu Arg Giu 



Arg His Leu Giu 
480 

Leu Leu Giu Cys 
495 

Arg Leu Gin Arg 
510 



Gin Leu Gin Gin Giu Gin 
515 

Arg Arg Pro His Pro Gin 
530 

Giu Arg Ser Lys Pro Ser 

545 550 

Giu Pro Ala Asp Arg Ala 
565 

A.sn His Ser Ser Pro Giu 
580 

Giu Pro Pro Val Pro Ser 
595 

Giu Ser Val His Pro Ala 
610 

Val Arg Thr Thr Ser Arg 



Ala Tyr Leu Leu Ser 
520 

His Ser Gin Gin Pro 
535 

Phe His Ala Pro Giu 
555 

Arg Giu Val Giu Asp 
570 

Ala Gin Ser Lys Gin 
585 

Arg Ser Giu Ser Phe 
600 

Leu Gin Arg Pro Ala 
615 

Ser Pro Val Leu Ser 



Leu Gin His Asp His 
525 

Pro Pro Pro Gin Gin 
540 

Pro Lys Ala His Tyr 
560 

Arg Phe Arg Lys Thr 
575 

Thr Gly Arg Val Leu 
590 

Ser Asn Gly Asn Ser 
605 

Giu Pro Gin Val Pro 
620 

Arg Arg Asp Ser Pro 
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625 



630 



635 



640 



Leu Gin Gly Ser Gly Gin Gin Asn Ser Gin Ala Gly Gin Arg Asn Ser 

650 655 

Thr Ser He Glu Pro Arg Leu Leu Trp Glu Arg Val Glu Lys Leu Val 
660 665 670 

Pro Arg Pro Gly Ser Gly Ser Ser Ser Gly Ser Ser Asn Ser Gly Se- 
6'^^ 680 685 

Gin Pro Gly Ser Kis Pro Gly Ser Gin Ser Gly Ser Gly Glu Arg Phe 
690 695 700 

Arg Val Arg Ser Ser Ser Lys Ser 'Glu Gly 3er Pro Ser Gin Arg Leu 

715 720 

Glu Asn Ala Val Lys Lys Pro Glu Asp Lys Lys Glu Val Phe Arg Pro 
725 730- 735 

Leu Lys Pro Ala Asp Leu Thr Ala Leu Ala Lys Glu Leu Arg Ala Val 
740 745 750 

Glu Asp Val Arg Pro Pro His Lys Val Thr Asp Tyr Ser Ser Ser Ser 
755 760 765 

Glu .Glu Ser Gly Thr Thr Asp Glu Glu Asp Asn Aso Val Glu Gin Glu 
770 775 780 



Gly Ala Asp Glu Ser Thr Ser Gl 



785 



790 



y Pro Glu Asp Thr Arg Ala Ala S 



795 



er 
800 



Ser Leu Asn 



Leu Ser Asn Gly Glu Thr Glu Ser Val Lys Thr Met He 
805 810 815 



Val His Asp Asp Val Glu Ser Glu Pro Ala Met Thr Pro Ser Lys Glu 
820 825 830 

Gly Thr Leu He' Val Arg Arg Thr Gin Ser Ala Ser Ser Thr Leu Gin 

835 840 845 

Lys His Lys Ser Ser Ser Ser Phe Thr Pro Phe He Asp Pro Arq Leu 

855 860 



Leu Gin lie Ser Pro Ser Ser Gly Thr Thr V 



865 



870 



al Thr Ser Val Val Gl^ 



875 



880. 



Phe Ser Cys Asp Gly Met Arg Pro Glu Ala lie Arg Gin Asp Pro Tr 
885 890 895 



nr 



Arg Lys Gly Ser Val Val Asn Val Asn Pro Thr Asn Thr Arg Pro Gin 

905 910 



900 



Ser Asp Thr Pro Giu He Arg Lys Tyr Lys Lys Arg Phe Asn Ser Glu 

920 925 
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lie Leu Cys Ala Ala Leu Trp Gly Val Asn Leu Leu Val Gly Thr Glu 
930 935 940 

Ser Gly Leu Met Leu Leu Asp Arg Ser Gly Gin Gly Lys Val Tyr Pro 
945 950 955 960 

Leu lie Asn Arg Arg Arg Phe Gin Gin Met Asp Val Leu Glu Gly Leu 
965 970 975 

Asn Val Leu Val Thr lie Ser Gly Lys Lys Asp Lys Leu Arg Val Tyr 
980 985 990 

Tyr Leu Ser Trp Leu Arg Asn Lys lie Leu His Asn Asp Pro Glu Val 
995 1000 1005 

Glu Lys Lys Gin Gly Trp Thr Thr Val Gly Asp Leu Glu Gly Cys Val 
iOlO 1015 1020 

His Tyr Lys Val Val Lys Tyr Glu Arg lie Lys Phe Leu Val lie Ala 
1025 1030 1035 1040 

Leu Lys Ser Ser Val Glu Val Tyr Ala Trp Ala Pro Lys Pro Tyr His 
1045 1050 1055 



Lys Phe Met Ala Phe Lys Ser Phe Gly Glu Leu Val His Lys Pro Leu 
1060 1065 1070 

Leu Val Asp Leu Thr Val Glu Glu Gly Gin Arg Leu Lys Val lie Tyr 
1075 1080 1085 

Gly Ser Cys Ala Gly Phe ^is Ala Val Asp Val Asp Ser Gly. Ser Val 
1090 1095 1100 

Tyr Asp lie Tyr Leu Pro Thr His lie Gin Cys Ser lie Lys Pro His 
1105 1110 1115 1120 

Ala lie lie lie Leu Pro Asn Thr Asp Gly Met Glu Leu Leu Val Cys 
1125 1130 1135 

Tyr Glu Asp Glu Gly Val Tyr Val Asn Thr Tyr Gly Arg lie Thr Lys 
1140 1145 ■ 1150 

Asp Val Val Leu Gin Trp Gly Glu Met Pro Thr Ser Val Ala Tyr lie 
1155 1160 1165 

Arg Ser Asn Gin Thr Met Gly Trp Gly Glu Lys Ala lie Glu lie Arg 
1170 1175 1180 

Ser Val Glu Thr Gly His Leu Asp Gly Val Phe Met His Lys Arg Ala 
1185 1190 1195 ' 1200 



Gin Arg Leu Lys Phe Leu Cys Glu Arg Asn Asp Lys Val Phe Phe Ala 
1205 1210 ^ 1215 
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Ser Vai Arg Ser Gly Gly Ser Ser Gin Val Tyr Phe Met Thr Leu dy 
1220 1225 1230 

Arg Thr Ser Leu Leu Ser Trp 
1235 



<210> 14 
<211> 1297 
<212> PRT 

<2i3> Mammalian (Human) 2C2 
<400> 14 

Aia Phe Gly Glu Val Tyr Glu Gly Arg His Val Lys Thr Gly Gin Leu 

10 15 

Ala Ala He Lys Vai Me. Asp Val Thr Gly Asp Glu Glu Glu Glu He 
^° 25 30 



Lys Gin Glu He Asn Met Leu Lys Lys Tyr Ser His His Arg Asn He 
35 40 45 

Ala Thr Tyr Tyr Gly Ala Phe He Lys Lys Asn Pro Pro Gly Met Asp 



60 



Asp Gin Leu Trp Leu Val Met Glu Phe Cys Gly Ala Gly Ser Val Thr 

""^ 75 80 

Asp Leu He Lys Asn Thr Lys Gly Asn Thr Leu Lys Glu Glu Trp He 
85 90 95 

Ala Tyr He Cys Arg Glu He Leu Arg Gly Leu Ser His Leu His Gin 



100 



110 



His Lys val He H.s Arg Asp He Lys Gly Gin Asn Val Leu Leu Thr 



120 



125 



Glu Asn Aia Glu Vai Lys Leu Val Asp Phe Gly Vai Ser Ala Gin Leu 

135 1^0 

Asp Arg Thr Val Gly Arg Arg Asn Thr Phe lie Gly Thr Pro Tyr Trp 



155 



160 



Met Ala Pro Glu Val He Ala Cys Asp Glu Asn Pro Asp Ala Thr Tyr 

I'O 175 

Asp Phe Lys Ser Asp Leu Trp Ser Leu Gly He Thr Ala He Glu Met 
180 185 190 

Ala Glu Gly Aia Pro Pro Leu Cys Asp Met His Pro Met Arg Ala Leu 
195 200 205 

Phe Leu He Pro Arg Asn Pro Ala Pro Arg Leu Lys Ser Lys Lys Trp 
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215 



220 



Ser Lys Lys Phe Gin Ser Phe lie Glu Ser Cys Leu Val Lys Asn His 
225 230 235 240 

Ser Gin Arg Pro Ala Thr Glu Gin Leu Met Lys His Pro Phe lie Arg 
245 250 255 

Asp Gin Pro Asn Glu Arg Gin Val Arg lie Gin Leu Lys Asp His lie 
260 265 270 

Asp Arg Thr Lys Lys Lys Arg Gly Glu Lys Asp Glu Thr Glu Tyr Glu 
275 280 285 

Tyr Ser Gly Ser Glu Glu Glu Glu Glu Glu Asn Asp Ser Gly Glu Pro 
290 295 300' 



Ser Ser lie* Leu Asn Leu Pro Arg Glu Ser Thr Leu Arg Arg Asp Phe 
305 310 315 320 

Leu Arg Leu Gin Leu Ala Asn Lys Glu Arg Ser Glu Ala Leu Arg Arg 
325 330 335 

Gin Gin Leu Glu Gin Gin Gin Arg Glu Asn Glu Glu His Lys Arg Gin 
340 345 350 

Leu Leu Ala Glu Arg Gin Lys Arg lie Glu Glu Gin Lys Glu Gin Arg 
355 360 365 

Arg Arg Leu Glu Glu Gin Gin Arg Arg Glu Lys Glu Leu Arg Lys Gin 
370 375 380 

Gin Glu Arg Glu Gin Arg Arg His Tyr Glu Glu Gin Met Arg Arg Glu 
385 390 395 400 

Glu Glu Arg Arg Arg Ala Glu His Glu Gin Glu Tyr Lys Arg Lys Gin 
405 410 415 

Leu Glu Glu Gin Arg Gin Ala Glu Arg Leu Gin Arg Gin Leu Lys Gin 
420 425 430 

Glu Arg Asp Tyr Leu Val Ser Leu Gin His Gin Arg Gin Giu Gin Arg 
435 440 445 

Pro Val Glu Lys Lys Pro Leu Tyr His Tyr Lys Glu Gly Met Ser Pro 
450 455 460 

Ser Glu Lys Pro Ala Trp Ala Lys Glu Val Glu Glu Arg Ser Arg Leu 
465 470 475 480 

Asn Arg Gin Ser Ser Pro Ala Met Pro His Lys Val Ala Asn Arg lie 
485 490 495 



Ser Asp Pro Asn Leu Pro Pro Arg Ser Glu Ser Phe Ser lie Ser Gly 
500 - 505 510 
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Vai Gin Pro Aia Arg Thr Pro Pro Met Leu Arg Pro Val Asd Pro Gin 

515 520 525 

lie Pro His Leu Val Aia Vai Lys Ser Gin Giv Pro Aia Leu Thr Aia 

530 535 ^ 540 

Ser Gin Ser Vai His Glu Gin Pro Thr Lys Gly Leu Ser Giy Phe Gin 

550 555 560 

Glu Aia Leu Asn Vai Thr Ser His Arg Vai Glu Met Pro Arg Gin Asn 

565 570 575 



Ser Asp Pro Thr Ser Glu Asn Pro Pro Leu Pro Thr Arg lie Giu Lys 
580 585 590 

Phe Asp Arg Ser Ser Trp Leu Arg Gin Giu Giu Asp lie Pro Pro Lys 
595 ■ 600- 605 

Vai Pro Gin Arg Thr Thr Ser lie Ser Pro Aia Leu Aia Arg Lys Asn 
610 615 620 

Ser Pro Giy Asn Giy Ser Aia Leu Giy Pro Arg Leu Giy Ser Gin Pro 
625 630 635 640 

lie Arg Aia Ser Asn Pro Asp Leu Arg Arg Thr Giu Pro lie Leu Giu 
645 650 655 

Ser Pro Leu Gin Arg Thr Ser Ser Giy Ser Ser Ser Ser Ser Ser Thr 
660 665 670 

Pro Ser Ser Gin Pro Ser Ser Gin Gly Gly Ser Gin Pro Giy Ser Gin 
675 680 685 

Aia Gly Ser Ser Giu Arg Thr Arg Vai Arg Aia Asn Ser Lys Ser Giu 
690 695 700 

Giy Ser Pro. Val Leu Pro His Giu Pro Aia Lys Val Lys Pro Giu Giu 
705 710 . 715 720 

Ser Arg Asp lie Thr Arg Pro Ser Arg Pro Aia Ser Tyr Lys Lys Aia 
725 730 735 

lie Asp Glu Asp Leu Thr Aia Leu Ala Lys Glu Leu Arg Glu Leu Arg 
740 745 750 

lie Giu Giu Thr Asn Arg Pro Met Lys Lys Vai Thr Asp Tyr Ser Ser 
755 760 765 

Ser Ser Giu Giu Ser Giu Ser Ser Glu Giu Giu Glu Giu Asp Giy Giu 
770 775 780 

Ser Glu Thr His Asp Gly Thr Vai Aia Vai Ser Asp lie Pro Arg Leu 
785 790 795 800 
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lie Pro Thr Gly Ala Pro Giy Ser Asn Giu Gin Tyr Asn Val Gly Met 

805 810 815 

Val Gly Thr His Gly Leu Glu Thr Ser His Ala Asp Ser Phe Ser Giy 

820 825 830 

Ser lie Ser Arg Glu Gly Thr Leu Met lie Arg Glu Thr Ser Gly Glu 

835 840 845 



Lys Lys Arg Ser Giy His Ser Asp Ser Asn Giy Phe Ala Gly His lie 
850 855 860 

Asn Leu Pro Asp Leu Val Gin Gin Ser His Ser Pro Ala Gly Thr Pro 
865 870 875 880 

Thr Glu Giy Leu Gly Arg Val Ser Thr His Ser Gin Glu Met Asp Ser 
885 890 S95 

Gly Thr Glu Tyr Giy Met Gly Ser Ser Thr Lys Ala Ser Phe Thr Pro 
900 905 910 

Phe Val Asp Pro Arg Val Tyr Gin Thr Ser Pro Thr Asp Glu Asp Glu 
915 920 925 

Glu Asp Glu Giu Ser Ser Ala Ala Ala Leu Phe Thr Gly Giu Leu Leu 
930 935 940 

Arg Gin Giu Gin Ala Lys Leu Asn Giu Ala Arg Lys lie Ser Val Val 
945 950 955 960 

Asn Val Asn Pro Thr Asn lie Arg Pro His Ser Asp Thr Pro Giu lie 
965 970 ^ 975 

Arg Lys Tyr Lys Lys Arg Phe Asn Ser Giu lie Leu Cys Ala Ala Leu 
980 985 990 

Trp Giy Val Asn Leu Leu Val Giy Thr Glu Asn Giy Leu Met Leu Leu 
995 1000 1005 

Asp Arg Ser Gly Gin Gly Lys Val Tyr Asn Leu lie Asn Arg Arg Arg 
1010 1015 1020 

Phe Gin Gin Met Asp Val Leu Giu Giy Leu Asn Val Leu Val Thr lie 
1025 1030 1035 1040 

Ser Gly Lys Lys Asn Lys Leu Arg Val Tyr Tyr Leu Ser Trp Leu Arg 
1045 1050 1055 

Asn Arg lie Leu His Asn Asp Pro Glu Val Giu Lys Lys Gin Gly Trp 
1060 1065 1070 

lie Thr Val Gly Asp Leu Giu Giy Cys lie His Tyr Lys Val Val Lys 
1075 1080 ■ 1085 

Tyr Giu Arg lie Lys Phe Leu Val lie Ala Leu Lvs Asn Ala Val Glu 
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1090 1095 1100 

lie Tyr Ala Trp Ala Pro Lys Pro Tyr His Lys Phe Met Ala Phe Lys 
1105 1110 1115 1120 



Ser Phe Ala Asp Leu Gin His Lys Pro Leu Leu Vai Asp Leu Thr Val 
1125 1130 ' 1135 

Giu Giu Gly Gin Arg Leu Lys Val lie Phe Gly Ser His Thr Gly Phe 
1140 1145 1150 

His Val lie Asp Val Asp Ser Gly Asn Ser Tyr Asp lie Tyr Thr Pro 
1155 * 1160 " 1165 

Ser His lie Gin Gly Asn lie Thr Pro His Ala lie Val lie Leu Pro 
1170 1175 1180 

Lys Thr Asp Gly Met Giu Met Leu Val Cys Tyr Giu Asp Giu Gly Vai 
1185 1190 1195 1200 

Tyr Val Asn Thr Tyr Gly Arg lie Thr Lys Asp Vai Vai Leu Gin Trp 
1205 1210 1215 

Gly Giu Met Pro Thr Ser Vai Ala Tyr lie His Ser Asn Gin lie Met 
1220 1225 1230 



Gly Trp Gly Giu Lys Ala lie Giu lie Arg Ser Vai Giu Thr Gly His 
1235 1240 1245 

Leu Asp Gly Vai Phe Met His Lys Arg Aia Gin* Arg Leu Lys Phe Leu 
1250 1255 1260 



Cys Giu Arg Asn Asp Lys Vai Phe Phe Aia Ser Vai Arg Ser Gly Gly 
1265 1270 1275 1280 

Ser Ser Gin Vai Phe Phe Met Thr Leu Asn Arg Asn Ser Met Met Asn 
1285 1290 1295 

Trp 



<210> 15 
<211> 1326 
<212> PRT 

<213> Mammalian (Human) 2C3 
<400> 15 

Aia Phe Gly Giu Vai Tyr Giu Gly Arg His Vai Lys Thr Giy Gin Leu 
1 5 10 15 

Aia Aia lie Lys Vai Met Asp Vai Thr Giu Asp Giu Giu Giu Giu lie 
20 25 ' 30 
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Lys Gin Giu lie Asn Met Leu Lys Lys Tyr Ser His His Arg Asn He 
35 40 45 

Ala Thr Tyr Tyr Gly Ala Phe He Lys Lys Ser Pro Pro Gly Asn Asp 
50 55 60 

Asp Gin Leu Trp Leu Val Met Glu Phe Cys Gly Ala Gly Ser Val Thr 
65 70 75 80 

Asp Leu Val Lys Asn Thr Lys Gly Asn Ala Leu Lys Glu Asp Cys He 
85 90 95 

Ala Tyr He Cys Arg Glu He Leu Arg Gly Leu Ala His Leu His Ala 
100 105 110 

His Lys Val He His Arg Asp He Lys Gly Gin Asn Val Leu Leu Thr 
115 120 125 

Glu Asn Ala ■ Giu Val Lys Leu Val Asp Phe Gly Val Ser Ala Gin Leu' 
130 135 ^ 140 

Asp Arg Thr Val Gly Arg Arg Asn Thr Phe He Gly Thr Pro Tyr Trp 
145 150 155 160 

Met Ala Pro Giu Val He Ala Cys Asp Glu Asn Pro Asp Ala Thr Tyr 
165 170 175 

Asp Tyr Arg Ser Asp He Trp Ser Leu Gly He Thr Ala He Glu Met 
180 185 190 

Ala Glu Gly Ala Pro Pro Leu Cys Asp Met His Pro Met Arg. Ala Leu 
195 . 200 205 

Phe Leu He Pro Arg Asn Pro Pro Pro Arg Leu Lys Ser Lys Lys Trp 
210 215 220 

Ser Lys Lys Phe He Asp Phe He Asp Thr Cys Leu He Lys Thr Tyr 
225 230 235 240 

Leu Ser Arg Pro Pro Thr Glu Gin Leu Leu Lys Phe Pro Phe He Arg 
245 250 255 

Asp Gin Pro Thr Glu Arg Gin Val Arg He Gin Leu Lys Asp His He 
260 265 270 

Asp Arg Ser Arg Lys Lys Arg Gly Glu Lys Glu Glu Thr Giu Tyr Giu 
275 280 285 

Tyr Ser Gly Ser Glu Glu Glu Asp Asp Ser His Gly Glu Glu Gly Glu 
290 295 300 

Pro Ser Ser He Met Asn Val Pro Gly Giu Ser Thr Leu Arg Arg Giu 
305 310 315 ^320 

Phe Leu Arg Leu Gin Gin Giu Asn Lys Ser Asn Ser Glu Ala Leu Lys 
325 330 335 
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Gin Gin Gin Gin Leu Gin Gin Gin Gin Gin Arg Asp Pro Glu Ala His 
340 345 350 

He Lys His Leu Leu His Gin Arg Gin Arg Arg He Glu Glu Gin Lys 
355 360 365 

Glu Glu Arg Arg Arg Val Glu Glu Gin Gin Arg Arg Glu Arg Glu Gin 
370 375 380 

Arg Lys Leu Gin Glu Lys Glu Gin Gin Arg Arg Leu Glu Asp Met Gin 
385 390 395 400 

Ala Leu Arg Arg Glu Glu Glu Arg Arg Gin Ala Glu Arg Glu Gin Glu 
405 410 415 

Tyr He Arg His Arg Leu Glu Glu Glu Gin Arg Gin Leu Glu He Leu 
420 425 430 

Gin Gin Gin Leu Leu Gin Glu Gin Ala Leu Leu Leu Glu Tyr Lys Arg 
435 440 445 

Lys Gin Leu Glu Glu Gin Arg Gin Ser Glu Arg Leu Gin Arg Gin Leu 
450 455 460 

Gin Gin Glu His Ala Tyr Leu Lys Ser Leu Gin Gin Gin Gin Gin Gin 

470 475 480 

Gin Gin Leu Gin Lys Gin Gin Gin Gin Gin Leu Leu Pro Giy Asp Arg 
485 490 495 

Lys Pro Leu Tyr His Tyr Gly Arg Giy Met Asn Pro Ala Asp Lys Pro 
500 505 510 

Ala Trp Ala Arg Glu Val Glu Glu Arg Thr Arg Met Asn Lys Gin Gin 
515 520 525 

Asn Ser Pro Leu Ala Lys Ser Lys Pro Gly Ser Thr Giy Pro Glu Pro 
530 535 54Q 

Pro He Pro Gin Ala Ser Pro Gly Pro Pro Gly Pro Leu Ser Gin Thr 

550 555 560 

Pro Pro Met Gin Arg Pro Val Glu Pro Gin Glu Giy Pro His Lys Ser 
565 570 575 

Leu Val Ala His Arg Val Pro Leu Lys Pro Tyr Ala Ala Pro Val Pro 
580 585 590 



Arg Ser Gin Ser Leu Gin Asp Gin Pro Thr Arg Asn Leu Ala Ala Phe 
595 600 605 



Pro Ala Ser His Asp Pro Asp Pro Ala He Pro Ala Pro Thr Ala Thr 

615 620 
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Pro Ser Ala Arg Giy Ala Val lie Arg Gin Asn Ser Asp Pro Thr Ser 
625 630 635 640 

Giu Gly Pro Gly Pro Ser Pro Asn Pro Pro Ala Trp Val Arg Pro Asp 
645 650 655 

Asn Glu Ala Pro Pro Lys Val Pro Gin Arg Thr Ser Ser lie Ala Thr 
660 665 670 

Ala Leu Asn Thr Ser Gly Ala Gly Gly Ser Arg Pro Ala Gin Ala Val 
675 680 685 

Arg Ala Arg Pro Arg Ser Asn Ser Ala Trp Gin lie Tyr Leu Gin Arg 
690 695 700 

Arg Ala Glu Arg Gly Thr Pro Lys Pro Pro Gly Pro Pro Ala Gin Pro 
705 710 715 720 

Pro Gly Pro " Pro Asn Ala Ser Ser Asn Pro- Asp Leu Arg Arg Ser Asp 
725 730 735 

Pro Gly Trp Glu Arg Ser Asp Ser Val Leu Pro Ala Ser His Gly His 
740 745 750 

Leu Pro Gin Ala Gly Ser Leu Glu Arg Asn Arg Val Gly Val Ser Ser 
755 760 765 

Lys Pro Asp Ser Ser Pro Val Leu Ser Pro Gly Asn Lys Ala Lys Pro 
770 775 780 

Asp Asp His Arg Ser Arg Pro Gly Arg Pro Ala Asp Phe Val Leu Leu 
785 790 795 800 

Lys Glu Arg Thr Leu Asp Glu Ala Pro Arg Pro Pro Lys Lys Ala Met 
805 810 815 

Asp Tyr Ser Ser Ser Ser Glu- Glu Val Glu Ser Ser Glu Asp Asp Glu 
820 825 830 

Glu Glu Gly Glu Gly Gly Pro Ala Glu Gly Ser Arg Asp Thr Pro Gly 
835 840 845 

Gly Arg Asp Gly Asp Thr Asp Ser Val Ser Thr Met Val Val His Asd 
850 855 860 



Val Glu Glu He Thr Gly Thr Gin Pro Pro Tyr Gly Gly Gly Thr Met 

865 870 875 880 

Val Val Gin Arg Thr Pro Glu Glu Glu Arg Asn Leu Leu His Ala Asp 

"885 890 895 

Ser Asn Gly Tyr Thr Asn Leu Pro Asp Val Val Gin Pre Ser His Ser 

900 905 910 

Pro Thr Glu Asn Ser Lys Gly Gin Ser Pro Pro Ser Lys Asp Gly Ser 
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915 920 925 

Gly Asp Tyr Gin Ser Arg Gly Leu Val Lys Ala Pro Gly Lys Ser Ser 
930 935 940 

Phe Thr Met Phe Val Asp Leu Gly lie Tyr Gin Pro Gly Gly Ser Gly 
945 950 955 960 

Asp Ser lie Pro lie Thr Ala Leu Val Gly Gly Glu Gly Thr Arg Leu 
965 970 975 

Asp Gin Leu Gin Tyr Asp Val Arg Lys Gly Ser Val Val Asn Val Asn 
980 985 990 

Pro Thr Asn Thr Arg Ala His Ser- Glu Thr Pro Glu lie Arg Lys Tyr 
995 1000 1005 

Lys Lys Arg Phe Asn Ser Glu lie Leu Cys Ala Ala Leu Trp Gly Val 
. 1010 1015 1020 

Asn Leu Leu Val Gly Thr Glu Asn Gly Leu Met Leu Leu Asp Arg Ser 
1025 1030 1035 ' 1040 

Gly Gin Gly Lys Val Tyr Gly Leu lie Gly Arg Arg Arg Phe Gin Gin 
1045 1050 1055 

Met , Asp Val Leu Glu Gly Leu Asn Leu Leu lie Thr lie Ser Gly Lys 
1060 1065 1070 

Arg Asn Lys Leu Arg Val Tyr Tyr Leu Ser Trp Leu Arg Asn Lys lie 
1075 1080 1085 

Leu His Asn Asp Pro Glu Val Glu Lys Lys Gin Gly Trp Thr Thr Val 
1090 1095 1100 

Gly Asp Met Glu Gly Cys Gly His Tyr Arg Val Val Lys Tyr Glu Arg 
1105 1110 1115 1120 

lie Lys Phe Leu Val lie Ala Leu Lys Ser Ser Val Glu Val Tyr Ala 
1125 1130 1135 



Trp Ala Pro Lys ir'ro Tyr His Lys Phe Met Ala Phe Lys Ser Phe Ala 

1140 1145 1150 

A.sp Leu Pro His Arg Pro Leu Leu Val Asp Leu Thr Val Glu Glu Gly 

1155 1160 1165 

Gin Arg Leu Lys Val lie Tyr Gly Ser Ser Ala Gly Phe His Ala Val 

1170 1175 1180 

Asp Val Asp Ser Gly Asn Ser Tyr Asp lie Tyr lie Pro Val His lie 

1135 1190 1195 1200 

Gin Ser Gin lie Thr Pro His Ala lie lie Phe Leu Pro Asn Thr Asc 
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1205 1210 1215 

Giy Met Glu Met Leu Leu Cys Tyr Glu Asp Glu Gly Val Tyr Val Asn 
1220 1225 123C 

Thr Tyr Gly Arg lie lie Lys Asp Val Val Leu Gin Trp Gly Glu Met 
1235 1240 1245 

Pro Thr Ser Val Ala Tyr lie Cys Ser Asn Gin lie Met Gly Trp Gly 
1250 1255 1260 

Glu Lys Ala lie Glu lie Arg Ser Val Giu Thr Gly His Leu Asp Gly 
1265 1270 1275 1280 

Val Phe Met His Lys Arg Ala Gin Arg Leu Lys Phe Leu Cys Giu Arg 
1285 1290 1295 

Asn Asp Lys Val Phe Phe Ala Ser Val Arg Ser Gly Giy Ser Ser Gin 
■1300 .1305 1310 

Vai Tyr Phe Met Thr Leu Asn Arg Asn Arg lie Met Asn Trp 
1315 1320 1325 



<210> 16 

<21i> 349 

<212> PRT 

<213> Mammalian (Human) ZC4 

<400> 16 

Asn Vai Asn Pro Leu Tyr Vai Ser Pro Ala Cys Lys Lys Pro Leu lie 
15 10 15 

His Met Tyr Giu Lys Giu Phe Thr Ser Giu lie Cys Cys Giy Ser Leu 

20 25 - 30 • 

Trp Gly Val Asn Leu Leu Leu Gly Thr Arg Ser Asn Leu Tyr Leu Met 
35 40 . 45 

Asp Arg Ser Giy Lys Ala Asp lie Thr Lys Leu lie Arg Arg Arg Pro 
50 55 60 

Phe Arg Gin lie Gin Vai Leu Giu Pro Leu Asn Leu Leu lie Thr lie 
65 70 75 80 

Ser Giy His Lys Asn Arg Leu Arg Vai Tyr His Leu Thr Trp Leu Arg 
85 90 " 95 



Asn Lys lie Leu Asn Asn Asp Pro 
100 

Met Leu Lys Thr Giu Giu Aia Cys 

115 120 



Giu Ser Lys Arg Arg Gin Glu Glu 

105 110 

Lys Aia lie Asp Lys Leu Thr Giy 
125 
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Cys Glu His Phe Ser Val Leu Gin His Glu Glu Thr Thr Tyr lie Ala 
130 135 140 

lie Ala Leu Lys Ser Ser lie His Leu Tyr Ala Trp Ala Pro Lys Ser 
145 150 155 * 160 

Phe Asp Glu Ser Thr Ala lie Lys Val Phe Pro Thr Leu Asp His Lys 
165 170 175 

Pro Val Thr Val Asp Leu Ala lie Gly Ser Glu Lys Arg Leu Lys lie 
180 185 190 

Phe Phe Ser Ser Ala Asp Gly Tyr His Leu lie Asp Ala Glu Ser Glu 
195 200 205 

Val Met Ser Asp Val Thr Leu Pro Lys Asn Pro Leu Glu lie lie lie 
210 215 • 220 

Pro Gin Asn -lie lie lie Leu Pro. Asp Cys Leu Gly lie Gly Met Met 
225 230 235 240 

Leu Thr Phe Asn Ala Glu Ala Leu Ser Val Glu Ala Asn Glu Gin Leu 
245 250 255 

Phe Lys Lys lie Leu Glu Met Trp Lys Asp lie Pro Ser Ser lie Ala 
260 265 270 

Phe Glu Cys Thr Gin Arg Thr Thr Gly Trp Gly Gin Lys Ala lie Glu 
275 280 285 

Val Arg Ser Leu Gin Ser Arg Val Leu Glu Ser Glu Leu Lys Arg Arg 
290 295 300 



Ser lie Lys Lys Leu Arg Phe Leu Cys Thr Arg Gly Asp Lys Leu Phe 
305 310 315 320 

Phe Thr Ser Thr Leu Arg Asn His His Ser Arg Val Tyr Phe Met Thr 
325 330 335 

Leu Gly Lys Leu Glu Glu Leu Gin Ser Asn Tyr Asp Val 
340 345 



<210> 17 
<211> 4023 
<212> DNA 

<213> MaiTL-nalian (Human) KHS2 
<400> 17 • 

ccgccargaa ccccggcttc gatttgrccc gccggaaccc gcaggaggac ttcgagctga 60 
ttcagcgcat cggcagcggc acctacggcg acgtctacaa ggcacggaat gttaacacrg 120 
gtgaarragc agcaatnaaa gtaataaaat tggaaccagg agaagacttt gcagttgtgc 180 
agcaagaaat tartatgatg aaagacrgta aacacccaaa tattgttgct cartttggaa 240 
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gctatctcag gcgagataag ctttggattt gcatggagtt ttgtggaggt ggttctttac 300 

aggatattta tcacgtaact ggacctctgt cagaactgca aattgcatat gtcagcagag 360 

aaacacrgca gggattatat ratcttcaca gtaaaggaaa aatgcacaga gatataaagg 420 

gagctaacat rctattaacg gataacggtc atgtgaaatt ggctgactct ggagtarctg 480 

cacagataac agctacaatt gccaaacgga agtctttcat tggcacacca tatrggatgg 540 

ctccagaagt tgcagctgtt gagaggaagg ggggttacaa tcaactctgt gatctctggg 600 

cagtgggaat cactgccata gaacttgcag agcttcagcc tcctatgttt gactcacacc 660 

caatgagagc attatttcra atgacaaaaa gcaattttca gcctcctaaa ctaaaggata 720 

aaatgaaatg gtcaaatagt tttcatcact ttgtgaaaat ggcacttacc aaaaatccga 780 

aaaaaagacc tactgctgaa aaattattac agcatccttt tgtaacacaa catttgacac 840 

ggtctttggc aatcgagctg ttggataaag taaataatcc agatcattcc acttaccatg 900 

atttcgatga tgacgaccct gagcctcttg ttgctgtacc acatagaatt cactcaacaa 960 

gtagaaacgt gagagaagaa aaaacacgcx: cagagataac ctttggccaa gtgaaatttg 1020 

atccaccctr aagaaaggag acagaaccac atcatgaacr tcccgacagt gatggttttt 1080 

tggacagttc agaagaaata tactacactg caagarctaa tctggatctg caactggaat 1140 

arggacaacg acaccaaggt ggttactttt taggtgcaaa caagagtctt ctcaagtctg 1200 

ttgaagaaga attgcatcag cgaggacacg tcgcacattt agaagatgat gaaggagatg 1260 

atgatgaatc taaacactca actctgaaag caaaaattcc acctcctttg ccaccaaagc 1320 

ctiaagtctat ottcatacca caggaaatgc attctactga ggatgaaaat caaggaacaa 1380 

tcaagagatg tcccatgcca gggagcccag caaagccatc ccaagtrcca cctagaccac 1440 

cacctcccag attaccccca cacaaacctg ttgccttagg aaatggaatg agctccttcc 1500 

agttaaatgg tgaacgagat ggctcattat gtcaacaaca gaatgaacat agaggcacaa 1560 

accttucaag aaaagaaaag aaagatgtac caaagcctat tagtaatggt cttcctccaa 1620 

cacctaaagt gcatatgggr gcatgttttt caaaagtttt taatgggtgt cccttgaaaa 1680 

ttcactgtgc atcatcatgg araaacccag atacaagaga tcagtacttg atatttggtg 1740 

ccgaagaagg gat.ttatacc ctcaatctta atgaacttca tgaaacatca atggaacagc 1800 

tattccctcg aaggtgtaca tggttgtatg taargaacaa ttgcttgcta tcaatatctg 1860 

gtaaagcttc tcagctttat tcccataatt taccagggct ttttgattat gcaagacaaa 1920 

tgcaaaagtt acctgttgct attccagcac acaaactccc tgacagaata ctgccaagga 1980 

aattttctgt atcagcaaaa atccctgaaa ccaaatggtg ccagaagtgt tgtgtrgtaa 2040 

gaaatccrta cacgggccat aaatacctat gtggagcact tcagactagc attgttctat 2100 

tagaatgggt tgaaccaatg cagaaattta tgrtaattaa gcacatagat tttcctatac 2160 

catgtccact tagaacgttt gaaatgctgg tagttcctga acaggagtac cctttagttt 2220 

grgttggtgt cagtagaggt agagacttca accaagtggt tcgatttgag acggtcaatc 2280 

caaattctac ctcttcatgg ttracagaat cagatacccc acagacaaat gttactcatg 2340 

taacccaact ggagagagat accatccttg tatgcttgga ctgttgtata aaaatagtaa 2400 

atctccaagg aagattaaaa tctagcagga aattgtcatc agaactcacc tttgatttcc 2460 

agattgaatc aatagtgtgc ctacaagaca gtgtgctagc tttctggaaa catggaatgc 2520 

aaggtagaag ttttagatcr aatgaggtaa cacaagaaat ttcagatagc acaagaattt 2580 

tcaggctgct tggatctgac agggtcgtgg ttttggaaag taggccaact gataacccca 2640 

cagcaaatag caatttgtac atcctggcgg gtcatgaaaa cagttactga gaattgttgt 2700 

gctttgacag ttaactctag aaagaaagaa cactaccact gcaacattaa tggatgcttg 2760 

aagctgtaca aaagctgcag taacctgtct tcagttactt tgtaatttat tgtggcatga 2820 

gataagatgg ggaaaatttt gttrtaagtg gtacggatat atttagcata ttgaaccaca 2880 

caagtgctta attcactgtt atgtaatctt tgtacatata ggcagtattt tttccgtgaa 2940 

acttcatatt gctgaagaca tacactaaga atttatgtag ataatgtact tttangagat 3000 

gracaagraa gtgtctratc tgtacagatg taaatgttga tgaaaatgca attggggtta 3060 

atattttaag aattctttag tatattcttg ggtgtggcta tattacaaaa tgggacgctg 3120 

gcaatgaaac aaracattta acactattgt atttttatta targtaatcr agtaatacga 3180 

atataaatct tgraactttt aaaattgtaa rggaggctgt aatcatttta taatcttttt 3240 

aattttaatg caagtacacc ggtgtrtata ttTigcacaaa gtattgatat gtgatgtatt 3300 

aagtcacaaa agtaagctgt gacattgtct atiaagcattt ggctccacaa atgtatttgg 3360 

attgttTitct atgcgaagca aaccaattar aanraaccac atgttgtagt aactggtctt 3420 

ttitatattta agcagaatcc tgraagattg crtgtcttcg cttaaaaaca atacctttga 3480 

acatttttga atcacagaat agcggtacca tgatagaata ctgcaattgc ggtcagaatt 3540 

acagtatgca caaagaanta attagcatta ttaaagagtc ctcacraaac atttcatatg 3600 
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atcacacrga 
tttacttttg 
atacacggcc 
tcacaggaac 
atataccagg 
tttaatgtga 
traatgataa 
ttt 



agaactgtaa 
ttggccttat 
aaaataattt 
ttagtrttgg 
ttttagcaaa 
gtatatacac 
tggttttaaa 



catt ccatag 
tttatgarcc 
agttactacc 
tttaagtcat 
atgcacactt 
taagaacaaa 
atatgttcct 



agrgaagtgg 
ttttcatatt 
tcatacaaac 
ttttgattgc 
ttggctcttt 
ctaaattgtg 
gattgtacat 



ttcaaatttc 
tcttttgacr 
aataraatgg 
tttrrtccaa 
ttggtatatg 
atttatgatc 
attgtaaaat 



tcttggaatt 
tagagtatta 
ttactiacaca 
tggaatatgt 
ttctttatat 
ttcatttatt 
aaacatgtt t 



366C 
3720 
3730 
3840 
3900 
3960 
4020 
4023 



<210> 18 
<211> 894 
<212> PRT 

<213> Mammalian (Human) KHS2 
<400> 18 

Met Asn Pro Gly Phe Asp Leu Ser Arg Arg Asn Pro Gin Glu Asp Phe 
^5 10 15 

Glu Leu lie Gin Arg lie Gly Ser Gly Thr Tyr Gly Asp Val Tyr Lvs 
20 25 30 

Aia Arg Asn Vai Asn Thr Gly Glu Leu Ala Ala lie Lys Vai lie Lys 
35 40 45 

Leu Glu Pro Gly Glu Asp Phe Aia Val Vai Gin Gin Glu lie lie Met 
50 55 60 

Met Lys Asp Cys Lys His Pro Asn lie Vai Aia Tyr Phe Gly Ser Tvr 
" 75 80 

Leu Arg Arg Asp Lys Leu Trp lie Cys Met Glu Phe Cvs Gly Gly Gly 
35 90 ^ 95 

Ser Leu Gin Asp lie Tyr His Vai Thr Gly Pro Leu Ser Glu Leu Gin 
100 105 110 

lie Ala Tyr Vai Ser Arg Glu Thr Leu Gin Gly Leu Tyr Tyr Leu His 
115 120 125 

Ser Lys Gly Lys Met His Arg Asp lie Lys Gly Ala Asn lie Leu Leu 

135 140 



Thr Asp Asn Gly His Vai Lys Leu Ala Asp Phe Gly Vai Ser Aia Gin 

160 



145 150 155 



He Thr Aia Thr He Ala Lys Arg Lys Ser Phe He Gly Thr Pro Tyr 
165 170 175 

Trp Met Aia Pro Glu Vai Aia Aia Vai Glu Arg Lys Gly Gly Tyr Asn 
180 185 190 

Gin Leu Cys Asp Leu Trp Aia Val Gly lie Thr Aia lie G^u Le- Aia 
1^5 200 205 
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Giu Leu Gin Pro Pro Met Phe Asp Leu His Pro Met Arg Ala Leu Phe 
210 215 220 

Leu Met Thr Lys Ser Asn Phe Gin Pro Pro Lys Leu Lys Asp Lys Met 
225 230 235 ' 240 

Lys Trp Ser Asn Ser Phe His His Phe Val Lys Met Ala Leu Thr Lys 
245 250 255 

Asn Pro Lys Lys Arg Pro Thr Ala Glu Lys Leu Leu Gin His Pro Phe 
260 265 . 270 

Val Thr Gin His Leu Thr Arg Ser Leu Ala lie Glu Leu Leu Asp Lys 
275 280 285 

Val Asn Asn Pro Asp His Ser Thr Tyr His Asp Phe Asp Asp Asp Asp 
290 295 300 

Pro Glu Pro Leu Val Ala Val Pro His Arg lie His Ser Thr Ser Arg 
305 310 315 320 

Asn Val Arg Glu Glu Lys Thr Arg Ser Glu lie Thr Phe Gly Gin Val 
325 330 335 

Lys Phe Asp Pro Pro Leu Arg Lys Glu Thr Giu Pro His His Glu Leu 
340 345 350 

Pro Asp Ser Asp Gly Phe Leu Asp Ser Ser Glu Glu lie Tyr Tyr Thr 
355 360 365 

Ala Arg Ser Asn Leu Asp Leu Gin Leu Glu Tyr Gly Gin Gly His Gin 
370 375 380 

Gly Gly Tyr Phe Leu Gly Ala Asn Lys Ser Leu Leu Lys Ser Val Glu 
385 390 395 400 

Glu Glu Leu His Gin Arg Gly His Val Ala His Leu Glu Asp Asp Glu 
405 410 415 

Gly Asp Asp Asp Glu Ser Lys His Ser Thr Leu Lys Ala Lys lie Pro 
420 425 ' 430 

Pro Pro Leu Pro Pro Lys Pro Lys Ser lie Phe lie Pro Gin Giu Met 
435 440 445 

His Ser Thr Glu Asp Glu Asn Gin Gly Thr lie Lys Arg Cys Pro Met 
450 455 460 

Ser Gly Ser Pro Ala Lys Pro Ser Gin Val Pro Pro Arg Pro Pro Pro 
465 470 475 480 

Pro Arg Leu Pro Pro His Lys Pro Val Ala Leu Gly Asn Gly Met Ser 
485 490 495 



Ser Phe Gin Leu Asn Gly Glu Arg Asp Giy Ser Leu Cys Gin Gin Gin 
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500 505 



510 



Asn Glu His Arg Gly Thr Asn Leu Ser Arg Lys Glu Lys Lys Aso Val 

525 



515 520 



Pro Lys Pro He Ser Asn Gly Leu Pro Pro Thr Pro Lys Val His Met 

535 540 

Gly Ala Cys Phe Ser Lys Val Phe Asn Gly Cys Pro Leu Lvs He His 

555 560 

Cys Ala Ser Ser Trp He Asn Pro Asp Thr Arg Asp Gin Tyr Leu He 
565 570 

Phe Gly Ala Glu Glu Gly He Tyr Thr Leu Asn Leu Asn Glu Leu His 
580 585 

Glu Thr Ser. Met GJu.Gln Leu Phe Pro Arg Arg Cys Thr Trp Leu Tyr 
5S5 600 605 

val Met Asn Asn Cys Leu Leu Ser He Ser Gly Lys Ala Ser Gin Leu 
610 615 620 

Tyr Ser His Asn Leu Pro Gly Leu 



625 630 635 



Ly Leu Phe Asp Tyr Ala Arg Gin Met Gin 

640 

Lys Leu Pro Val Ala He Pro Ala His Lys Leu Pro Asd Arg He Leu 
645 650 " 655. 



Pro Arg Lys Phe Ser Val Ser Ala Lys He Pro Glu Thr Lvs Trp Cys 
660 665 670 

Gin Lys Cys Cys Val Val Arg Asn Pro Tyr Thr Gly His Lys Tyr Leu 
675 680 685 

Cys Gly Ala Leu Gin Thr Ser He Val Leu Leu Glu Trp Val Glu 



690 695 



Pro 
700 



Met Gin Lys Phe Met Leu He Lys His He Asp Phe Pro He Pro Cys 
°' 710 715 720 

Pro Leu Arg Met Phe Glu Met Leu Val Val Pro Glu Gin Glu Tyr Pro 

735 



725 . 730 



Leu Val Cys Val Gly Val Ser Arg Gly Arg Asp Phe Asn Gin Vai Val 
740 745 



n Ser Thr Ser Ser Trp Phe Thr Glu 

765 



Arg Phe Glu Thr Val Asn Pro Asr 
755 76C 

Ser Asp Thr Pro Gin Thr Asn Val Thr His Val Thr Gin Leu Glu Arg 

775 780 

Asp Thr He Leu Val Cys Leu Asp Cys Cys He Lys He Vai Asn Leu 

^5"" 795 80C 



BNSDOCID; <WO 9953036A2_L> 



wo 99/53036 



36 



PCT/US99/08150 



Gin Gly Arg Leu Lys Ser Ser Arg Lys Leu Ser Ser Glu Leu Thr Phe 
805 810 , 815 

Asp Phe Gin lie Glu Ser lie Val Cys Leu Gin Asp Ser Val Leu Ala 
820 825 830 

Phe Trp Lys His Gly Met Gin Gly Arg Ser Phe Arg Ser Asn Glu Val 
835 840 845 

Thr Gin Glu lie Ser Asp Ser Thr Arg lie Phe Arg Leu Leu Gly Ser 
850 855 860 

Asp Arg Val Val Val Leu Glu Ser- Arg Pro Thr Asp Asn Pro Thr Ala 
865 870 875 ' 880 

Asn Ser Asn Leu Tyr lie Leu Ala Gly His Glu Asn Ser Tyr 



<210> 19 
<211> 4196 
<212> DNA' 

<213> Mammalian (Human) SULUl 
<400> 19 

gggagggtcc ttgtggcgcc gggcggcggg gtccrgcgtg gagagtggga cgcaacgccg 60 

agaccgcgag cagaggctgc gcacagccgg atccggcact cagcgaccgg acccaaggat 120 

ccgccgggga acaagccaca ggagagcgac tcaggaacaa gtgtgggaga ggaagcggcg 180 

gcggcggcgc cgggcccggg ggtggtgaca gcaggtctga ggttgcatca taaatacaaa 240 

ggacrgaagt tataaaagag aaaagagaag tttgctgcta aaatgaatcr gagcaatatg 300 

gaatattttg tgccacacac aaaaaggrac tgaagattta ccccccaaaa aaaattgtca 360 

atgagaaata aagctaactg atatcaaaaa gcagagcctg ctctactggc catcatgcgt 420 

aaaggggtgc tgaaggaccc agagattgac gatctattct acaaagatga tcctgaggaa 480 

ctttttattg gtttgcatga aattggacat ggaagttttg gagcagttta ttttgctaca 540 

aatgctcaca ccaangaggt ggtggcaatt 'aagaagatgt cctatagtgg gaagcagacc 600 

catgagaaat ggcaagatat tcttaaggaa gttaaatttt tacgacaatt gaagcatcct 660 

aatactattg agtacaaagg ctgttacttg aaagaacaca ctgcttggtt ggtgatggaa 720 

tattgctrag gctcagcctc tgatttatta gaagttcata aaaaaccact tcaggaagtg 780 

gagatcgctg ccattactca tggagccttg catggactag cctacctaca ttctcatgca 840 

ttgattcata gggatattaa agcaggaaat attcttctaa cagagccagg tcaggraaaa 900 

ctagctgart ttggatctgc ttcaatggct tctcctgcca actccttcgt gggcacacct 960 

tactggatgg ctccagaggt gatcttagct atggatgaag gacagtatga tgggaaagtt 1020 

gacattcggt cacrtggcat cacttgtatt gaattggcgg aacggaagcc gccccttttc 1080 

aacatgaatg caargagtgc cttatatcac attgcccaga atgactcccc aacgttacag 1140 

tctaatgaat ggacagactc ctttaggaga tttgrtgatt actgcttgca gaaaatacct 1200 

caggaaaggc caacatcagc agaactatta aggcatgact trgttcgacg agaccggcca 1260 

ctacgtgtcc tcattgacct caracagagg acaaaagatg cagttcgtga gctagataac 1320 

ctacagtacc gaaaaatgaa aaaaatactt ctccaagaga cacggaatgg acccttgaat 1380 

gagtca.cagg aggatgagga agacagtgaa cacggaacca gccLgaacag ggaaatggac 1440 

agcctgggca gcaaccattc cattccaagc atgtccgtga gcacaggcag ccagagcagc 1500 

agtgtgaaca gcatgcagga agrcatggac gagagcagtt ccgaacttgt catgatgcac 1560 . 

gatgacgaaa gcacaatcaa ttccagctcc tccgtcgrgc anaagaaaga tcatgtattc 1620 
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Jaaaaccaoa " cggcgatccc aggcctgagc cgcggcctac ccaa.cagtt 

"aa^t^;?^ ccctccacta ccggaacaga gagcgcttrg ccacgatcaa atcagca^ct 
aa?tI?L" tgagcatgag caggagaacg agttgcggga acagatgj^a 

ggttataagc ggatgcggcg ccagcaccag aagcagctga tcgccctgga gaacaagcta 
aaggctgaga tggacgagca ccgcctcaag ctacagaagg aggtggagL gcatgccaac 
^caaaaa^'a; ^.^^^^^^^^^ gaagctggcc aagaagcaag tggcStcat agaL'aggao 
gcaaagg.ag ctgcagcaga tgagaagaag ttccagcaac agatcttggc ccaacaaLo 
T.llT"^' caactttctr agaaagtcag aagaagcagt acaagatttg ta^gga^^^^ 
araaaagagg aaatgaatga ggaccatagc acacccaaga aagagaagca agagcgga^^ 
caaca":"^ llllir'' gcagcacaca caggctgaag aggaagccca ccttctcact 2220 
caacagaga. tgtactacga caaaaattgt cgtttcttca agcggaaaat aatgatcaag 2280 

2340 
2400 
2460 
2520 
2580 
2640 



1680 
1740 
1800 
1860 
1920 
1930 
2040 
2100 
2160 



cggcacgagg tggagcagca gaacattcgg g^ggaac.aa ataaariiaa gZcc::":' 
gagatggagc atgccatgct aatccggcac gacgagtcca cccgagagcJ ag^gtaJ^'^ 
ctaaaf ' ^^^^^^^^ gctacgcatg gatctgatcc gttLcagca ccagac^gS 
Taaat't^f agctggagta caaraagagg cgagaaagag aactgcacag aaagca^gJc 
ttlTalllt ^J'^"""^^^ aaaaaactta aaggccatgg aaatgcaaac taaaaaacag 
allar-Tt ^^"^"^^^^ acagaccaaa cagtataaag cactcaagaa tcaccagttg 
gaagt.actc caaagaatga gcacaaaaca atcttaaaga cactaaaaga taagcaaaca 27nr 
agaaaacttg ccattttggc agagcagtat qaacaaaar. r...;. = Z?!!!!?!" ?Z°^' 



agagcagtat gaacagagta taaatgaaat gatggcctct 2760 
coagcgttac ggctagatga ggctcaagaa gcagaatgcc aggccttgag gctacagctc lltn 
cagcaggaaa tggagctgcr caacgcctac cagagcaaaa tcaagatgc! a^c^gaggca 
caacatgaac gtgagctcca gaagctagag cagagagtgt ctctacgcag agcacacc" 
gagcagaaga ttgaagagga gctggctgcc cttcagaagg aacgiagcga gIgJaJaaao 
aacctattgg aaaggcaaga gcgagagatt gaaacttttg aca^ggagag cctcaga^a 
tlVtl-lltt attagatttt cctaaggagg actaSgatg T.lllllll 

ttttg.catt tacaaaaaaa aaaaaaaaaa agaaaacaga aaaaaattca gaccctgcaa 
aaccacattc cccattttaa cgggcgttgc tctcactctc tctctctctt actcttacta 
acatcgtgtc ggactagtgc ctgtttattc ttactccatc aggggccccc ^tcct^cccc 
alT.T'^l ttcagtgctg gccaaaacct ggccgtctct tcta^tcaca g^^ca^gtca 
^a^aaaccaa ^^^"""^^ atgtttcagt gaaaactttg gagacagttt Lacaaalcc 
aatc^ataaa f!""^^^^ agtggatgta tattgcttta agcaatcact cattaccacc 
aatctgtgaa agtaaagcaa aaaataataa taataaatgc caagggggag agagacacaa 
g^^tt'at-: cttacacctt aactagctgc tgcattattt tat???attt tatt'ttSg 
gtatttat.c atcaggaata aaaaaaacaa agttttatta aagattgaaa atttgataca 
TolllToT^: -taattgtg atgtacatat cagtggtgac atattattac ttttttgggg 
aact^^r^:^ ^^tggggtga agagatcttg tgatttttaa gaacctgctg gcaagag^Jt 
gaat:c-c.^ ""^""^^^^-^ ^gattgtatc ataatcattt tctgctgttg cagagga^^t 
?attt":"a' "f^^^^'^^^^ cagaatccca gtagcacaaa ttgggctttg gcaaatcgtg 
Itlll:^:^^ atagaaggaa tttaaggaga ggtattactt attttcatat tgtattttaa 
ctgtttctcr gatcaaattt ttttacttcc tcctcctgtt cctccccacc tccJ^ccttt 
ttatc' tatttggagt tcaacactgt ctctcaatca gatcatcttg atctttttct 

ttatctccct tccccttcct aagtcccatt tcttggtcat aaatattgca ttattcacac 
tttcaaactg tgtattttct tacaataaaa aatgatgaaa aaaaaaaaaa aaaaL 



2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4196 



<210> 20 
<211> 3824 
<212> DNA 

<213> Mammalian (Human) SULU3 
<400> 20 

tarcgaatrg gcggaacgga agcctccttt atttaatatg aatgcaatga g^gccttata fin 
caaaatgaat ccccracact acagtctaac gaatggtc^g LtJt^t^cg .tn 
c^actttg.a gattcttgcc tccagaaaat ccc.caaga. cgaccLca? cagagg^^c^ 
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ttraaagcac atatttgttc ctcgggagcg ccctgaaacc gtgctaatiag atctcattca 240 

gaggacaaag gatgcagraa gagagctgga caatctgcag tatcgaaaga tgaagaaact 300 

ccttttccag gaggcacata atggaccagc agtagaagca caggaagaag aagaggaaca 360 

agatcatggt gttggccgga caggaacagt taatagtgtt ggaagtaatc aatccattcc 420 

cagcatgtcc atcagtgcca gcagccaaag cagtagtgtt aacagtcttc cagatgcctc 480 

agatgacaag agtgagctag acatgatgga gggagaccac acagtgatgt ctaacagttc 540 

tgttatccat ttaaaaccag aggaagaaaa tracagagaa gagggagatc ctagaacaag 600 

agcatcagat ccacaatctc caccccaagt atctcgtcac aaatcacact atcgraatcg 660 

agaacacttt gcractatac ggacagcatc actggttacg aggcaaatgc aagaacatga 720 

gcaggactct gagcrtagag aacaaatgtc tggctataag cgaatgaggc gacaacatca 780 

aaagcaactg atgactctgg aaaacaagct aaaggctgag atggatgaac atcgcctcag 840 

attagacaaa gatcttgaaa ctcagcgtaa caattttgct gcagaaatgg agaaacttat 900 

caagaaacac caggctgcca tggagaaaga ggctaaagtg atgtccaatg aagagaaaaa 960 

atttcagcaa catattcagg cccaacagaa gaaagaactg aatagttttc tcgagrccca 1020 

gaaaagagag tataaacttc gaaaagagca gcttaaagag gagctaaatg aaaaccagag 1080 

tacccccaaa aaagaaaaac aggagcggct ttcaaagcag aaggagaata tacagcattt 1140 

ccaagcagaa gaagaagcta accttcttcg acgtcaaaga caatacctag agctggaatg 1200 

ccgtcgctrc aagagaagaa tgttacttgg gcgtcataac ttagagcagg acctrgtcag 1260 

ggaggagtta aacaaaagac agactcagaa. ggacttagag catgccatgc tactccgaca 1320 

gcatgaatct atgcaagaac xiggagttccg ccacctcaac acaattcaga agatgcgctg 1380 

tgagttganc agattacagc arcaaactga gctcactaac cagcrggaat ataataagcg 1440 

aagagaacga gaactaagac gaaagcatgt catggaagtt cgacaacagc ctaagagttt 1500 

gaagtctaaa gaactccaaa taaaaaagca gtttcaggat acctgcaaaa tccaaaccag 1560 

acagtacaaa gcattaagaa atcacctgct ggagactaca ccaaagagtg agcacaaagc 1620 
tgttctgaaa cggctcaagg aggaacagac ccggaaatta gctatcttgg ctgagcagta * 1680 

tgatcacagc attaatgaaa tgctctccac acaagccctg cgtttggatg aagcacagga 1740 

aqcagagtgc caggttttga agatgcagct gcagcaggaa ctggagctgt tgaatgcgta 1800 

tcagagcaaa atcaagatgc aagctgaggc acaacatgat cgagagcttc gcgagcttga 1860 

acagagggtc tccctccgga gggcactctt agaacaaaag attgaagaag agatgttggc 1920 

tttgcagaat gagcgcacag aacgaatacg aagcctgttg gaacgrcaag ccagagagat 1980 

tgaagcrttt gactctgaaa gcatgagact aggttttagt aatatggtcc tttccaatct 2040 

ctcccctgag gcattcagcc acagctaccc gggagcttct ggttggtcac acaaccctac 2100 

tgggggtcca ggacctcacr ggggtcatcc catgggtggc ccaccacaag cttggggcca 2160 

tccaatgcaa ggtggacccc agccatgggg tcacccttca gggccaatgc aaggggtacc 2220 

iicgaggtagc agratgggag tccgcaatag cccccaggct ctgaggcgga cagcttctgg 2280 

gggacggacg gagcagggca tgagcagaag cacgagtgtc acttcacaaa tatccaatgg 2340 

gtcacacatg tcttatacat aacttaataa ttgagagtgg caattccgct ggagctgtct 2400 

gccaaaagaa actgcctaca gacatcatca cagcagcctc ctcacttggg tactacagtg 2460 

tggaagctga- gtgcatatgg tatattttat tcatttttgt aaagcgttct gttttgtgtt 2520 

tactaattgg gargtcatag tacttggctg ccgggtttgt ttgtttttgg ggaaattttg 2580 

aaaagtggag ttgatattaa aaataaatgt gtatgtgtgt acatatatat acacacacat 2640 

acacatatat targcatgtg gtgaaaagaa ttggctagat aggggatttt tctgaacact 2700 

gcaaaaatag aacgtagcaa aatggcttca gttatcactt ttgggtgtct gtatcctaag 2760 

aagtttctga aaagatctaa agccttttta tcccatatcc caaattccta tgagccactc 2820 

acagcaggca gcatat:gttg aaataagtta ttactggtac acacctgcat tgcctcacca 2880 

gtgtatttat ttgttattaa attgatctga cttctcagcc tcatttggac taaaaaaaga 2940 

aagcagaaat ccatgaacac attgcttctc ggccttttgg cnaagatcaa gtgtagaaat 3000 

ccatgaacac taaaggactt cattgatttt ttcagagagt agaaaacaac ttagtttttc 3060 

ttttttcctg aatgcgtcat aggcttgtga gtgattttrg tccattcaat tgtgccttct 3120 

ttgtattacg araagatggg ggtacttaag gagatcacaa gttgtgrgag gatcgcatta 3180 

acaaacccat gagccttcaa rggggaagac cagaagggtg agaggggccc tgaaagttca 3240 

tatggtgggt atgrcccgca gcagagtgag gagatgaagc ttacgtgtcc tgacgttttg 3300 

ttgcttatac tgtgatatct caxL-z<ztagcz aagctctata atgcccaaga ccccaaacag 3360 

tacttttact ttgtttgtac aaaaacaaag acatatagcc aatacaaatc aaargccgga 3420 

ggrgtttgat gccatatttg caaattgcca tctattgaaa trctcgtcac actacataga 3480 

cataattgtt atctcctttt ggcttatgtg atttcctget zacaagtaga atagccaatt 3540 



SNSDCCID: <WO 9953036A2_:_> 



wo 99/53036 



PCT/US99/08150 



39 



a.ttaaatgt ttagttgcca cagtgaacca ggagccactg agccaatgac tttaccagc^ 360C 

iTalrT^^^l cttcatcacc actgtagatt rtgctgcatg tgcaggtcct ctatttttaa 3660 

ttgctgt..t cg.tgctgca gractttaca aacttctagt tcgtrgagac ttagrgacca 37:>0 

t.tggcatca agttaacatc acacaatagg aaacaccact tccacaagtc tcaagcctca 3780 

grgctaaagt actactgaaa aggaactagg aagtttggcc aatt 



3824 



<210> 21 
<211> 2249 
<2i2> DNA 

<213> Mammalian (Murine) SUL(J3 
<400> 21 



gcaggatgcc atcaactaac agagcaggca gtctaaagga ccctgaaatt gcagagctc- 
tcttcaaaga agatccggaa aagctcttca cagatctcag agaaatcggc catgggagct 
trggagcagt atattttgca cgagatgrgc gractaatga agtggtggcc atcaagaaaa 
rgtctta-ag tggaaagcag tctactgaga aatggcagga tattattaag gaagtcaagt 
ttcracaaag aataaaacat cccaacagta tagaatacaa aggctgctat ttacgtgaac 
acacagcarg gcttgtaatg gaarattgtt taggatctgc ttcagattta ttagaagttc 
ataaaaagcc attacaagaa gtggaaatag cagcaattac acatggtgct ctccagggac 
tagcttattt acattctcat accatgatcc atagagatat caaagcagga aatatccttc 
tgacagaacc aggccaagtg aaacttgctg actttggatc tgcttccatg gcttcccctg 
ccaattcttt tgtgggaaca ccatattgga tggccccaga agtaatttta gccatggatg 
aaggacagta tgatggcaaa gttgatgtat ggtctcttgg aataacgtgt attgaattaq 
ccgagaggaa gcctccttta tttaatatga atgcaatgag tgccttatat cacatagccc 
aaaatgaatc ccctacacta caatctaata tgaatgattc ttgcctccag aaaatccctc 
aagatcgccc tacatcagag gaacttttaa agcacatgtt tgttcttcaa aagcgcccta 
aaacagtgtt aatagatctt attcaaagga caaaggatgc agtaagagag ctggacaatc 
tgcagtatcg aaagangaag aaactccttt tccaggaggc acataatggg ccagcggtag 
aagcacagga agaagaggag gagcaagatc atggtgttgg ccgaacagga acagtgaata 
gtgttggaag caatcagtct atccctagta tgtctatcag tgccagcagt caaagcagca 
gtgttaatag tcttccagat gcatcagarg acaagagrga gctagacatg atggagggag 
accatacagt gatgtctaac agttctgtca tccacttaaa acctgaggag gaaaattacc 
aggaagaagg agatcctaga acaagagcat cagacccaca grctccccct caggtqtctc 
gtcacaagtc acattatcgt aatagagaac actttgcaac catacgaaca gcatcactgg 
ttacaagaca gatgcaagaa catgagcagg actctgaact tagagaacag atgtctggtt 
ataagcggat gaggcgacag catcaaaagc agctgatgac gctggaaaat aaactgaagq 
cagagatgga cgaacatcgg ctcagattag acaaagatct tgaaactcag cgtaacaatt 
tcgctgcaga aatggagaaa cttattaaga aacaccaagc tgctatggaa aaagaggcta 
aagtgarggc caatgaggag aaaaaattcc agcaacacat tcaggctcaa cagaaaaaag 
aactgaatag ctttttggag tctcaaaaaa gagaatataa acttcgcaaa gagcagctta 
aggaggagct gaatgaaaac cagagcacac ctaaaaaaga aaagcaggaa tggctttcaa 
agcagaagga gaatatacag cattttcagg cagaagaaga agctaatctt cttcgacgtc 
aaaggcagta tctagagcta gaatgrcgtc gct.caaaag aagaatgtta cttgagcgac " 
ataacttgga acaggacctt gtcagggagg agttaaacaa aaggcagact caaaaggact 
tggaacatgc aatgctartg cgacagcatg aatcaatgca agaactggag tttcgccatc 
tcaacactat tcagaagatg cgctgtgagt tgatcagact gcagcatcaa actgagctca 
ctaaccagct agagtacaat aagagaaggg aacgggaact gaggcgaaaa catgrcatgg 
aagttcgaca acaacctaag agtccgaagt ctaaagaact ccaaataaaa aagcagtttc 
aggatacctg caaaattcaa accagacagt acaaagcatt aaggaatcac ctactggaga 
ctacaccaaa gaatgagcac aaagcaatc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2249 
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<212> PRT 

<213> Mammalian (Human) SULUl 
<400> 22 

Met Arg Lys Gly Val Leu Lys Asp Pro Glu lie Asp Asp Leu Phe Tyr 
15 10 15 

Lys Asp Asp Pro Glu Glu Leu Phe lie Gly Leu His Glu lie Gly His 
20 25 30 

Gly Ser Phe Gly Ala Val Tyr Phe Ala Thr Asn Ala His Thr Asn Glu 
35 40 ' 45 

Val Val Ala lie Lys Lys Met Ser Tyr Ser Gly Lys Gin Thr His Glu 
50 55 60 

Lys Trp Gin Asp lie Leu Lys Glu Val Lys Phe Leu Arg Gin Leu Lys 

65- ■ • - . . 70 ■ 75 80- 

His Pro Asn Thr lie Glu Tyr Lys Gly Cys Tyr Leu Lys Glu His Thr 
85 90 95 

Ala Trp Leu Val Met Glu Tyr Cys Leu Gly Ser Ala Ser Asp Leu Leu 
100 105 110 

Glu Val His Lys Lys Pro Leu Gin Glu Val Glu lie Ala Ala lie Thr 
115 120 125 

His Gly Ala Leu His Gly Leu Ala Tyr Leu His Ser His Ala Leu lie 
130 135 . 140 

His Arg Asp lie Lys Ala Gly Asn lie Leu Leu Thr Glu Pro Gly Gin 
145 150 155 160 

Val Lys Leu Ala Asp Phe Gly Ser Ala Ser Met Ala Ser Pro Ala Asn 
165 170 175 

Ser Phe Val Gly Thr Pro Tyr Trp Met Ala Pro Glu Val He Leu Ala 
180 185 190 

Met Asp Glu Gly Gin Tyr Asp Gly Lys Val Asp He Trp Ser Leu Gly 
195 200 205 

He Thr Cys lie Glu Leu Ala Glu Arg Lys Pro Pro Leu Phe Asn Met 
210 215 220 

Asn Ala Met Ser Ala Leu Tyr His lie Ala Gin Asn Asp Ser Pro Thr 
225 230 235 ^ 240 

Leu Gin Ser Asn Glu Trp Thr Asp Ser Phe Arg Arg Phe Val Asp Tyr 
245 250 255 

Cys Leu Gin Lys He Pro Gin Glu Arg Pro Thr Ser Ala Glu Leu Leu 
260 265 270 
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Arg His Asp Phe Val Arg Arg Asp Arg Pro Leu Arg Val Leu He Asp 

280 285 

Leu lie Gin Arg Thr Lys Asp Ala Val Arg Glu Leu Aso Asn Leu Gin 
290 295 " 

Tyr Arg Lys Met Lys Lys He Leu Phe Gin Glu Thr Arg Asn Gly Pro 

315 320 

Leu Asn Glu Ser Gin Glu Asp Glu Glu Asp Ser Glu His Gly Th^ Ser 

330 33i 

Lsu Asn Arg Glu Met Asp Ser Leu Gly Ser Asn His Ser He ^ro Se- 
340 345 ' 

Met Ser Val Ser Thr Gly Ser Gin Ser Ser Ser Val Asn Ser Me^ Gin 
35o 360 365 

Glu Val Met Asp Glu S.r Ser Ser Glu Leu- Val Met Met His Asp Asp 



380 



Glu Ser Thr He Asn Ser Ser Ser Ser Val Val His Lys Lys Asp His 



Val Phe Thr Arg Asp Glu Ala Gly His Gly Asp Pro Arg Pro 



405 410 



Glu Pro 
415 



Arg Pro Thr Gin Ser Val Gin Ser Gin Ala Leu His Tyr Arg Asn Arg 
420 425 



Glu Arg Phe Ala Thr lie Lys Ser Ala Ser Leu Val Thr 



435 



430 



Arg Gin lie 



445 



His Glu His Glu Gin Glu Asn Glu Leu Arg Glu Gin Met Ser Gly Tyr 

455 450 

Lys Arg Met Arg Arg Gin His Gin Lys Gin Leu He Ala Leu Glu Asn 

475 480 



Lys Leu Lys Ala Glu Met Asp Glu His Arg Leu Lys Leu Gin Lvs Glu 
485 490 

val Glu Thr His Ala Asn Asn Ser Ser He Glu Leu Glu Lvs Leu Ala 
500 505 5^0 

Lys Lys Gin Val Ala He He Glu Lys Glu Ala Lys Val Ala Ala Ala 



520 



525 



Asp Glu Lys Lys Phe. Gin Gin Gin lie Leu Ala Gin Gin Lys Lys Asp 

535 540 

Leu Thr Thr Pne Leu Glu Ser Gin Lys Lys Gin Tyr Lys He Cys Lys 

"° 555 5^0 

Glu Lys He Lys Glu Glu Met Asn Glu Asp His Ser Thr Pro Lys Lys 
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565 570 575 

Glu Lys Gin Glu Arg lie Ser Lys His Lys Giu Asn Leu Gin His Thr 
580 585 590 

Gin Aia Giu Giu Giu Ala His Leu Leu Thr Gin Gin Arg Leu Tyr Tyr 
595 600 605 

Asp Lys Asn Cys Arg Phe Phe Lys Arg Lys lie Me:: lie Lys Arg His 
610 ^ 615 ^ 620 

Glu Vai Giu Gin Gin Asn lie Arg Giu Glu Leu Asn Lys Lys Arg Thr 
625 630 635 640 

Gin Lys Glu Met Glu His Aia Met Leu lie Arg His Asp Glu Ser Thr 
645 650 655 

Arg Giu Leu Giu Tyr Arg Gin Leu His Thr Leu Gin Lys Leu Arg Met 
660 . 665 670 

Asp Leu lie Arg Leu Gin His Gin Thr Giu Leu Giu Asn Gin Leu Giu 
675 680 685 

Tyr Asn Lys Arg Arg Glu Arg Glu Leu His Arg Lys His Vai Met Gly 
690 695 700 

Leu Arg Gin Gin Pro Lys Asn Leu Lys Aia Met Giu Met Gin lie Lys 
705 . 710 715 720 

Lys Gin Phe Gin Asp Thr Cys Lys Vai Gin Thr Lys Gin Tyr Lys Ala 
725 730 735 

Leu Lys Asn His Gin Leu Giu Vai Thr Pro Lys Asn Glu His Lys Thr 
740 745 750 

lie Leu Lys Thr Leu Lys Asp Giu Gin Thr Arg Lys Leu Ala lie Leu 
755 760 765 

Ala Giu Gin Tyr Glu Gin Ser lie Asn Giu Met Met Aia Ser Gin Ala 
770 775 780 



Leu Arg Leu Asp Glu Ala Gin Giu Ala Giu Cys Gin Ala Leu Arg Leu 

•785 790 795 800 

Gin Leu Gin Gin Glu Met Giu Leu Leu Asn Aia Tyr Gin Ser Lys lie 
805 810 815 

Lys Met Gin Thr Glu Ala Gin His Giu Arg Giu Leu Gin Lys Leu Glu 

820 825 830 

Gin Arg Vai Ser Leu Arg Arg Aia His Leu Glu Gin Lys lie Giu Giu 

835 840 845 

Giu Leu Ala Aia Leu Gin Lys Glu A.rg Ser Glu Arg lie Lys Asn Leu 

850 855 860 
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Leu Glu Arg Gin Glu Arg Giu He Glu Thr Phe Asp Met Glu Se- Leu 

870 875 880 

Arg Met Gly Phe Gly Asn Leu Val Thr Leu Asp Phe Pro Lys Glu Asp 
885 890 895 

Tyr Arg 



<210> 23 
<211> 786 
<212> PRT 

<213> Manunalian (Human) SULU3 
<400> 23 

He Glu Leu Ala Glu Arg Lys Pro Pro Leu Phe Asn Met Asn Ala Met 
1 5 10 15 

Ser Ala Leu Tyr His He Ala Gin Asn Glu Ser Pro Thr Leu Gin Ser 
20 25 30 

Asn Glu Trp Ser Asp Tyr Phe Arg Asn Phe Val Asp Ser Cys Leu Gin 
35 40 45 

Lys He Pro Gin Asp Arg Pro Thr Ser Glu Glu Leu Leu Lys His He 
50 55 60 

Phe Val Leu Arg Glu Arg Pro Glu Thr Val Leu He Asp Le He Gin 

70 75 80 

Arg Thr Lys Asp Ala Val Arg Glu Leu Asp Asn Leu Gin Tyr • - Lvs 
85 90 

Met Lys Lys Leu Leu Phe Gin Glu Ala His Asn Gly Pro Ala Glu 
100 105 no 

Ala Gin Glu Glu Glu Glu Glu Gin Asp His Gly Val Gly Arg T= niy 
115 120 125 

Thr Val Asn Ser Val Gly Ser Asn Gin Ser He Pro Ser Met Ser He 
130 135 



Ser Ala Ser Ser Gin Ser Ser Ser Val Asn Ser 



145 150 



:.eu Pro Asp Val Se 



r 



155 160 



Asp Asp Lys Ser Glu Leu Asp Met Met Glu Gly Asp His Thr Val Met 

165 170 175 

Ser Asn Ser Ser Val He His Leu Lys Pro Glu Glu Glu Asn Tyr Arq 

180 185 190 
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Glu Glu Gly Asp Pro Arg Thr Arg Ala Ser Asp Pro Gin Ser Pro Pro 
195 200 205 

Gin Val Ser Arg His Lys Ser His Tyr Arg Asn Arg Glu His Phe Ala 
210 215 220 



Thr lie Arg Thr Ala Ser Leu Val 
225 230 

Gin Asp Ser Glu Leu Arg Glu Gin 
245 

Arg Gin His Gin Lys Gin Leu Met 
260 



Thr Arg Gin Met Gin Glu His Glu 

235 240 

Met Ser Gly Tyr Lys Arg Met Arg 

250 255 

Thr Leu Glu Asn Lys Leu Lys Ala 

265 270 



Glu Met Asp Glu His 
275 

Arg Asn Asn- Phe Ala 
290 

Ala Ala Met Glu Lys 
305 

Phe Gin Gin His He 
325 

Leu Glu Ser Gin Lys 
340 

Giu Glu Leu Asn Glu 
355 



Arg Leu Arg Leu Asp Lys 
280 

Al^^ Glu Met. Glu Lys Leu 
295 

Glu Ala Lys Val Met Ser 
310 315 

Gin Ala Gin Gin Lys Lys 
330 

Arg Glu Tyr Lys Leu Arg 
345 

Asn Gin Ser Thr Pro Lys 
360 



Asp Leu Giu Thr Gin 
285 

lie Lys Lys His Gin 
300 

Asn Giu Glu Lys Lys 
320 

Glu Leu Asn Ser Phe 
335 

Lys Glu Gin Leu Lys 
350 

Lys Giu Lys Gin Giu 
365 



Trp Leu Ser Lys Gin 
370 

Glu Ala Asn Leu Leu 
385 

Arg Arg Phe Lys Arg 
405 



Lys Giu Asn lie Gin His 
375 

Arg Arg Gin Arg Gin Tyr 

390 395 

Arg Met Leu Leu Gly Arg 
410 



Phe Gin Ala Giu Glu 
380 

Leu Glu Leu Giu Cys 
400 

His Asn Leu Glu Gin 
415 



Asp Leu Val Arg Giu Giu Leu A.sn 
420 

Giu His Aia Met Leu Leu Arg Gin 

435 440 

Phe Arg His Leu Asn Thr lie Gin 

450 455 

Leu Gin His Gin Thr Giu Leu Thr 

465 470 

Arg Giu Arg Giu Leu A.rg Arg Lys 



Lys Arg Gin Thr Gin Lys Asp Leu 
425 430 

His Giu Ser Met Gin Giu Leu Giu 
445 

Lys Met Arg Cys Giu Leu lie Arg 
4 60 

Asn Gin Leu Giu Tyr Asn Lys Arg 

475 480 

His Val Met Giu Val Arg Gin Gin 
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485 



495 



Pro Lys Ser Leu Lys Ser Lys Glu Leu Gin He Lys Lys Gin Phe Gin 
500 505 510 

Asp Thr Cys Lys He Gin Thr Arg Gin Tyr Lys Ala Leu Arg Asn His 
515 520 525 

Leu Leu Glu Thr Thr Pro Lys Ser Glu His Lys Ala Val Leu Lys Arg 

535 54 0 

Leu Lys Glu Glu Gin Thr Arg Lys Leu Ala lie Leu Ala Glu Gin Tyr 



555 



560 



Asp His Ser He Asn Glu Met Leu Ser Thr Gin Ala Leu Arg Leu Asp 
565 570 

Glu Ala Gin Glu Ala Glu Cys Gin Val Leu Lys Met Gin Leu Gin Gin 
580 .585 590 

Glu Leu Glu Leu Leu Asn Ala Tyr Gin Ser Lys He Lys Met Gin Ala 
595 600 605 



615 



620 



Glu Ala Gin His Asp Arg Glu Leu Arg Glu Leu Glu Gin Arg Val Ser 

Leu Arg Arg Ala Leu Leu Glu Gin Lys He Glu Glu Glu Met Leu Ala 

630 635 640 

Leu Gin Asn Glu Arg Thr Glu Arg He Arg Ser Leu Leu Glu Arg Gin 



645 



650 



655 



Ala Arg Glu He Glu Ala Phe Asp Ser Glu Ser Met Arg Leu Gly Phe 
660 665 670- 



Ser Asn Met Val Leu Ser Asn Leu Ser Pro Glu Ala Phe Ser His Ser 

685 



675 680 



Tyr Pro Gly Ala Ser Gly Trp Ser His Asn Pro Thr Gly Gly Pro Gly 

695 700 



Pro His Trp Gly His Pro Met Gly Gly Pro Pro Gin Ala Trp Gly His 

710 715 P ^ 

Pro Met Gin Gly Gly Pro Gin Pro Trp Gly His Pro Ser Gly Pro Met 

730 735 

Gin Gly Val Pro Arg Gly Ser Ser Met Gly Val Arg Asn Ser Pro Gin 
740 745 750 

Ala Leu Arg Arg Thr Ala Ser Gly Gly Arg Thr Glu Gin Gly M»t Ser 
7^= 760 765 

Arg Ser Thr Ser Val Thr Ser Gin He Ser Asn Giy Ser His M=t Ser 
° 775 780 
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Tyr Thr 
785 



<210> 24 
<211> 748 
<212> PRT 

<213> Mammalian (Murine) SULU3 
<400> 24 

Met Pro Ser Thr Asn Arg Ala Gly Ser Leu Lys Asp Pro Glu lie Ala 
15 10 15 

Glu Leu PhePhe Lys Glu Asp Pro -Glu Lys Leu Phe Thr Asp Leu Arg 
20 25 30 

Glu lie Gly His Gly Ser Phe Gly Ala Val Tyr Phe Ala Arg Asp Val 
35 40 45 

Arg Thr Asn Glu Val Val Ala lie Lys Lys Met Ser Tyr Ser Gly Lys 
50 55 60 

Gin Ser Thr Glu Lys Trp Gin Asp lie lie Lys Glu Val Lys Phe Leu 
65 70 75 80 

Gin Arg lie Lys His Pro Asn Ser lie Glu Tyr Lys Gly Cys Tyr Leu 
85 90 95 

Arg Glu His Thr Ala Trp Leu Val Met Glu Tyr Cys Leu Gly Ser Ala 
100 105 110 

Ser Asp Leu Leu Glu Val His Lys Lys Pro Leu Gin Glu Val Glu lie 
115 120 125 

Ala Ala lie Thr His Gly Ala Leu Gin Gly Leu Ala Tyr Leu His Ser 
130 135 140 

His Thr Met lie His Arg Asp lie Lys Ala Gly Asn lie Leu Leu Thr 
145 150 155 160 

Glu Pro Gly Gin Val Lys Leu Ala Asp Phe Gly Ser Ala Ser Met 'Ala 
165 170 175 

Ser Pro Ala Asn Ser Phe Val Gly Thr Pro Tyr Trp Met Ala Pro Glu 
180 185 190 

Val He Leu Ala Met Asp Glu Gly Gin Tyr Asp Gly Lys Val Asp Val 
195 200 " 205 

Trp Ser Leu Gly He Thr Cys lie Glu Leu Ala Glu Arg Lys Pro Pro 
210 215 220 
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Leu Phe Asn Met Asn Ala Met Ser Ala Leu Tyr His He Ala Gin Asn 

230 235 240 

Glu ser Pro Thr Leu Gin Ser Asn Met Asn Asp Ser Cys Leu Gin Lys 



245 250 



255 



lie Pro Gin Asp Arg Pro Thr Ser Glu Glu Leu Leu Lys His Met Phe 
260 265 270 

Val Leu Arg Glu Arg Pro Glu Thr Val Leu He Asd Leu He Gin A-a 
275 280 ■ 285 ' 

Thr Lys Asp Ala Val Arg Glu Leu Asp Asn Leu Gin Tyr Arg Lys Met 

295 300 

Lys Lys Leu Leu Phe Gin Glu Ala His Asn Gly Pro Ala Val Glu Ala 

310 

Gin Glu Glu Glu Glu Glu Gin Asp His Gly Val Gly Arg Thr Gly Th^ 
325 330 

Val Asn Ser Val Gly Ser Asn Gin Ser He Pro Ser Met Ser He Ser 

350 



340 



Ala Ser Ser Gin Ser Ser Ser Val Asn Ser Leu Pro Asp Ala Ser Asp 
355 360 365 

Asp Lys Ser Glu Leu Asp Met Met Glu Gly Asp His Thr Val Me^ Ser 

375 380 

Asn Ser Ser Val He His Leu Lys Pro Glu Glu Glu Asn Tyr Gin Glu 

390 395 400 

Giu Gly Asp Pro Arg Thr Arg Ala Ser Asp Pro Gin Ser Pro Pro Gin 
005 410 415 

Val Ser Arg His Lys Ser His Tyr Arg Asn Arg Giu His Phe Ala Thr 
420 425 430 

He Arg Thr Ala Ser Leu Val Thr Arg Gin Met Gin Glu His Glu Gin 



435 440 



445 



Asp Ser Glu Leu Arg Glu Gin Met Ser Gly Tyr Lys Arg Met Arg Arg 



460 



Gin His Gin Lys Gin Leu Met Thr Leu Glu Asn Lys Leu Lvs Ala Glu 
' 470 475 - 480 

Met Asp Glu H.s Arg Leu Arg Leu Asp Lys Asp Leu Glu Thr Gin Arg 



485 490 



495 



Asn Asn Phe Ala Ala Glu Met Glu Lys Leu He Lys Lys Kis Gin Ala 
500 505 510 

Ala Met Glu Lys Giu Ala Lys Val Met Ala Asn Glu Glu Lys Lys Phe 
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515 



520 



525 



Gin Gin His lie Gin Ala Gin Gin Lys Lys Glu Leu Asn Ser Phe Leu 

530 535 540 

Glu Ser Gin Lys Arg Giu Tyr Lys Leu Arg Lys Glu Gin Leu Lys Glu 

545 550 555 560 

Giu Leu Asn Giu Asn Gin Ser Thr Pro Lys Lys Giu Lys Gin Giu Trp 

565 570 ' 575 

Leu Ser Lys Gin Lys Giu Asn lie Gin His Phe Gin Aia Glu. Giu Giu 

580 585 590 

Aia Asn Leu Leu Arg Arg Gin Arg Gin Tyr Leu Glu Leu Giu Cys Arg 

595 600 605 



Arg .Phe Lys -Arg Arg Met Leu Leu. Giy Arg His Asn Leu Giu Gin Asp 
610 615 620 

Leu Vai Arg Giu Giu Leu Asn Lys Arg Gin Thr Gin Lys Asp Leu Giu 
625 630 635 ' 640 

His Ala Met Leu Leu Arg Gin His Glu Ser Met Gin Glu Leu Giu Phe 
645 650 655 

Arg His Leu Asn Thr lie Gin Lys Met Arg Cys Glu Leu lie Arg Leu 
660 665 670 

Gin His Gin Thr Glu Leu Thr Asn Gin Leu Giu Tyr Asn Lys Arg Arg 
675 680 685 

Glu Arg Glu Leu Arg. Arg Lys His Vai Met Glu Vai Arg Gin Gin Pro 
690 695 700 

Lys Ser Leu Lys Ser Lys Giu Leu Gin lie Lys Lys Gin Phe Gin Asp 
705 .710 715 720 

Thr Cys Lys lie Gin Thr Arg Gin Tyr Lys Aia Leu Arg Asn His Leu 
725 730 735 



Leu Giu Thr Thr Pro Lys Asn Giu His Lys Aia lie 
740 745 . 



<210> 25 
<211> 2795 
<212> DNA 

<213> Mammalian (Human) GEK2 
. <400> 25 

cgaagccaca gcccgagccc gagcccgagc ccgagccggc gccaccgcgc ccccggccat 60 
ggcttttgcc aatttccgcc gcaccctgcg cctgtctacc ttcgagaaga -gaaagrcccg 120 
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<210> 26 
<211> 912 
<212> PRT 

<213> Mammalian (Hioman) GEK2 
<400> 26 

Met Ala Phe Ala Asn Phe Arg Arg He Leu Arg Leu Ser Thr Phe Glu 



360 
420 
480 
540 
600 
660 



cgaatatgag cacgtccgcc gcgacctgga ccccaacgag g^gtgggaga tcgtgggcga 180 

gctgggcgac ggcgccttcg gcaaggttta caaggccaag aacaaggaga caggtgcttt 240 

ggctgcggcc aaagtcattg aaaccaagag tgaggaggag ctggaggact acatcgtgga 300 
gattgagatc ctggccacct gcgaccaccc ctacattgtg aagctcctgg gagcctac^a 
tcacgacggg aagctgtgga tcatgartga gtrctgtcca gggggagccg tggacgccar 
catgctggag ctggacagag gcctcacgga gccccagata caggtggttt gccaccagat 
gctagaagcc ctcaacttcc tgcacagcaa gaggatcatc caccgagatc tgaaagctgg 
caacgtgctg atgaccctcg agggagacat caggctggct gactttggtg tgtctgccaa 

gaatctgaag actctacaga aacgagattc cttcatcggc acgccttact ggatggcccc oou 

cgaggtggtc atgtgtgaga ccacgaaaga cacgccctac gactacaaag ccgacatctg 720 

gtccctgggc atcacgctga ttgagatggc ccagatcgag ccgccacacc acgagctcaa 780 

ccccatgcgg gtcctgctaa agatcgccaa grcggaccct cccacgctgc tcacgccctc 840 

caagtggrct gtagagttcc gtaacttcct gaagatagcc ctggataaga acccagaaac 900 

ccgacccagt gccgcgcagc tgctggagca tcccttcgtc agcagcatca ccagtaacaa 960 

ggctctgcgg gagctggrgg ctgaggccda ggccgaggtg atggaagaga tcaaagacgg 1020 

ccgggatgag ggggaagagg aggaogccgt ggargccgcc tccaccctgg agaaccatac 1080 

tcagaactcc tcrgaggtga gtccgccaag cctcaatgct gacaagcctc tcgaggagtc 1140 

accttccacc ccgctggcac ccagccagtc tcaggacagt gtgaatgagc cctgcagcca 1200 

gccctcrggg gacagatccc tccaaaccac cagtccccca gtcgtggccc ctggaaatga 1260 

gaacggcctg gcagrgcctg tgcccctgcg gaagrcccga cccgtgtcaa tggatgccag 1320 

aattcaggra gcccaggaga agcaagttgc tgagcagggt ggggacctca gcccagcagc 1380 

caacagatct caaaaggcca gccagagccg gcccaacagc agcgccctgg agaccctggg 1440 

tggggagaag ctggccaatg gcagcctgga gccacctgcc caggcagctc cagggccttc 1500 

caagagggac tcggactgca gcagcctctg cacctotgag agcatggact atggtaccaa 1560 

tctctccact gacctgtcgc tgaacaaaga gatgggctct ctgtccatca aggacccgaa 1620 

actgtacaaa aaaaccctca agcggacacg caaatttgtg gtggatggtg tggaggtgag 1680 

caucaccacc tccaagatca tcagcgaaga tgagaagaag gatgaggaga tgagatttct 1740 

caggcgccag gaactccgag agcrtcggct gctccagaaa gaagagcatc ggaaccagac 1800 

ccagctgagt aacaagcatg agctgcagct ggagcaaatg cataaacgtt ttgaacagga I860 
aatcaacgcc aagaagaagt tctttgacac ggaa-tagag aacctggagc gtcagcaaaa 



gcagcaagtg gagaagatgg agcaagacca tgccacgcgc cgccgggagg agaccaggcg 1980 

gatccgcctg gagcaggatc gggactacac caggttccaa gagcagctca aactgatgaa 2040 

gaaagaggtg aagaacgagg tggagaagct cccccgacag cagcggaagg aaagcatgaa 2100 

gcagaagatg gaggagcaca cgcagaaaaa gcagcttctt gaccgggact ttgtagccaa 2160 

gcagaaggag gacctggagc tggccatgaa gaggctcacc accgacaaca ggcgagagat 2220 

ctgtgacaag gagcgcgagt gcctcatgaa gaagcaggag ctccttcgag accgggaagc 2280 

agccctgtgg gagatggaag agcaccagct gcaggagagg caccagcrgg tgaagcagca 2340 

gctcaaagac cagtacttcc tccagcggca cgagctgctg cgcaagcatg agaaggagcg 2400 

ggagcagatg cagcgctaca accagcgcat gatagagcag ctgaaggtgc ggcagcaaca 2460 

ggaaaaggcg cggctgccca agatccagag gagtgagggc aagacgcgca tggccatgta 2520 

aclTttttT ^'^^'^^'^^^^^ acggcggggg cagcgcagct gagcagcgtg agaagatcaa 2580 

gcagttctcc cagcaggagg agaagaggca gaagtcggag cggctgcagc aacagcagaa 2640 

acacgagaac cagatgcggg acatgctggc gcagtgcgag agcaacatga gcgagctgca 2700 

gcagctgcag aatgaaaagt gccacctccc ggragagcac gaaacccaga aactgaaggc 2760 
cctggatgag agccataacc agaacctgaa ggaat 



2795 
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1 ' 5 • 10 15 . 

Lys Arg Lys Ser Arg Glu Tyr Giu His Val Arg Arg Asp Leu Asp Pro 
20 25 30 

Asn Giu Vai Trp Giu lie Vai Gly Giu Leu Giy Asp Gly Aia Phe Giy 
35 . 40 45 

Lys Vai Tyr Lys Ala Lys Asn Lys Giu Thr Giy Aia Leu Aia Aia Aia 
50 55 60 

Lys Vai lie Giu Thr Lys Ser Giu Giu Giu Leu Giu Asp Tyr lie Val 
65 70 75 ^ 80 

Glu lie Giu lie Leu Aia Thr Cys Asp His Pro Tyr lie Vai Lys Leu 
85 90 95 

Leu Giy Ala Tyr Tyr His Asp Gly Lys Leu Trp lie Met lie Giu Phe 
•100 . 105 ^ 110 

Cys Pro Giy Giy Aia Vai Asp Aia lie Met Leu Giu Leu Asp Arg Giy 
115 120 125 

Leu Thr Glu Pro Gin lie Gin Val Vai Cys Arg Gin Met Leu Giu Aia 
130 135 140 

Leu Asn Phe Leu His Ser Lys Arg lie lie His Arg Asp Leu Lys Ala 
145 150 155 160 

Giy Asn Vai Leu Met Thr Leu Glu Gly Asp lie Arg Leu Aia Asp Phe 
165 170 175 

Giy Vai Ser Aia Lys Asn Leu Lys Thr Leu Gin Lys Arg Asp Ser Phe 
180 185 190 

lie Giy Thr Pro Tyr Trp Met Aia Pro Giu Vai Vai Met Cys Giu Thr 
195 200 205 

Met Lys Asp Thr Pro Tyr Asp Tyr Lys Aia Asp lie Trp Ser Leu Giy 
210 215 220 . • ' 

lie Thr Leu lie Giu Met Ala Gin lie Giu Pro Pro His His Giu Leu 
225 230 235 240 

Asn Pro Met Arg Vai Leu Leu Lys lie Aia Lys Ser Asp Pro Pro Thr 
245 250 255 

Leu Leu Thr Pro Ser Lys Trp Ser Vai Glu Phe Arg Asp Phe Leu Lys 
260 265 210 

lie Aia Leu Asp Lys Asn Pro Giu Thr Arg Pro Ser Aia Aia Gin Leu 
275 280 285 



Leu Giu His Pro Phe Vai Ser Ser lie Thr Ser Asn Lvs Aia Leu Arg 
290 295 300 
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Giu Leu Val Ala Glu Ala Lys 
305 

320 



Ala Lys Ala Glu Val Met Glu Glu He Glu Asp 

315 

Gly Arg Asp Glu Gly Glu Glu Glu Asp Ala Val Asp Ala Ala Ser Thr 

330 

Leu Glu Asn H.s Thr Gin Asn Ser Ser Glu Val Ser Pro Pro Ser Leu 



345 



350 



Asn Ala Asp Lys Pre Leu Glu Glu Ser Pro Ser Thr Pro Leu Ala Pro 



360 



365 



Ser Gin Ser Gin Asp Ser Val Asn Glu 



370 



375 



Pro Cys Ser Gin Pro Ser Gly 



380 



ASP Ar, ser Leu Gin Thr Thr Ser Pro Pro Val Val Ala Pro Gly Asn 

395 400 

Glu Asn Gly. Leu Ala Val Pro Val . Pro Leu Arg Lys Ser Arg Pro Val 

410 

Ser Met Asp Ala Arg He Gin Val Ala Gin Glu Lys Gin Val Ala Glu 

Gin Gly Gly Asp Leu Ser Pro Al 



^25 430 



435 



440 



a Ala Asn Arg Ser Gin Lys Ala Ser 



445 



Cln ser Arg Pro Asn Ser Ser Ala Leu Glu Thr Leu Gly Gly Glu Lys 



4 60 



Leu Ala Asn Gly Ser Leu Glu 



Pro Pro Ala Gin Ala Ala 



465 ' ^ " Ala Pro Gly Pro 



475 



480 



Ser Lys Arg Asp Ser Asp Cys Ser Ser Leu 



485 



4 90 



Cys Thr Ser Glu Ser Met 



ASP Tyr Gly Thr Asn Leu Ser Thr Asp Leu Ser Leu Asn Lys Glu Met 

505 520 

Cly ser Leu Ser He Lys Asp Pro Lys Leu Tyr Lys Lys Thr Leu Lys 

520 



Arg Thr Arg Lys Phe Val Val Asp Gly Val Glu Val Ser He Thr Thr 

540 

ser Lys He He Ser Glu Asp Glu Lys Lys Asp Glu Glu Met Arg Phe 

560 

Leu Arg Arg Gin Glu Leu Arg Glu Leu Arg Leu Leu Gin Lys Glu Glu 

.^"■^ 570 
His Arg Asn Gin Thr Gin Leu Ser Asn Lys His Glu Leu Gin Leu Glu 

Gin Met Hjs Lys Arg Phe Glu Gin Glu He Asn Ala Lys Lys' Lys Phe 



600 



605 
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Phe Asp Thr Glu Leu Giu Asn Leu Glu Arg Gin Gin Lys Gin Gin Vai 
610 615 620 

Glu Lys Met Glu Gin Asp His Ala Vai Arg Arg Arg Glu Giu Ala Arg 
625 630 635 640 

Arg lie Arg Leu Giu Gin Asp Arg Asp Tyr Thr Arg Phe Gin Giu Gin 
645 650 655 

Leu Lys Leu Met Lys Lys Glu Vai Lys Asn Glu Vai Glu Lys Leu Pro 
660 665 670 

Arg Gin Gin Arg Lys Giu Ser Met Lys Gin Lys Met Glu Glu His Thr 
675 680 685 

Gin Lys Lys Gin Leu Leu Asp Arg Asp Phe Vai Ala Lys Gin Lys Glu 
690 . 695 700 

Asp Leu Giu Leu Ala Met Lys Arg Leu Thr Thr Asp Asn Arg Arg Glu 
705 710 715 720 

lie Cys Asp Lys Glu Arg Glu Cys Leu Met Lys Lys Gin Giu Leu Leu 
725 730 735 

Arg Asp Arg Giu Ala Ala Leu Trp Glu Met Glu Giu His Gin Leu Gin 
740 745 750 

Giu Arg His Gin Leu Vai Lys Gin Gin Leu Lys Asp Gin Tyr Phe Leu 
755 760 765 

Gin Arg His Giu Leu Leu Arg Lys His Glu Lys Glu Arg Giu Gin Met 
770 775 780 

Gin Arg Tyr Asn Gin Arg Met lie Glu Gin Leu Lys Vai Arg Gin Gin 
785 790 795 800 

Gin Giu Lys Ala Arg Leu Pro Lys lie Gin Arg Ser Glu Gly Lys Thr* 
805 810 815 

Arg Met Ala Met Tyr Lys Lys Ser Leu His lie Asn Gly Gly Gly Ser 
. . 820 825 830 

Ala Ala Giu Gin Arg Giu Lys lie .Lys Gin Phe Ser Gin Gin Giu Glu 
835 840 845 

Lys Arg Gin Lys Ser Giu Arg Leu Gin Gin Gin Gin Lys His Giu Asn 
850 855 860 

Gin Met Arg Asp Met Leu Ala Gin Cys Giu Ser Asn Met Ser Giu Leu 
865 870 875 880 



Gin Gin Leu Gin Asn Giu Lys Cys His Leu Leu VaJ 
885 890 



Giu His Giu Thr 
895 
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Gin Lys Leu Lys Aia Leu Asp Glu Ser His Asn Gin Asn Leu Lys Glu 



<210> 27 
<211> 3604 
<212> DNA 

<213> Mammalian (Human) 'PAK4 
<400> 27 

cgttcctggg cttcccgcrc cgcaggcctg cggaggaccg gcccagcaag gtcccaggtc 60 

ttccctctcc ttagcgccta agagagaggc ccagtgcggg tgaggagtcg cgaggaagag 120 

gcggaaggcg ccggaaggca ccatgttccg caagaaaaag aagaaacgcc ctgagatctc 180 

agcgccacag aacttccagc accgtgtcca cacctccttc gaccccaaag aaggcaagtt 240 

tgtgggcctc cccccacaat ggcagaacat cctggacaca ctgcggcgcc ccaagcccgt 300 

ggtggaccct tcgcgaatca cacgggtgca gctccagccc argaagacag tggtgcgggg 360 

cagcgcgatg cctgrggatg gctacatctc ggggctgctc aacgacatcc agaagttgtc 420 

agtcatcagc tccaacaccc tgcgtggccg cagccccacc agccggcggc gggcacagtc 480 

cctggggctg ctgggggang agcactgggc caccgaccca gacatgtacc tccagagccc 540 

ccagtctgag cgcactgacc cccacggcct ctacctcagc tgcaacgggg gcacaccagc 600 

aggccacaag cagatgccgt ggcccgagcc acagagccca cgggtcctgc ccaatgggct 660 

ggctgcaaag gcacagtccc tgggccccgc cgagtttcag ggtgcctcgc agcgctgtct 720 

gcagctgggt gcctgcctgc agagctcccc accaggagcc tcgcccccca cgggcaccaa 780 

taggcatgga argaaggctg ccaagcatgg ctctgaggag gcccggccac agtcctgcct 840 

ggtgggctca gccacaggca ggccaggtgg ggaaggcagc cctagcccta agacccggga 900 

gagcagcctg aagcgcaggc tattccgaag catgttcctg tccactgctg ccacagcccc 960 

rccaagcagc agcaagccag gccctccacc acagagcaag cccaactcct ctttccgacc 1020 

gccgcagaaa gacaaccccc caagccrggt ggccaaggcc cagtccttgc cctcggacca 1080 

gccggtgggg accttcagcc crctgaccac ttcggatacc agcagccccc agaagtccct 1140 

ccgcacagcc ccggccacag gccagcttcc aggccggtct tccccagcgg gatccccccg 1200 

cacctggcac gcccagatca gcaccagcaa cctgtacctg ccccaggacc ccacggttgc 1260 

caagggtgcc crggctggtg aggacacagg tgttgtgaca catgagcagt tcaaggctgc 1320 

gctcaggatg gtggtggacc agggtgaccc ccggctgctg ctggacagct acgtgaagat 1380 

tggcgagggc tccaccggca tcgtctgctt ggcccgggag aagcactcgg gccgccaggt 1440 

ggccgtcaag atgatggacc ccaggaagca gcagcgcagg gagctgctct tcaacgaggt 1500 

ggtgatcatg cgggactacc agcacttcaa cgtggtggag atgtacaaga gctacctggt 1560 

gggcgaggag ctgtgggtgc tcatggagtt cctgcaggga ggagccctca cagacatcgt 1620 

ctcccaagtc aggctgaatg aggagcagat tgccactgtg tgtgaggctg tgctgcaggc 1680 

cctggcctac ctgcatgctc agggtgtcat ccaccgggac atcaagagtg actccatcct 1740 

gctgaccctc gatggcaggg rgaagctctc ggactncgga ttctgtgctc agatcagcaa 1800 

agacgtccct aagaggaagt ccctggtggg aaccccctac tggatggctc ctgaagtgat 1860 

ctccaggtct rtgtatgcca ctgaggtgga tatctggtct ctgggcatca tggtgattga 1920 

gatggtagat ggggagccac cgtacttcag tgactcccca gtgcaagcca tgaagaggct 1980 

ccgggacagc cccccaccca agctgaaaaa crctcacaag gtctccccag tgctgcgaga 2040 

cttcctggag cggatgcrgg tgcgggaccc ccaagagaga gcoacaqccc aggagcrcct 2100 

agaccacccc ttcctgctgc agacagggct acctgagtgc ccggcgcccc tgarccagct 2:60 

craccgaaag cagacctcca cctgctgagc ccaccccaag tatgcctgcc acctacgccc 2220 

acaggcaggg cacactgggc agccagcctg ccggcaggac ttgcctgcct cctcctctca 2280 

gtactctcrc caaagattga aatgtgaagc cccagcccca cccrctgccc ttcagcctac 2340 

tgggccaggc cggacctgcc ccctcagtgt crctccctcc cgagtcccca gatggagacc 2400 

cctttctaca ggatgacccc ttgatatttg cacagggata tttctaagaa acgcagaggc 2460 

cagcgttcct ggcctcrgca gccaacacag tagaaaaggc tgctgtggtt ttttaaaggc 2520 

agttgrccac tagtgtccra ggccactgca gagggcagac tgctggtctc cacagatacc 2580 
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tgctgtrctc 

tgaccggaat 

cagcccct cc 

gtgaggggac 

ctttacagaa 

tgtgtgtgtg 

cctccaggtc 

cctgaaggga 

cagtggagtg 

tcctgtactg 

ctttcaagtt 

tcrgggagag 

gtgtgcaaac 

cctgccccac 

accctttcta 

caaagaaatt 

ctagaaattt 

agcc 



agctccagct 

cccgcttcct 

ccgccaagaa 

aatttcrgag 

tggagagaac 

tgtgrgtgtg 

acccacagcc 

cctgctttct 

grccatgcta 

tgcatcgtga 

tatctcaata 

aaggcgaacc 

catctgcccc 

ttcagggcrg 

t ctagataat 

gcaaggactr 

ttgcgaattg 



tcaaacctcg 

ccctcacgtc 

ctgtgaatgg 

tgaaagagaa 

atcgtgtgtg 

tgtgtgtaag 

agtttcagga 

tggcctggct 

agtctaacac 

gggccagaga 

accaggtcat 

acctccgggt 

agctgcctcc 

ctgtgagatg 

atttttagat 

ttttttaagg 

tttgcacttg 



agtctcgaga 

tgatgtcctg 

cctgctccag 

agaatggggt 

tgtgtgtgtg 

gggaggaaag 

aggctgcccc 

tccacctctc 

tcctgggagc 

caggaatgta 

cagggaccca 

ttccatcatg 

acagactgct 

gaattccagg 

tcttctgctc 

gccagagttt 

tgcctgtttt 



gggccacggg 

aaggtgcagt 

qccatggccg 

cggtggtgaa 

tgtgrgtgtg 

ccaccttgac 

tctctcccac 

cactcctgtg 

tcaggaggct 

aggattggca 

ttgttctctt 

tcaaggrcac 

gtctccttgt 

aaagaacttc 

cctagtgacc 

tcaaaacaaa 

aaattaaatt 



gtggttttta 

cccacctgta 

ggggcaggga 

ggtgctctca 

tgtgtgtgtg 

agcccaggtc 

taagttctgg 

tctacctggc 

tctgagcttc 

actgtgttac 

cagaacccta 

aggcatccat 

cctcctcggc 

aggtgtctgg 

tacctggggg 

agcatcttcc 

gagtgttcaa 



2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3604 



<210> 28 
<211> 2050 
<212> DNA 

<213> Mammalian (Human) PAK5 
<400> 28 



ggccagrggg gcgaaactgg cagctggccg gccctttaac acctacccga gggctgacac 
qqaccaccca tcccggggtg cccaggggga gcctcatgac gtggccccta acgggccatc 
agcggggggc ctggccatcc cccagtcctc ctcctcctcc rcccggcctc ccacccgagc 
ccgaggtgcc cccagccctg gagtgctggg accccacgcc tcagagcccc agctggcccc 
tccagcctgc acccccgccg cccctgctgt tcctgggccc cctggccccc gctcaccaca 
qcqggaqcca cagcgagtat cccatgagca gttccgggct gcccrgcagc tggtggtgga 
r-ccaggcgac ccccgctcct acctggacaa cttcatcaag attggcgagg gctccacggg 
catca-gtgc atcgccaccg tgcgcagctc gggcaagctg gtggccgtca agaagatgga 
ccrgcgcaag cagcagaggc gcgagctgct cttcaacgag gtggtaatca tgagggacta 
ccagcacgag aatgtggtgg agatgtacaa cagcracctg gtgggggacg agctctgggt 
ggt-atggag rtcctggaag gaggcgccct caccgacatc gtcacccaca ccaggatgaa 
cgaggagcag atcgcggccg tgtgccttgc agtgctgcag gccctgrcgg tgctccacgc 
ccagggcgtc atccaccggg acatcaagag cgactcgatc ctgctgaccc atgatggcag 
agtgaagcrg tcagactttg ggtrctgcgc ccaggtgagc aaggaagrgc cccgaaggaa 
gtcgctggtc ggcacgccct actggatggc cccagagctc atctcccgcc ttccctacgg 
gccagaggta gacatctggt cgctggggat aatggtgatt gagatggcgg acggagagcc 
cc-c-ac- tc aacgagccac ccctcaaagc catgaagatg attcgggaca accngccacc 
ccaactgaag aacctacaca aggtgtcgcc atccctgaag ggcttcctgg accgcctgct 
ggtgcgagac cctgcccagc gggccacggc agccgagctg ctgaagcacc cattcctggc 
caaagcaaga ccgcctgcca gcatcgtgcc cctcatgcgc cagaaccgca ccagatgagg 
c-cagcgccc ttcccctcaa ccaaagagcc cccccgggtc acccccgccc cactgaggcc 
agtagggggc caggcctccc actcctccca gcccgggaga tgctccgcgt ggcaccaccc 
^^.^^gctgg qagtagatga gaccctacta cngaactcca gttttgatct cgtgactttt 
agaaaaacac agggactcgt gggagcaagc gaggctccca ggacccccac cctctgggac 
aagccctccc ccatgttctt ctgtctccag gaagggcagc ggccctccca Ecacrggaag 
tctgcagtgg gggtcgctgg gggtggagag aacactaaga ggrgaacarg tatgagtgtg 
tocacgcgra tgagtgtgca tgtgtgtgtg tgtgcaaagg tccagccacc ccgtcctcca 
gcccgcaagg ggtgtctggc gccctgcctg acacccagcc ccczczccc<z ctgagc-at- 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
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gtgggggtcg atcatgaatg tccgaagagt ggccttttcc cgtagccctg cgcccccttt 174G 

ctgtggctgg atggggagac aggtcagggc cccccaccct ctccagcccc tgcagcaaat 1800 

gactactgca cctggacagc ctcctctttt ctagaagtct atttatattg tcattttata 1860 

acactctagc ccccgccctt attgggggac agatggrccc tgtcctgcgg ggrggccctg 1920 

gcagaaccac tgccrgaaga accaggttcc tgcccggtca gcgcagcccc agcccgccca 1980 

cccctgcctc gagttagttt tacaattaaa acattgtctt gttttgtgaa aaaaaaaaaa 2040 

aaaaaaaaaa 2050 



<210> 29 
<211> 681 
<212> PRT 

<213> Mammalian (Human) PAK4 



<400> 29 



Met Phe Arg Lys 

r 

Asn Phe Gin His 
20 

Phe Vai Gly Leu 
35 

Arg Pro Lys Pro 
50 



Lys Lys Lys Lys 
5 

Arg Val His Thr 



Pro Pro Gin Trp 
40 

Val Val Asp Pro 
55 



Arg Pro Glu lie 

10 • 

Ser Phe Asp Pro 
25 

Gin Asn lie Leu 



Ser Arg lie Thr 
60 



Ser Aia Pro Gin 
15 

Lys Glu Gly Lys 
30 

Asp Thr Leu Arg 
45 

Arg Val Gin Leu 



Gin Pro Met Lys 
65 

Tyr lie Ser Gly 



Ser Asn Thr Leu 
100 

Ser Leu Gly Leu 
115 

Tyr Leu Gin Ser 
130 

Leu Ser Cys Asn 
145 

Pro Glu Pro Gin 



Ala Gin Ser Leu 
180 

Leu Gin Leu Gly 
195 



Thr Val Val Arg 
70 

Leu Leu Asn Asp 
85. 

Arg Gly Arg Ser 



Leu Gly Asp Glu 
120 

Pro Gin Ser Giu 
135 

Gly Gly Thr Pro 
150 

Ser Pro Arg Val 
165 

^iy Pro Aia Giu 



Ala Cys Leu Gin 
200 



Gly Ser Aia Met 
75 

lie Gin Lys Leu 
90 

Pro Thr Ser Arg 
105 

His Trp Aia Thr 



Arg Thr Asp Pro 
140 

Aia Gly His Lys 
155 

Leu Pro Asn Gly 
170 

Phe Gin Gly Aia 
185 

Ser Ser Pro Pro 



Pro Val Asp Gly 
80 

Ser Val lie Ser 
95 

Arg Arg Aia Gin 
110 

Asp Pro Asp Met 
125 

His Gly Leu Tyr 



Gin Met Pro Trp 
160 

Leu Ala Aia Lys 
175 

Ser Gin Arg Cys 
190 

Gly Ala Ser Pro 
205 
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Pro Thr Giy Thr Asn Arg His Gly Met Lys Ala Aia Lys His Gly Ser 
210 215 220 

Glu Glu Ala Arg Pro Gin Ser Cys Leu Val Gly Ser Ala Thr Gly Arg 
225 230 235 240 

Pro Glv Gly Glu Gly Ser Pro Ser Pro Lys Thr Arg Glu Ser Ser Leu 
245 250 255 

Lvs Arq A-g Leu Phe Arg Ser Met Phe Leu Ser Thr Ala Ala Thr Ala 
^ 260 265 270 

Pro °ro Ser Ser Ser Lys Pro Gly Pro Pro Pro Gin Ser Lys Pro Asn 
275 280 285 

Ser Ser Phe Arg Pro Pro Gin Lys Asp Asn Pro Pro Ser Leu Val Ala 
290 295 300 

Lvs Ala Gin Ser Leu Pro Ser Asp Gin Pro Val Gly Thr Phe Ser Pro 
305 310 315 320 

Leu Thr Thr Ser Asd Thr Ser Ser Pro Gin Lys Ser Leu Arg Thr Ala 
325 330 335 

Pro Ala Thr Gly Gin Leu Pro Gly Arg Ser Ser Pro Ala Gly Ser Pro 
340 345 350 

Arg Th- Tro His Ala Gin He Ser Thr Ser Asn Leu Tyr Leu Pro Gin 
355 360 365 

Asp Pro Thr val Ala Lys Gly Ala Leu Ala Gly Glu Asp Thr Gly Val 
370 375 380 

Val Thr His Glu Gin Phe Lys Ala Ala Leu Arg Met Val Val Asp Gin 
385 390 395 400 

Gly Asp Pro Arg Leu Leu Leu Asp Ser Tyr Val Lys He Gly Glu Gly 
405 410 415 

Ser Thr Gly lie Val Cys Leu Ala Arg Glu Lys His Ser Gly Arg Gin 
420 425 430 

Val Ala Val Lys Met Met Asp Leu Arg Lys Gin Gin Arg Arg Glu Leu 
435 440 445 

Leu Phe Asn Glu Val Val He Met Arg Asp Tyr Gin His Phe Asn Val 
450 455 460 

Val Glu Met Tyr Lys Ser Tvr Leu Val Gly Glu Glu Leu Trp Val Leu 
465 470 475 480 

Met Glu Phe Leu Gin Gly Gly Ala Leu Thr Asp lie Val Ser Gin Val 
485 490 49d 

A-g Leu Asn Glu Glu Gin He Aia Thr Val Cys Glu Ala Val Leu Gin 
500 505 510 
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Ala Leu Ala Tyr Leu His Aia Gin Giy Vai lie His Arg Asp lie Lys 
515 520 525 

Ser Asp Ser lie Leu Leu Thr Leu Asp Giy Arg Vai Lys Leu Ser Asp 
530 535 540 

Phe Giy Phe Cys Aia Gin lie Ser Lys Asp Vai Pro Lys Arg Lys Ser 
545 550 555 560 

Leu Vai Giy Thr Pro Tyr Trp Met Aia Pro Giu Vai lie Ser Arg Ser 
565 570 575 

Leu Tyr Aia Thr Giu Vai Asp lie Trp Ser Leu Giy lie Met Vai lie 
580 585 590 

Giu Met Vai Asp Giy Giu Pro Pro Tyr Phe Ser Asp Ser Pro Vai Gin 
595. . ' 600 605 

Ala Met Lys Arg Leu Arg Asp Ser Pro Pro Pro Lys Leu Lys Asn Ser 
610 615 620 

His Lys Vai Ser Pro Vai Leu Arg Asp Phe Leu Giu Arg Met Leu Vai 
625 630 635 640 

Arg Asp Pro Gin Giu Arg Aia Thr Ala Gin Giu Leu Leu Asp His Pro 
645 650 655 

Phe Leu Leu Gin Thr Giy Leu Pro Giu Cys Leu Vai Pro Leu lie Gin 
660 665 670 

Leu Tyr Arg Lys Gin Thr Ser Thr Cys 
675 680 



<210> 30 
<21i> 398 
<2i2> PRT 

<213> Maminalian (Human) PAK5 
<400> 30 

Ala Ser Giy Aia Lys Leu Aia Aia Giy Arg Pro Phe Asn Thr Tyr Pro 
15 10 15 

Arg Aia Asp Thr Asp His Pro Ser Arg Giy Aia Gin Giy Giu Pro His 
20 25 30 

Asp Vai Aia Pro Asn Giy Pro Ser Ala Giy Giy Leu Aia lie Pro Gin 
35 40 45 

Ser Ser Ser Ser Ser Ser Arg Pro Pro Thr Arg Aia Arg Giy Aia Pro 
50 55 60 
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Ser Pro Giy Val Leu Gly Pro His Ala Ser Glu Pro Gin Leu Ala Pro 
65 70 75 80 

Pro Ala Cys Thr Pro Ala Ala Pro Ala Val Pro Gly Pro Pro Gly Pro 
85 90 95 

Arg Ser Pro Gin Arg Glu Pro Gin Arg Val Ser His Giu Gin Phe Arg 
100 105 110 

Ala Ala Leu Gin Leu Val Val Asp Pro Gly Asp Pro Arg Ser Tyr Leu 
115 120 125 

Asp Asn Phe lie Lys lie Gly Glu Gly Ser Thr Gly lie Val Cys lie 
130 135 140 

Ala Thr Val Arg Ser Ser Gly Lys Leu Val Ala Val Lys Lys Met Asp 
145 150 155 160 

Leu Arg Lys Gin Gin Arg Arg Glu- Leu Leu Phe Asn Glu Val Val lie 
165 170 175 

Met Arg Asp Tyr Gin His Glu Asn Val Val Glu Met Tyr Asn Ser Tyr 
180 185 190 

Leu Val Gly Asp Glu Leu Trp Val Val Met Glu Phe Leu Glu Gly Gly 
195 200 205 

Ala Leu Thr Asp lie Val Thr His Thr Arg Met Asn Glu Glu Gin lie 
210 215 220 

Ala Ala Val Cys Leu Ala Val Leu Gin Ala Leu Ser Val Leu His Ala 
225 230 235 240 

Gin Gly Val lie His Arg Asp lie Lys Ser Asp Ser lie Leu Leu Thr 
245 250 255 

His Asp Gly Arg Val Lys Leu Ser Asp Phe Gly Phe Cys Ala Gin Val 
260 265 270 

Ser Lys Glu Val Pro Arg Arg Lys Ser Leu Val Gly Thr ■ Pro Tyr Trp 
275 280 285 

Met Ala Pro Glu Leu lie Ser Arg Leu Pro Tyr Gly Pro Glu Val Asp 
290 295 300 

lie Trp Ser Leu Gly lie Met Val lie Glu Met Val Asp Gly Glu Pro 
305 310 315 320 

Pro Tyr Phe Asn Glu Pro Pro Leu Lys Ala Met Lys Met lie Arg Asp 
325 330 335 

Asn Leu Pro Pro Arg Leu Lys Asn Leu His Lys Val Ser Pro Ser Leu 
340 345 350 

Lys Giy Phe Leu Asp Arg Leu Leu Val Arg Asp Pro Ala Gin Arg Ala 
355 ' 360 365 
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Thr Ala Ala Glu Leu Leu Lys His Pro Phe Leu Ala Lvs Ala Gly Pr-c 
370 375 380 

Pro Ala Ser He Val Pro Leu Met Arg Gin Asn Arg Thr Ara 
385 390 395 



<210> 31 
<211> 1001 
<212> PRT 

<2i3> Mammalian (murine/human) SULU3 
<400> 31 

Met Pro Ser Thr Asn Arg Ala Gly Ser Leu Lys Asd Pro Glu lie Ala 
^. .... ^ 15 

Glu Leu Phe Phe Lys Glu Asp Pro Glu Lys Leu Phe Thr Asp Leu A-g 
20 25 30 

Glu He Gly His Gly Ser Phe Gly Ala Val Tyr Phe Ala Arg Asp Val 
35 40 45 

Arg Thr Asn Glu Val Val Ala lie Lys Lys Met Ser Tyr Ser Gly Lys 
'50 55 60 

Gin Ser Thr Glu Lys Trp Gin Asp He He Lys Glu Val Lys Phe Leu 

70 75 80 

Gin Arg He Lys His Pro Asn Ser He Glu Tyr Lys Gly Cys Tyr Leu 
35 90 95 

Arg Glu His Thr Ala Trp Leu Val Met Glu Tvr Cvs Leu Giy Ser Ala 
100 105 ' lib 

Ser Asp Leu Leu Glu Val His Lys Lys Pro Lea Gin Glu Val Glu He 
115 120 125 

Ala Ala He Thr His Gly Ala Leu Gin Gly Leu Ala Tyr Leu His Ser 
130 135 140 

His Thr Met He His Arg Asp He Lys Ala Gly Asn He Leu Leu Thr 
1^55 150 155 160 

Glu Pro Gly Gin Val Lys Leu Ala Asp Phe Gly Ser Ala Ser Met Ala 
165 170 ' 175 

Ser Pro Ala Asn Ser Phe Val Gly Thr Pro Tyr Trp Met Ala Pro Glu 
180 185 " ' 190 

Val He Leu Ala Met Asp Glu Gly Gin Tyr Asp Glv Lvs Val Asp Val 
195 200 ' 205 

Trp Ser Leu Gly He Thr Cys He Glu Leu Ala Glu Arg Lys Pro Pre 
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210 215 220 

Leu Phe Asn Met Asn Aia Met Ser Ala Leu Tyr His He Ala Gin Asn 
225 230 235 240 

Glu Ser Pro Thr Leu Gin Ser Asn Glu Trp Ser Asp Tyr Phe Arg Asn 
245 250 255 

Phe Vai Asp Ser Cys Leu Gin Lys lie Pro Gin Asp Arg Pro Thr Ser 
260 265 270 

Giu Glu Leu Leu Lvs His lie Phe Vai Leu Arg Giu Arg Pro Glu Thr 
275 230 285 

Vai Leu He Asp Leu He Gin Arg Thr Lys Asp Aia Vai Arg Giu Leu 
290 295 300 



AsD Asn Leu Gin Tyr Arg Lys Met- Lys Lys Leu Leu Phe Gin Giu Aia 
305 310 315 320 

His Asn Giy Pro Aia Vai Giu Aia Gin Giu Giu Giu Giu Giu Gin Asp 
325 330 335 

His Giv Vai Giv Arg Thr Giy Thr Vai Asn Ser Vai Giy Ser Asn Gin 
340 345 350 

Ser lie Pro Ser Met Ser lie Ser Aia Ser Ser Gin Ser Ser Ser Vai 
355 360 365 

Asn Ser Leu Pro Asp Vai Ser Asp Asp Lys Ser Glu Leu Asp Met Met 
370 375 380 

Giu Giy AsD His Thr Vai Met Ser Asn Ser Ser Vai lie His Leu Lys 
385 ' 390 . 395 400 

Pro Giu Giu Giu Asn Tyr Arg Giu Giu Giy Asp Pro Arg Thr Arg Aia 
405 410 415 

Ser Asp Pro Gin Ser Pro Pre Gin Vai Ser Arg His Lys Ser His Tyr 
420 425 430 

Arg Asn Arg Giu His Phe Aia Thr He Arg Thr Aia Ser Leu Vai Thr 
435 440 445 

Arg Gin Met Gin Giu His Giu Gin Asp Ser Giu Leu Arg Giu Gin Met 
450 455 460 

Ser Giy Tyr Lvs Arg Met Ara Arg Gin His Gin Lys Gin Leu Met Thr 
465 ^ 470 475 480 

Leu Giu Asn Lys Leu Lys Aia Giu Met Asp Giu His Arg Leu Arg Leu 
485 490 455 

Asp Lvs AsD Leu Giu Thr Gin Arg Asn Asn Phe Aia Aia Giu Met Glu 
.500 505 510 
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Lys Leu lie hys hys His Gin Ala Ala Met Glu Lys Glu Ala Lys Val 
515 520 525 

Met Ser Asn Glu Glu Lys Lys Phe Gin Gin His He Gin Ala Gin Gin 
530 535 540 

Lys Lys Glu Leu Asn Ser Phe Leu Glu Ser Gin Lys Arg Glu Tyr Lys 
545 550 555 560 

Leu Arg Lys Glu Gin Leu Lys Glu Glu Leu Asn Glu Asn Gin Ser Thr 
565 570 575 

Pro Lys Lys Glu Lys Gin Glu Trp Leu Ser Lys Gin Lys Glu Asn lie 
580 585 590 

Gin His Phe Gin Ala Glu Glu Glu Ala Asn Leu Leu Arg Arc Gin Arg 
595 600 605 

Gin Tyr Leu Glu Leu Glu Cys P.rg Arg Phe Lys Arg Arg Mez Leu Leu 
610 615 620 

Gly Arg His Asn Leu Glu Gin Asp Leu Val Arg Glu Glu Leu Asn Lys 
^25 630 635 640 

Arg Gin Thr Gin Lys Asp Leu Glu His Ala Met Leu Leu Arg Gin Kis 
645 - 650 655 

Glu Ser Met Gin Glu Leu Glu Phe Arg His Leu Asn Thr He Gin Lys 
660 665 670 

Met Arg Cys Glu Leu He Arg Leu Gin His Gin Thr Glu Leu Thr Asn 
6*75 680 685 

Gin Leu Glu Tyr Asn Lys Arg Arg Glu Ara Glu Leu Aro Arg Lys His 
690 695 700 

Val Met Glu Val Arg Gin Gin Pro Lys Ser Leu Lvs Ser Lys Glu Leu 
705 710 715 720 

Gin He Lys Lys Gin Phe Gin Asp Thr Cys Lys He Gin Thr Arg Gin 
725 730 735 

Tyr Lys Ala Leu Arg Asn His Leu Leu Glu Thr Thr Pro Lys Ser Glu 
740 745 750 

His Lys Ala Val Leu Lys Arg Leu Lys Glu Glu Gin Thr Arc Lvs Leu 
755 760 765 

Ala lie Leu Ala Glu Gin Tyr Asp His Ser He Asn Glu Met Leu Se- 
770 775 730 

Thr Gin Ala Leu Arg Leu Asp Glu Ala Gin Glu Ala Glu Cvs Gin Val 
785 790 795 ' 800 

Leu Lys Mer Gin Leu Gin Gin Glu Leu Glu Leu Leu Asn Ala Tvr Gin 
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805 810 815 

Ser Lys ^ie Lys Met Gin Ala Glu Ala Gin His Asp Arg Glu Leu Arg 
820 825 830 

Glu Leu Glu Gin Arg Val Ser Leu Arg Arg Ala Leu Leu Glu Gin Lys 
835 840 845 

lie Giu Glu Glu Met Leu Ala Leu Gin Asn Glu Arg Thr Glu Arg lie 
850 855 860 

Ara Ser L«u Leu Glu Arg Gin Ala Arg Glu lie Giu Ala Phe Asp Ser 
865 870 875 880 

Glu S-^r Met Arg Leu Gly Phe Ser Asn Met Val Leu Ser Asn Leu Ser 
885 ' 890 895 

Pro Giu Ala Phe Ser His Ser Tyr Pro Gly Ala Ser Gly Trp Ser His 
900 ■ 905 910 

Asn ^ro ^hr Gly Gly Pro Gly Pro His Trp Gly His Pro Met Gly Gly 
915 920 925 

Pro Pro Gin Ala Trp Gly His Pro Met Gin Gly Gly Pro Gin Pro Trp 
930 935 940 

Gly H^s Pro Ser Gly Pro Met Gin Gly Val Pro Arg Gly Ser Ser Met 
945 950 955 960 

Gly Val Arg Asn Ser Pro Gin Ala Leu Arg Arg Thr Ala Ser Gly Gly 
965 970 975 

Arg Thr Glu Gin Gly Met Ser Arg Ser Thr Ser Val Thr Ser Gin lie 
980 985 990 

Ser Asn Gly Ser His Met Ser Tyr Thr 
995 iOOO 



<210> 32 

<211> 25 

<212> DNA 

<213> TRKl 

<220> 

<223> "n" stands for a, c, g or t . 

"y" stands for c or c. 

"r" stands for a or g. 

<400> 32 
ccgaactcgg ngcnttyggn aargt 
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<210> 33 
<211> 24 
<212> DNA 
<213> TRK4 

<220> 

<223> "n" stands for a, c, q or t . 
"y" stands for c or t . 



<400> 33 

gctggatccy tcnggnggca tcca 2 4 



<210> 34 
<211> 23 
<212> DNA 
<2i3> ROSl 

<220> 

<223> "n" stands for a, c, g or t. 

"y" stands for c or t , 

"r" stands for a or g. 

<400> 34 

gcntt yggng argtntayga rgg 23 



<210> 35 
<211> 24 
<212> DNA 
<213> CCK4b 

<220> 

<223> "n" stands for a, c, g or t . 

"y" stands for c or t. 

"s" stands for c or g . 

"w" stands for a or t. 

<400> 35 

gctggatccy ccnggnswca tcca 24 



<210> 36 
<211> 23 
<212> DNA 
<213> CCK4C 
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<220> 

<223> "n" stands for a, c, g or t . 
"y" stands for c or t . 
"r" stands for a or g . 

<400> 36 

gagttyggng argtnttyyt ngc 23 



<210> 37 
<211> 6 
<212> PRT 
<213> TRKl 

<400> 37 

Giy Ala PheGiy Lys Val 
1 5 



<210> 38 

<211> 5 

<212> PRT 

<213> TRK4 

<400> 38 

Trp Met Pro Pro Glu 
1 5 



<210> 39 

<211> 8 

<212> PRT 

<213> ROSl 

.<400> 39 

Ala Phe Gly Glu Val Tyr Glu Gly 
1 5 



<210> 40 

<211> 5 

<212> PRT 

<2i3> CCK4b 

<400> 40 

Trp Met Ser Pro Glu 
1 5 
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<210> 41 

<211> 8 

<212> PRT 

<213> CCK4C 

<400> 41 

Glu Phe Giy Giu Val Tyr Giu Gly 
1 5 



<210> 42 
<211> 21 
<212> -DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 42 
cacagaaacg gtcagattca c 



<210> 43 
<211> 22 
<212> DNA 

<213> A.rrificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 43 
gatcagggtg acatcaagcg ac 



22 



<210> 44 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthesized nucleic acid molecule 
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<400> 44 
ctcatcrgta cacacttcat gg 



<210> 45 
<2il> 21 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthesized nucleic acid molecule 
<4C0> 45 

gattcccacactgtagatgtc 21 



<210> 46 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
. . <220> 

<223> Synthesized nucleic acid molecule 
<400> 46 

ggccctcgac tacatccacc acat 24 



<210> 47 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 



<400> 47 

caacgaaact aacacagcat aagg 24 



<210> 48 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<22C> 
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<223> Synthesized nucleic acid molecule 
<400> 48 
atggcgaacg actctcccgc gaa 



<210> 49 
<21l> 26 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthesized nucleic acid molecule 
<400> 49 
a.caccaaaat caacaagttr cacctc 



<210> 50 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 



26 



<400> 50 
agttacaagg aartccaagt tct 



23 



<210> 51 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 51 
atgaagagga agaaatcaaa ctg 



<210> 52 
<211> 20 



23 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 52 

agatggactg tactgggagg 20 



<210> 53 
<211> 20 
<212> DNA 

<213> Artificial Sequence " , 

<220> 

<:22 3> • Synrhesized nucleic acid molecule 
<400> 53 

actttgtgca gctctgtggg 20 



<210> 54 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 



<400> 54 

aaggrtatgg atgrcacagg g 21 

<210> 55 

<211> 20 

<212> DNA 

<213> Arrificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 

<400> 55 

ctcacaaggt tgccaacagg 20 
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<2iO> 56 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 56 
agtccccacc agaaggttta c 



21 



<210> 57 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 57 
tcaggggrca gaggtcacg 

19 



<210> 58 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 58 
cccaaaccct accacaaatt c 

21 



<210> 59 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 59 
cccccggaaa acaatgacca 

20 
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<210> 60 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 60 

agccgctgcc cctcctctac tgt 23 



<210> 61 
<211> 23 
<212> DNA 

. <2 13> ' ArEif icial Sequence- 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 61 

accgcaacat cgccacctac tac 23 



<2i0> 62 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 62 

ctcgacgtcg tggaccacc 19 



<210> 63 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 63 

caatgttaac ccactctatg tctc 24 
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<210> 64 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<22C> 

<223> Synnhesized nucleic acid molecule 
<400> 64 



agtttgccga tgtttttctt ttc 



<210> 65 
<211> .13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 65 
ccgccatgaa ccccggctt 



<210> 66 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<40C> 66 
cgattgccaa agaccgtgtc a 



<210> 67 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesizea nucleic acid molecule 
<400>. 67 
agaagttgca gctgttgaga gga 



23 



19 



21 
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<210> 68 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 68 

tatggcccgt gtaaggattt c 21 



<210> 69 
<211> 23 
. <212>.ONA 

<213> Artificial Sequence 

<220> 

<223> Synthesized nucleic acid molecule 
<400> 69 

gtgccagaag tgttgtgttg taa 23 



<210> 70 
<211> 22 
<212> DNA 

<2 1 3> Artificial Sequence 

<220> 

<223> Synthesized nucleic acid molecule 
<400> 70 

tattgaattg gcggaacgga ag 22 



<210> 71 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 2 3> Synthesi zed nucleic acid molecule 
<400> 71 

ttgtttrgtg ctcattcttt ggag 24 
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<210> 72 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 72 
gattgctttg tgctcattct ttqq 

2A 



- ' <210> -73 
<2il> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 73 
ttgttcraag agtgccctcc g 



21 



<210> 14 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 74 
aagaccatgc cgtgcgccg 

19 



<210> 75 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 75 
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attccttcag gttctggtta tgg 



23 



<210> 76 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 76 

agtccgccaa gcctcaatg ]_9 



.<210>-77 
<211> 21 
<212> DNA . 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 



<210> 78 
<211> 12 
<212> ?RT 

<213> Peptide 540A 
<400> 78 

His Gly Asp Pro Arg Pro Glu Pro Arg Pro Thr Gin 
15 10 



<210> 79 

<211> 10 

<212> PRT 

<213> Peptide 539A 

<400> 79 

Cys Leu Asp Phe Pro Lys Glu Asp Tyr Arg 



<400> 77 



caccttcagc tgctctatca 



21 



5 



10 
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<210> 80 
<211> 19 
<212> PRT 

<213> Peptide 541A 



<4G0> 80 



Asp Pro Arg Thr Arg Ala 3er Asp Pro Gin Ser Pro Pro G^n Val Ser 
^3 io 15 

Arg His Lys 



<210> 81 
<211> 14 
<212> PRT 

<213> Peptide 542A 
<400> 81 



Pro Ser Thr Asn Arg Ala Gly Ser Leu Lys Asp Pro Glu Cys 
^ 5 10 



<210> 82 
<211> 15 
<212> PRT 

<213> Pepnide 554A 
<400> 82 

Cys Leu Val Pro Leu lie Gin Leu Tyr Arg Lvs Gin Thr Se- Thr 
^ 10 15 



<210> 83 
<211> 10 
<212> PRT 

<213> Peptide 579A 
<400> 83 



Cys Pro Leu Mez Arg Gin Asn Arg Thr Arg 

10 



1 5 



<210> 84 
<211> 426 
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<212> PRT 
<213> STE20h 



<400> 64 

Met Ala His Leu Arg Gly Phe Ala Asn Gin His Ser Arg Val Asp Pro 
15 10 is' 

Glu Glu Leu Phe Thr Lys Leu Asp Arg lie Gly Lys Gly Ser Phe Gly 
20 25 30 

Glu Val Tyr Lys Gly lie Asp Asn His Thr Lys Glu Val Val Ala lie 
35 40 45 

Lys He lie Asp Leu Glu Glu Ala Glu Asp Glu He Glu Asp He Gin 
50 55 60 

Gin Gl.u He- Thr Val Leu Ser Gin .Cys Asp Ser Pro Tyr He Thr Arg 
65 70 75 80 

Tyr Phe Gly Ser Tyr Leu Lys Ser Thr Lys Leu Trp He He Met Glu 
85 90 95 

Tyr Leu Gly- Gly Gly Ser Ala Leu Asp Leu Leu Lys Pro Gly Pro Leu 
100 105 110 

Glu Glu Thr Tyr He Ala Thr He Leu Arg Glu He Leu Lys Gly Leu 
115 120 125 

Asp Tyr Leu His Ser Glu Arg Lys He His Arg Asp He Lys Ala Ala 
130 135 140 

Asn Val Leu Leu Ser Glu Gin Gly Asp Val Lys Leu Ala Asp Phe Gly 
145 150 155 160 

Val Ala Gly Gin Leu Thr Asp Thr Gin He Lys Arg Asn Thr Phe Val 
165 170 175 

Gly Thr Pro Phe Trp Met Ala Pro Glu Val He Lys Gin Ser Ala Tyr 
180 185 190 

Asp Phe Lys Ala Asp He Trp Ser Leu Gly He Thr Ala He Giu Leu 
195 200 205 

Aia Lys Gly Giu Pro Pro Asn Ser Asp Leu His Pro Met Arg Val Leu 
210 215 220 

Phe Leu He Pro Lys Ash Ser Pro Pro Thr Leu Glu Gly Gin His Ser 
225 230 235 240 

Lys Pro Phe Lys Giu Phe Val Glu Aia Cys Leu Asn Lys Asp Pro Arg 
245 250 255 

Phe Arg Pro Thr Aia Lys Glu Leu Leu Lys His Lys Phe lie Thr Arg 
260 265 270 
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Tyr Thr Lys Lys Thr Ser Phe Leu Thr Giu Leu lie Asd -Arc Tyr Lys 
275 280 285 

Arg Trp Lys Ser Giu Giy His Giy Giu Glu Ser Ser Ser Giu Aso Ser 
290 295 300 

Asp lie Asp Giy Giu Ala Giu Asp Giy Giu Gin Giy Pro lie Trp Thr 
305 310 315 * 320 

Phe Pro Pro Thr lie Arg Pro Ser Pro His Ser Lys Leu His Lys Giy 
325 330 335 

Thr Ala Leu His Ser Ser Gin Lys Pro Ala Glu Pro Val Lys Arg Gin 
340 345 350 

Pro Arg Ser Gin Cys Leu Ser Thr Leu Val Arg 'Pro Val Phe Giy Giu 
355 360 365 

Leu Lys Giu Lys His Lys Gin Ser Giy Giy Ser Vai Giy Ala Leu Glu 
370 375 380 

Glu Leu Glu Asn Ala Phe Ser Leu Ala Glu Glu Ser Cys Pro Giy lie 
385 390 395 400 

Ser Asp Lys Leu Met Vai His Leu Val Giu Arg Val Gin Arg Phe Ser 
405 410 415 

His Asn Arg Asn His Leu Thr Ser Thr Arg 
420 425 



<210> 85 
<211> 431 
<212> PRT 
<213> MST3 

<400> 85 

Met Ala His Ser Pro Vai Gin Ser Giy Leu Pro Giy Met Gin Asn Leu 
15 10 15 

Lys Ala Asp Pro Giu Giu Leu Phe Thr Lvs Leu Giu Lvs lie Giy Lys 
20 25 " - ^30 

Giy Ser Phe Giy Glu Vai Phe Lys Giy lie Asp Asn Arq Thr Gin Lvs 
35 40 45' 

Vai Val Ala lie Lys lie lie Asp Leu Glu Giu Ala Glu Asp Glu lie 
50 55 60 

Glu Asp lie Gin Gin Giu lie Thr Val Leu Ser Gin Cvs Aso Ser Pro 
^5 70 75 ' ' 80 ■ 

Tyr Val Thr Lys Tyr Tyr Giy Ser Tyr Leu Lys Asp Thr Lys Leu Trp 
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85 90 95 

lie lie Met Glu Tyr Leu Gly Giy Giy Ser Ala Leu Asp Leu Leu Glu 
100 105 110 

Pro Gly Pro Leu Asp Glu Thr Gin lie Ala Thr lie Leu Arg Glu lie 
115 120 125 

Leu Lys Gly Leu Asp Tyr Leu His Ser Glu Lys Lys He His Arg Asp 
130 135 140 

He Lys Ala Ala Asn Val Leu Leu Ser Glu His Gly Glu Val Lys Leu 
145 150 155 160 

Ala Asp Phe Gly Val Ala Gly Gin Leu Thr Asp Thr Gin He Lys Arg 
165 170 175 

Asn Thr Phe Val Gly Thr Pro Phe Trp Met Ala Pro Glu Val lie Lys 
.180 185 190 

Gin Ser Ala Tyr Asp Ser Lys Ala Asp He Trp Ser Leu Gly He Thr 
195 200 205 

Ala He Glu Leu Ala Arg Giy Glu Pro Pro His Ser Glu Leu His Pro 
210 215 220 

Met Lys Val Leu Phe Leu He Pro Lys Asn Asn Pro Pro Thr Leu Glu 
225 230 235 240 

Gly Asn Tyr Ser Lys Pro Leu Lys Glu Phe Val Glu Ala Cys Leu Asn 
245 250 255 

Lys Glu Pro Ser Phe Arg Pro Thr Ala Lys Glu Leu Leu Lys His Lys 
260 265 270 

Phe He Leu Arg Asn Ala Lys Lys Thr Ser Tyr Leu Thr Glu Leu He 
275 . 280 285 • 

Asp Arg Tyr Lys Arg Trp Lys Ala Glu Gin Ser His A.sp Asp Ser Ser 
290 295 300 

Ser Glu Asp Ser Asp Ala Glu Thr Asp Gly Gin Ala Ser Gly Gly Ser 
305 310 315 320 

Asp Ser Gly Asp Trp He Phe Thr He Arg Glu Lys Asp Pro Lys Asn 
325 330 335 



Leu Glu Asn Gly Ala Leu Gin Pro Ser Asp Leu Asp Arc Asn Lys Met 
340 345 350 

Lys Asp He Pro Lys Arg Pro Phe Ser Gin Cys Leu Ser Thr He He 

355 36C ^ 365 

Ser Pro Leu Phe Ala Glu Leu Lys Glu Lys Ser Gin Ala Cys Gly Gly 

370 375 380 
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Asn Leu Gly Ser He Giu Glu Leu Arc Giy Ala He Tyr Leu Ala Giu 

400 



395 



Giu Ala Cys Pro Giy He Ser 



405 



Asp Thr Met Val Ala Gin Leu Vai Gin 

415 



Arq Leu Gin Arg Tvr Ser Leu Se^ 

^20 425 



Gly Giy Gly Thr Ser Ser His 

430 



<210> 86 
<211> 443 
<212> PRT 
<213> T19A5.2_ce 

<220> . . 

<223> "Xaa" stands for any' amino acid. 
<400> 86 



Met Thr Thr Thr Ser Ser Asp Glu Leu Pro Arg Gin Ala Asp Asp Asp 
^ 10 15 

ser Met Lys Trp Asp Arg lie Tyr lie Gin Lys Leu Asp Pro Glu Val 
^° 25 30 

lie Phe Thr Lys Gin Glu Arg He Gly Arg Gly Ser Phe Gly Glu Val 
3= 40 45 

Tyr Lys Giy lie Asp Asn Arg Thr Gly Arg Val Val Ala He Lys 11^ 

-''^ 55 



60 



lie Asp Leu Glu Gin Ala Glu Asp Giu He Glu Asp He Gin Gin Glu 

^° 75 80 

He Gin val Leu Ser Gin Cys Asp Ser Gin Tyr Val Thr Lvs Ty^ Phe 
85 90 ^ 95 

Gly Ser Phe Leu Lys Giy Ser Lys Leu Trp lie He Met Glu Tyr Leu 
100 105 

Gly Gly Giy Ser Ala Leu Asp Leu Thr Lys Ser Gly Lys Leu Aso Giu 

12^ 125 

Ser His He Ala Val He Leu Arg Glu He Leu Lys Giy Leu Giu Tyr 



140 



Leu His Ser Glu Arg Lys lie His Arg Asp He Lys Ala Ala Asn Vai 
Leu Val Ser Glu His Giy Asp Val Lys Val Ala Asp Phe Giy Val Ala 



165 170 



175 
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Giy Gin Leu Thr Giu Thr Vai Lys Lys Arg lie Thr Phe Val Giy Ser 
180 185 190 

Pro Phe Trp Met Ala Pro Glu Leu lie Lys Gin Ser Ser Tyr Asp Tyr 
195 200 205 

Lys Ala Asp lie Trp Ser Leu Giy lie Thr Ala lie Glu Leu Ala Asn 
210 215 220 

Giy Glu Pro Pro His Ser Asp Leu Kis Pro Met Arg Val Leu Phe Leu 
225 230 235- 240 

lie Pro Lys Asn Pro Pro Pro Val Leu Gin Giy Ser Gin Trp Ser Lys 
245 250 255 

Pro Phe Lys Giu Phe Val Giu Met Cys Leu Asn Lys Asp Pro Glu Asn 
260 265 270 

Arg Pro Ser Ala Ser Thr Leu Leu Lys His Gin Phe Tie Lys Arg Ala 
275 280 285 

Lys Lys Asn Ser lie Leu Val Asp Leu lie Glu Arg Ala Ala Glu Tyr 
290 295 300 

Arg Leu Arg Thr Giy Val Ser Ser Asp Ser Asp Leu Asp Giu Asp Ser 
305 310 315 ' 320 

Asp Giy Giy Giy Giy Thr Ser Lys Trp Asp Tyr Pro Thr Val Arg Giy 
325 330 335 

Pro Arg Vai Ser Ala Asp Asp Asp Giy Thr Vai Arg Gin Arg Thr Asp 
340 345 350 

Arg Pro Arg Ala Gin Val Asp Arg Arg Ser Pro Ser Giy Ser Pro Giy 
355 360 365 

Giy Thr lie Vai Arg Giy Ser Pro Gin Vai Ala Ala Val Ala Giu Gin 
370 375 380 



Leu Arg Asn Ser 
385 

Asp Vai Glu Asp 



Leu Met Gin Arg 
420 

Ala Ala lie Arg 
435 



Xaa Xaa Ala Leu 
390 

Ser Cys Pro Giy 
405 

lie Ala Val Pro 



Arg Leu Thr Thr 
440 



Asp Gin Leu Arg 
395 

lie Cys Asn Giu 
410 

Gin Vai Ser Gin 
425 

Pro Pro Ser 



His Vai Phe Arg 
400 

Leu lie Giu Giu 
415 

Ser Asp Leu Asp 
4 30 



<210> 87 
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<400> 87 

Leu Leu Tyr Arg Asn Phe Val Lys lie Gly Gin Glv Ala Ser Gly Asd 
^ 5 10 15 ' 

Val Tyr Ser Ala Arg Gin Val Gly Thr Asn Leu Ser Val Ala He Lvs 
20 25 30 

Lys Met Asn lie Asn Gin Gin Pro Lys Lys Glu Phe He Val Asn Glu 
35 40 45 

lie Leu Val Met Lys Ser His His His Lys Asn He Val Asn Phe He 
50 55 60 

Asp Thr Phe Phe Tyr Lys Ser Glu .Leu Trp Met Val Met Glu Tyr Met 

^ 7= 30 

Arg Gly Gly Ser Leu Thr Glu Val Val Thr Asn Asn Thr Leu Se- Giu 
85 90 95 

Gly Gin He Ala Ala He Cys Lys Glu Thr Leu Glu Gly Leu Gin His 
100 105 110 

Leu His Glu Asn Gly He Val His Arg Asp He Lys Ser Asp Asn Tie 
115 120 125 

Leu Leu Ser Leu Gin Gly Asp He Lys Leu Thr Asp Phe Gly Phe Cys 
^ 135 140 

Ala Gin He Asp Ser Asn Met Thr Lys Arg Thr Thr Met Val Gly Thr 

150 155 150 

Pro Tyr Trp Met Ala Pro Glu. Val Val Thr Arg Lvs Giu Ty- Gly Phe 
165 170 

Lys Val Asp Val Trp Ser Leu Gly He Met Ala He Glu Met Val G^u 
180 185 190 

Gly Glu Pro Pro Tyr Leu Asn Glu Asn Pro Leu Arg Ala Leu Ty- Leu 
195 200 205 

lie Ala Thr He Gly Thr Pro Lys He Ser Arg Pro Glu Leu Leu Ser 
210 215 220 

Ser Val Phe His Asp Phe Leu Ser Lys Ser Leu Thr Val Asn Pro Lys 

23d 240 
Gin Arg Pro Ser Ser Gly Giu Leu Leu Arg His Pr 



245 250 



o Phe Leu Lys Gin 
255 



Ala Val Pro Val Ser Ser Leu lie Pro Leu lie Lvs ^ . 

260 ' ;;o 



BNSDOCfD: cWO 995303eA2_L> 



wo 99/53036 



PCT/US99/08150 



32 



Ser Gly Lys 
275 



<210> 83 
<211> 1109 
<212> PRT 
<213> ZC504 . 4_ce 

<400> 88 

Met Ser Ser Ser Gly Leu Asp Giu lie Asp Leu Asn Ser Leu Arg Asp 
15 10 15 

Pro Ala Gly lie Phe Giu Leu lie Glu Val Val Gly Asn Gly Thr Tyr 
20 25 30 

Gly Gin Val Tyr Lys Gly Arg His Val Lys Thr Ala Gin Leu Ala Ala 
35 40 45 

lie Lys lie Met Asn lie Asn Glu Asp Glu Glu Asp Glu lie Lys Leu 
50 55 60 

Glu lie Asn Met Leu Lys Lys His Ser His His Arg Asn Val Ala Thr 
65 70 75 80 

Tyr Tyr Gly Ala Phe lie Lys Lys Leu Pro Ser Ser Thr Gly Lys His 
85 90 95 



Asp Gin Leu Trp Leu Val Met Glu Phe Cys Gly Ser Gly Ser lie Thr 
100 105 110 

Asp Leu Val Lys Asn Thr Lys Gly Giy Ser Leu Lys Giu Glu Trp lie 
115 120 125 

Ala Tyr lie Cys Arg Glu lie Leu Arg Gly Leu Tyr His Leu His Gin- 
• 130 135 140 

Ser Lys Val lie His Arg Asp lie Lys Gly Gin Asn Val Leu Leu Thr 
145 150 155 160 

Asp Ser Ala Glu Val Lys Leu Val Asp Phe Gly Val Ser Ala Gin Leu 
165 170 175 

Asp Lys Thr Val Gly Arg Arg Asn Thr Phe lie Gly Thr Pro Tyr Trp 
180 185 190 

Men Ala Pro Glu Val He Ala Cys Asp Glu Ser Pro Giu Ala Thr Tyr 
195 200 205 



Asp Ser Arg Ser Asp Leu Trp Ser Leu Gly lie Thr Aia Leu Glu Me:: 
210 . 215 220 
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Ala Glu Giy His Pro Pro Leu Cys Asp Met His Pro Met Arg Ala Leu 

230 235 240 



Phe Leu lie Pro Arg Asn Pro Pro Pro Lys Leu Lys Arg Asn Lvs Lys 

250 255 
Trp Thr Lys Lys Phe Glu Thr Phe He Glu Thr Vai Leu Val Lys Asp 



Tyr His Gin Arg Pro Tyr Thr Gly Ala Leu 
275 280 



Leu Arg His Pro Phe He 
285 



"""^ ""^'^ ""^^ ""^-^ '-^^ ^-^5 His Ser He Lys Glu His 
230 295 

lie Asp Arg Asn Arg Arg Val Lys Lys Asp Asp Ala Aso Tvr Glu Tyr 

315 ' 320 

Ser Giy Ser Glu Asp Asp Glu Pro Ser Pro Asn Asn Arg Asp Asp Ser 
325 330 

Glu Ser Ser Ser Met He Pro Met Asp Asa Thr Leu Arg Lys Gly Phe 

345 350 

Gin Lys Leu Gin Glu Ser Ser Arg Gly Phe Ala Glu Pro Gly Ala Gin 

360 



Gin Leu Arg Arg Leu Pro Gin Gin Pro Ala Pro Ala Pro Phe Gin Tyr 

375 380 

Gin Gin Ser Arg Tyr Val Glu Pro Arg Arg Glu Ser Ser Glu Val Lys 

395 400 
Leu Arg Ala Val Ser Ser Arg Gly Ala Ala Asp Gly Pro Arg His Ser 



405 



415 



Pro Ala Ser Arg Pro Arg Pro Arg Ser Pro Gin Gin Ser His Pro Ala 
^20 425 430 

Ala Pro His Leu Ala Asp Leu Ala Asn Tyr Glu Lys Arg Arg Arg Ser 
^35 440 445 

Glu Arg Glu Glu Arg Arg Glu Arg Glu Arg Gin Ala His H.s Ala Me^ 

^55 450 

Pro He Ala Arg Val Ser Ala Ser Val Pro Ala Pro Gin Gin Ser Arg 

Lys Met Ser Giu Pro Leu 



'^^^ ^le Thr His Val Lys Pro Giu Asp Leu 
490 



Asp Val Leu Ala Ser Glu L 
500 



eu ^er Lys Mer Gly Giy His His Asn Giv 



Arg Ser Arg Giu Glu Ser Met Ser Pro Pro Pro Pro Al 



a Pro Pro ?i 
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515 520 525 

Arg Glu Ala Ser lie Ser Ser He Thr Asp Thr He Asp Vai Gly Glu 
530 535 540 

Leu Asp Asn Gly Aia Asp Ala Glu Trp Asp Asp Leu Lys Asp He Met 
545 55C ' 555 560 

Met Asn Gly Glu Gly Thr Leu Arg Gly Pro Asn Lys Pro Leu Pro Pro 
565 570 575 

Thr Pro Thr Asp Gly Glu Asn Thr Leu Val Ser Asp Val Arg Arg Asn 
580 585 ' 590 

Gly Asn Gly Asn Ser Gly His Gly Ala Tyr Lys Gly Lys Lys He Pro 
595 600 605 

Glu He Arg Pro Gly He He Ser Leu Asp Asp Asp Asp Ser Asp Ser 
610 615 620 

Asp Asn Glu Glu Gly Asn Glu Pro Leu Met Phe Lys Pro He Val Arg 
625 630 635 . 640 



Cys Pro Phe Ser He Phe Phe Trp Phe Leu Ser Ala Asn Vai He His 
645 650 655 

Ser Val Asp Gly Ser He Pro Leu Val Lys His Leu He Trp Phe Gin 
660 665 670 

Asn Ala Ser Ser Ser Arg Gly Ala Leu Pro Asp Leu Leu Pro Lys Ser 
675 680 685 

Pro Asp Leu Arq Arg Gin He Asn Asp Gin Thr Arg Gin Met Ser Asp 
690 695 700 

Asp Arg Ala Asp Glu Gin Pro Asn Gly Phe Gin Asn Ser Asp Ser Arg 
705 710 715 720 

Ser Ser He Gin His Ser Phe Ser Asn Arg Asp Arg Glu Lys Ser Phe 
725 730 735 

Val Gly Tyr Phe Gly Gly Gly Ala Gly Ala Gly Gly Gly Thr Vai Asn 
740 745 ' 750 

Arg Pro Gly Arg Pro Gin Asp He Asn Gin Val Gin Vai Asn Vai Thr 
755 760 765 

Pro Asn Ser Asn Gly Thr Pro Ala Glu Asn Asp Aia Pro Glu He Arg 
770 775 780 

Lys Tyr Lys Lys Lys Phe Ser Gly Glu He Leu Cys Ala Ala Leu Trp 
735 790 795 800 

Gly Val Asn Leu Leu He Gly Thr Asp Ser Gly Leu Met Leu Leu Asp 
805 810 815 
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Arc Ser Giy Gin Giy hys Vai ryr Pro Leu lie Ser Arg Arg Arg ?he 

320 825 ^ 830 

Asp Gin Met Thr Vai Leu Giu Giy Gin Asn lie Leu Ala Thr lie Ser 

835 840 845 

Giy Arg Lys Arg Arg lie Arg Vai Tyr Tyr Leu Ser Trp Leu Arg Gin 

850 855 860 

Lys lie Leu Arg Thr Giu Giy Ala Giy Ser Aia Asn Thr Thr Giu Lys 

865 870 875 880 

Arg Asn Giy Trp Vai Asn Vai Giy Asp Leu Gin Giy Aia lie His Phe 

385 890 895 

Lys lie Vai Arg Tyr Giu Arg lie Lys Phe Leu Vai Vai Giy Leu Giu 

900 905 910 



Ser Ser lie Giu lie Tyr Aia Trp Ala Pro Lys Pro Tyr His Lys Phe 
915 920 925 

Met Ser Phe Lys Ser Phe Giy Ser Leu Ser His Vai Pro Leu lie Vai 
930 935 940 

Asp Leu Thr Vai Giu. Asp Asn Aia Arg Leu Lys Vai Leu Tyr Giy Ser 
945 950 955 960 

Thr Giy C y Phe His Aia lie Asp Leu Asp Ser Ala Aia Vai Tyr Asp 
965 970 975 

lie Tyr Thr Pro Aia Gin Ser Giy Gin Thr Thr Thr Pro His Cys lie 
980 985 990 

Vai Vai Leu Pro Asn Ser Asn Giy Met Gin Leu Leu Leu Cys Tyr Asp 
995 1000 1005 

Asn Giu Giy Vai Tyr Vai Asn Thr Tyr Giy Arg Met Thr Lys Asn Vai . 
1010 1015 1020 

Vai Leu Gin Trp Giy Giu Met Pro Ser Ser Vai Ala Tyr lie Ser Thr 
1025 1030 1035 1040 

Giy Gin lie Met Giy Trp Giy Asn Lys Aia lie Giu lie Arg Ser Vai 
1045 1050 1055 

Asp Thr Giy His Leu Asp Giy Vai Phe Met His Lys Lys Ala Gin Lys 
1060 1065 1070 

Leu Lys ?he Leu Cys Giu Arg Asn Asp Lys Vai Phe Phe Ser Ser Aia 
1075 1030 1085 

Lys Giy Giy Giy Ser Cys Gin lie Tyr Phe Met Thr Leu Asn Lys Pro 
1090 1095 1100 
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Giy Leu Thr Asr. Trp 
1105 



<2iO> 89 

<211> 1233 

<212> PRT 

<213> HIK_m 

<400> 89 

Met Ala Asn Asp Ser Pro Ala Lys Ser Leu Val Asp lie Asp Leu Ser 
1 5 .10 15 

Ser Leu Arg Asp Pro Ala Gly He Phe Glu Leu Vai Glu Vai Val Gly 
20 25 30 

Asn Gly Thr Tyr Giy Gin Val Tyr Lys Gly Arg His Vai Lys Thr Val 
35 40 45 

. Thr Ala Ala He Lys Val Met Asp Val Thr Glu Asp Glu Giu Glu Glu 
50 55 60 

He Thr Leu Glu He Asn Met Leu Lys Lys Tyr Ser His His Arg Asn 
65 ^ 70 75 80 

He Ala Thr Tyr Tyr Gly Ala Phe He Lys Lys Ser Pro Pro Gly His 
85 . 90 95 

Asp Asp Gin Leu Trp Leu Val Met Glu Phe Cys Gly Ala Giy Ser He 
100 105 " ' 110 

Thr Asp Leu Val Lys Asn Thr Lys Gly Asn Thr Leu Lys Glu Asp Trp 
115 120 125 

He Ala Tyr He Ser Arg Glu He Leu Arg Gly Leu A.la His Leu His 
130 135 140 

He His His Vai He His Arg Asp He Lys Gly Gin Asn Val Leu -Leu 
145 150 155 160 

Thr Glu Asn Ala Glu Val Lys Leu Val Asp Phe Gly Val Ser Ala Gin 
165 170 175 

Leu Asp Arg Thr Val Giy Arg Arg Asn Thr Phe He Gly Thr Pro Tyr 
130 185 190 

Trp Met Ala Pro Glu Vai He Ala Cys Asp Glu Asn Pro Asp Ala Thr 
195 200 205 

Tyr Asp Tyr Arg Ser Asp Leu Trp Ser Cys Gly He Thr Ala He Glu 
210 215 220 

Met. A.la Glu Gly Gly Pro Pro Leu Cys Asp Met His Pro Met Arg Ala 



BNSDOCID: <WO 9953036A2_L> 



WO99/53036 



PCT/US99/08150 



225 230 

Leu Phe Leu lie Pro Arc Asn Pro 
245 

Trp Ser Lys Lys Phe Phe Ser Phe 
260 

Tyr Met Gin Arg Pro Ser Thr Glu 

275 280 



37 

235 240 

Pro Pro Arg Leu Lys Ser Lys Lys 
250 255 

lie Giu Gly Cys Leu Vai Lys Asn 
265 270 

Gin Leu Leu Lys His Pro Phe lie 
285 



Arg Asp Gin i-ro Asn Giu Arg Gin Vai Arg lie Gin Leu Lys Asp His 
290 295 300 

lie Asp Arg Thr Arg Lys Lys Arg Giy Giu Lys Asp Giu Thr Giu Tyr 
305 310 315 ' 320 

Giu Tyr Ser Giy Ser Giu Giu Giu Giu Giu Giu Vai Pro Giu Gin Giu 
325 330 335 

Giy Giu Pro Ser Ser lie Vai Asn Vai Pro Giy Giu Ser Thr Leu Arg 
340 345 350 

Arg Asp Phe Leu Arg Leu Gin Gin Giu Asn Lys Giu Arg Ser Giu Aia 
355 360 365 

Leu Arg Arg Gin Gin Leu Leu Gin Giu Gin Gin Leu Arg Giu Gin Giu 
370 375 380 

Giu Tyr Lys Arg Gin Leu Leu Aia Giu Arg Gin Lys Arg lie Giu Gin 
385 390 395 400 

Gin Lys Giu Gin Arg Arg Arg Leu Giu Giu Gin Gin Arg Arg Giu Arg 
405 410 415 

Giu Aia Arg Arg Gin Gin Giu Arg Giu Gin Arg Arg Arg Giu Gin Giu 
420 425 430 

Giu Lys A.rg Arg Leu Giu Giu Leu Giu Arg Arg Arg Lys Giu Giu Giu 
435 440 445 

Giu Arg Arg Arg Aia Giu Giu Glu Lys Arg Arg Vai Giu Arg Giu Gin 
450 455 460 

Giu Tyr lie Arg Arg Gin Leu Glu Giu Glu Gin Arg His Leu Giu lie 
^65 470 475 480 

Leu Gin Gin Gin Leu Leu Gin Giu Gin Aia Met Leu Leu His Asp His 
485 490 495 

Arg Arg Pro His Aia Gin Gin Gin Pro Pro Pro Pro Gin Gin Gin Asp 
500 505 510 

Arg Ser Lys Pro Ser Phe His Aia Pro Giu Pro Lys Pre His Tyr Asp 
515 520 525 
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Pro Ala Asp Arg Aia Arg Glu Val Gin Trp 3er His Leu Ala Ser Leu 

530 535 ' 540 

Lys Asn Ash Val Ser Pro Val Ser Arg Ser His Ser Phe Ser Asp Pro 

545 550 555 ' 560 



Ser Pro Lys Phe Aia His His His Leu Arg Ser Gin Asp Pro Cys Pro 
565 570 575 

Pro Ser Arg Ser Glu Giy Leu Ser Gin Ser Ser Asp Ser Lys Ser Glu 
580 585 590 

Val Pro Glu Pro Thr Gin Lys Aia Trp Ser Arg Ser Asp Ser Asp Glu 
595 600 605 

Val Pro Pro Arg Val Pro Val Arg Thr Thr Ser Arg Ser Pro Val Leu 
610 615 620 

Ser Arg Arg Asp Ser Pro Leu Gin Giy Gly Giy Gin Gin Asn Ser Gin 
625 630 635 640 

Ala Gly Gin Arg Asn Ser Thr Ser Ser lie Glu Pro Arg Leu Leu Trp 
645 650 655 

Glu Arg Val Glu Lys Leu Val Pro Arg Pro Giy Ser Gly Ser Ser Ser 
660 665 670 

Gly Ser Ser Asn Ser Gly Ser Gin Pro Giy Ser His Pro Giy Ser Gin 
675 680 685 

Ser Gly Ser Giy Glu Arg Phe Arg Val Arg Ser Ser Ser Lys Ser Glu 
690 695 700 

Giy Ser Pro Ser Pro Arg Gin Glu Ser Ala Aia Lvs Lvs Pro Asp Asp 
^05 710 715 ' ^ 720 

Lys Lys Glu Val Phe Arg Ser Leu Lys Pro Ala Gly Glu Vai Asp Leu 
725 730 735 

Thr Ala Leu Ala Lys Glu Leu Arg Ala Vai Glu Asp Val Arg Pro Pro 
740 745 750 

His Lys Val Thr Asp Tyr Ser Ser Ser Ser Glu Glu Ser Gly Thr Thr 
755 760 765 

Asp Glu Glu Giu Giu Asp Val Glu Gin Glu Gly Ala Asp Asp Ser Thr 
770 775 780 

Ser Gly Pro Giu Asp Thr Arg Aia Ala Ser Ser Pro Asn Leu Ser Asn 
785 790 795 800 

Gly Glu Thr Giu Ser Vai Lys Thr Met lie Val His Asp Asp Val Giu 
805 31C 815 
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Ser Glu Pro Ala Met 
820 



Thr Pro Ser Lys Glu Gly 
825 



?hr Leu lie Val Arc 
830 



Gin Thr Gin Ser Ala Ser Ser Thr Leu Gin Lys His Lys Ser Ser Ser 
835 840 845 

Ser Phe Thr Pro Phe He Asp Pro Arg Leu Leu Gin lie Ser Pro Ser 
850 855 860 

Ser Gly Thr Thr Val Thr Ser Val Val Gly Phe Ser Cys Asp Gly Leu 
865 870 875 ' 880 

Arg Pro Glu Ala lie Arg Gin Asp Pro Thr Arg Lys Gly Ser Val Val 
885 . 890 " ' 395 

Asn Val Asn Pro Thr Asn Thr Arg Pro Gin Ser Asp Thr Pro Glu He 
900 905 910 

Arg Lys Tyr Lys Lys Arg Phe Asn Ser Glu lie Leu Cys Ala Ala Leu 
915 920 925 

Trp Gly Val Asn Leu Leu Val Gly Thr Glu Ser Gly Leu Met Leu Leu 
930 935 940 

Asp Arg Ser Gly Gin Gly Lys Val Tyr Pro Leu He Ser Arg Arg Arg 
945 ^ 950 955 960 

Phe Gin Gin Met Asp Val Leu Glu Gly Leu Asn Val Leu Val Thr He 
965 970 975 

Ser Gly Lys Lys Asp Lys Leu Arg Val Tyr Tyr Leu Ser Trp Leu Arg 
980 985 990 

Asn Lys He Leu His Asn Asp Pro Glu Val Glu Lys Lvs Gin Glv Trp 
995 1000 ^ 1005 

Thr Thr Val Gly Asp Leu Glu Gly Cys Val His Tyr Lys Val Val Lys 
1010 1015 1020 

Tyr Glu Arg He Lys Phe Leu Val He Ala Leu Lys Ser Ser Val Glu 
1025 1030 1035 ' 1040 

Val Tyr Ala Trp Ala Pro Lys Pro Tyr His Lys Phe Met Ala Phe Lys 
1045 1050 1055 

Ser Phe Gly Glu Leu Leu His Lys Pro Leu Leu Val Asp Leu Thr Val 
1060 1065 ^ 1070 

Glu Glu Gly Gin Arg Leu Lys Val He Tyr Gly Ser Cys Ala Giv Phe 
1075 1080 " 1035 

His Ala Val Asp Val Asp Ser Gly Ser Val Tyr Aso He Tyr Leu Pro 
1090 1095 " HOC 
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Thr His lie Gin Cys Ser lie Lys Pro His Aia lie lie lie Leu Pro 

1105 1110 1115 1120 

Asn Thr Asp Giy Met Giu Leu Leu Vai Cys Tyr Giu A.sp Giu Giy Val 
1125 1130 1135 

Tyr Val Asn Thr Tyr Giy Arg lie Thr Lys Asp Vai Vai Leu Gin Trp 
1140 1145 1150 

Giy Giu Met Pro Thr Ser Vai Aia Tyr lie Arg Ser Asn Gin Thr Met 
1155 1160 1165 

Giy Trp Giy Giu Lys Aia lie Giu lie Arg Ser Vai Giu Thr Giy His 
1170 1175 1180 

Leu Asp Giy Vai Phe Met His Lys Arg Aia Gin Arg Leu Lys Phe Leu 
1185 1190 1195 1200 

Cys Giy Arg. Asn Asp Lys Vai Phe Phe Ser Ser Vai Arg Ser Giy Giy 
1205 1210 1215 

Ser Ser Gin Vai Tyr Phe Met Thr Leu Giy Arg Thr Ser Leu Leu Ser 
1220 1225 . 1230 . 

Trp 



. <210> 90 

<211> 982 

<212> PRT 

<213> SULU_ce 

<400> 90 

Met Aia Pro Aia Vai Leu Asp Lys Pro Giy Val lie Lys Asp Pro Ser 
15 10 15 

lie Aia Aia Leu Phe Ser Asn Lys Asp Pro Giu Gin Arg Tyr Gin Asp 
20 25 30 

Leu Arg Giu lie Giy His Giy Ser Phe Giy Aia Vai Tyr Phe Aia Tyr 
35 40 45 

Asp Lys Lys Asn Giu Gin Thr Val Ala lie Lys Lys Met Asn Phe Ser 
50 55 60 

Giy Lys Gin Aia Vai Giu Lys Trp Asn Asp lie Leu Lys Giu Val Ser 
65 70 75 80 

Phe Leu Asn Thr Vai Vai His Pro His lie Val Asp Tyr Lys Aia Cys 
85 90 ' ' 95 

Phe Leu Lys Asp Thr Thr Cys Trp Leu Vai Met Giu Tyr Cys lie Giy 
100 105 110 
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Ser Ala kia Asp lie Val Asp Vai Leu Arg Lys Gly Mer A.rg Glu Vai 
115 120 ' 125 

Glu lie Ala Ala He Cys Ser Gin Thr Leu Asp Ala Leu Arg Tyr Leu 
130 135 * 140 

His Ser Leu Lys Arg lie His Arg Asp He Lys Ala Gly Asn He Leu 
145 150 155 160 

Leu Ser Asp His Ala He Val Lys Leu Ala Asp Phe Gly Ser Ala Ser 
165 170 175 

Leu Val Asp Pro Ala Gin Thr Phe He Gly Thr Pro Phe Phe Met Ala 
180 185 190 

Pro Glu Val He Leu Ala Met Asp Glu Gly His Tyr Thr Aso Arg Ala 
195 200 205 

Asp He Trp Ser Leu Gly He Thr Cys He Glu Leu Ala Glu Arg Arc 
210 215 220 

Pro Pro Leu Phe Ser. Met Asn Ala Met Ser Ala Leu Tyr His He Ala 
225 230 235 240 

Gin Asn Asp Pro Pro Thr Leu Ser Pro He Asp Thr Ser Glu Gin Pro 
245 250 255 

Glu Trp Ser Leu Glu Phe Val Gin Phe He Asp Lys Cys Leu Arg Lys 
260 265 ' 270 

Pro Ala Glu Glu Arg Met Ser Ala Glu Glu Cys Phe Arg His Pro Phe 
275 280 285 

He Gin Arc Ser Arg Pro Ser Asp Thr He Gin Glu Leu He Gin Arg 
290 295 300 

Thr Lys Asn Met Val Leu Glu .Leu Asp Asn Phe Gin Tyr Lys Lvs Met 
305 310 315 ' 320 

Arg Lys Leu Met Tyr Leu Asp Glu Thr Glu Gly Lys Glu Gly Ser Glu 
325 330 " ^ 335 

Gly Asn Gly Ala Ser Asp Asp Leu Asp Phe His Gly Asn Glu Ala Asn 
340 345 " 350 

Ser He Gly Arg Ala Gly Asp Ser Ala Ser Ser Arg Ser Ala Ser Leu 
355 360 365 

Thr Ser Phe Arg Ser Mer Gin Ser Ser Gly Gly Ala Gly Leu Leu Val 
370 375 330 

Ser Thr Asn Thr Thr Gly Ala Met Asp Asn Val His Gly Ser Ser Gly 
335 390 395 40C 

Tyr Gly Asn Gly Ser Ser Ser Thr Thr Ser Ser Ala Arg A.rg Arg Pro 
405 410 " 415 
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Pro lie Pro Ser Gin Met Leu Ser Ser Thr Ser Thr Ser Giy Vai Gly 
420 425 430 

Thr Met Pro Ser His Gly Ser Vai Gly Ala Ser lie Thr Ala lie Ala 
435 440 445 

Vai Asn Pro Tor Pro Ser Pro Ser Glu Pro lie Pro Thr Ser Gin Pro 
450 455 460 

Thr Ser Lys Ser Glu Ser Ser Ser lie Leu Glu Thr Ala His Asp Asp 
465 470 475 480 

Pro Leu Asp Thr Ser lie Arg Ala Pro Vai Lys Asp Leu His Met Pro 
485 490 495 

His Arg Ala Vai Lys Glu Arg lie Ala Thr Leu Gin Asn His Lys Phe 
500 505 510 

Ala Thr Leu Arg Ser Gin Arg lie lie Asn Gin Glu Gin Giu Glu Tyr 
515 520 525 

Thr Lys Giu Asn Asn Met Tyr Glu Gin Met Ser Lys Tyr Lys His Leu 
530 535 540 

Arg Gin Ala His His Lys Glu Leu Gin Gin Phe Glu Giu Arg Cys Ala 
545 550 555 560 

Leu Asp Arg Glu Gin Leu Arg Vai Lys Met Asp Arg Glu Leu Giu Gin 
565 570 . 575 

Leu Thr Thr Thr Tyr Ser Lys Glu Lys Met Arg Vai Arg Cys Ser Gin 
580 585 590 

Asn Asn Glu Leu Asp Lys Arg Lys Lys Asp lie Giu Asp Giy Glu Lys 
595 600 605 

Lys Met Lys Lys Thr Lys Asn Ser Gin Asn Gin Gin Gin Met Lys Leu 
610 615 620 

Tyr Ser Ala Gin Gin Leu Lys Giu Tyr Lys Tyr Asn Lys Glu Ala Gin 
625 630 635 640 



Lys Thr Arg Leu Arg Ser Leu Asn Met Pro Arg Ser Thr Tyr Giu Asn 
645 650 655 

Ala Met Lys Glu Vai Lys Ala Asp Leu Asn Arg Vai Lys Asp Ala Arg 
660 665 670 

Glu Asn Asp Phe Asp Giu Lys Leu Arg Ala Giu Leu Giu Asp Giu lie 

675 680 685 

Vai Arg Tyr Arg Arg Gin Gin Leu Ser Asn Leu His Gin Leu Glu Giu 
690 695 700 
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Gin Leu Asp Asp Glu Asp Val Asn Val Gin Glu Arg Gin Met Asp Thr 
705 710 715 * 720 

Arc His Gly Leu Leu Ser Lys Gin His Glu Met Thr Arg Asp Leu Glu 
"725 730 ^ 735 

lie Gin His Leu Asn Glu Leu His Ala Met Lys Lys Arg His Leu Glu 
740 745 750 

Thr Gin His Glu Ala Glu Ser Ala Ser Gin Asn Glu Tyr Thr Gin Arg 
755 760 765 

Gin Gin Asp Glu Leu Arg Lys Lys His Ala Met Gin Ser Arg Gin Gin 
770 775 780 

Pro Arg Asp Leu Lys He Gin Glu Ala Gin He Arg Lys Gin Tyr Arg 
785 790 ^ 795 ^ 800 

Gin Val Val Lys Thr Gin Thr Arg Gin Phe Lys Leu Tyr Leu Thr Gin 
805 810 815 

Met Val Gin Val Val Pro Lys Asp Glu Gin Lys Glu Leu Thr Ser Arg 
820 825 830 

Leu Lys Gin Asp Gin Met Gin Lys Val Ala Leu Leu Ala Ser Gin Tyr 
835 840 845 

Glu Ser Gin He Lys Lys Met Val Gin Asp Lys Thr Val Lys Leu Glu 
850 855 860 

Ser Trp Gin Glu Asp Glu Gin Arg Vai Leu Ser Glu Lys Leu Glu Lys 
865 870 875 880 

Glu Leu Glu Glu Leu lie Ala Tyr Gin Lys Lys Thr Arg Ala Thr Leu 
885 890 895 

Glu Glu Gin He Lys Lys Glu Arg Thr Ala Leu Glu Glu Arg He Gly 
900 905 910 



Thr Arg Arg Ala Met Leu Glu Gin Lys He He Glu Glu Arg Glu Gin 
915 920 925 

Met Gly Glu Met Arg Arg Leu Lys Lys Glu Gin He Ara Asd Arg His 
930 935 940 

Ser Gin Glu Arg His Arg Leu Glu Asn His Phe Val Arg Thr Gly S^r 

950 955 960 

Thr Ser Arg Ser Ser Giy Gly He Ala Pro Gly Vai Gly Asn Ser Ser 
965 970 975 

Ser He Gin Mer Ala Met 
980 
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<210> 91 
<211> 842 
<212> PRT 
<213>' KHSl_h 

<400> 91 



Leu Arg Pro Ala Ala 
1 5 

Giu Leu Val Gin Arg 
20 

Ala Arg Asn Val His 
35 

Leu Glu Pro Gly Asp 
50 

Val Lys Glu Cys Lys 
65 

Leu Ser Arg Glu Lys 
85 

Ser Leu Gin Asp lie 
100 

lie Ala Tyr Val Cys 
115 

Thr Lys Gly Lys Met 
130 



Asp lie Leu Arg Arg Asn 
10 

Val Gly Ser Gly Thr Tyr 
25 

Thr Gly Glu Leu Ala Ala 
40 

Asp Phe Ser Leu lie Gin 
55 

His Cys Asn lie Val Ala 
70 75 

Leu Trp lie Cys Met Glu 
90 

Tyr His Val Thr Gly Pro 
105 

Arg Glu Thr Leu Gin Gly 
120 

His Arg Asp lie Lys Gly 
135 



Pro Gin Gin Asp Tyr 
15 

Gly Asp Val Tyr Lys 
30 

Val Lys lie lie Lys 
45 

Gin Glu lie Phe Met 
60 

Tyr Phe Gly Ser Tyr 
80 

Tyr Cys Gly Gly Gly 
95 

Leu Ser Giu Leu Gin 
110 

Leu Ala Tyr Leu His 
125 

Ala Asn lie Leu Leu 
140 



Thr Asp His Gly 
145 

lie Thr Ala Thr 



Trp Met Ala Pro 
180 

Gin Leu Cys Asp 
195 

Glu Leu Gin Pro 
210 

Leu Met Ser Lys 
225 

Lys Trp Ser Ser 



Asp Val Lys Leu 
150 

lie Ala Lys Arg 
165 

Glu Val Ala Ala 



lie Trp Ala Val 
200 

Pro Met Phe Asp 
215 

Ser Asn Phe Gin 
230 

Thr Phe His Asn 
245 



Ala Asp Phe Gly 
155 

Lys Ser Phe lie 
170 

Val Glu Lys Asn 
185 

Gly lie Thr Ala 



Leu His Pro Met 
220 

Pro Pro Lys Leu 
235 

Phe Val Lys lie 
250 



Val Ala Ala Lys 
160 

Gly Thr Pro Tyr 
175 

Gly Gly Tyr Asn 
190 

lie Glu Leu Gly 
205 

Arg Ala Leu Phe 



Lys Asp Lys Thr 
240 

Ala Leu Thr Lys 
255 
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Asn Pro Lys Lys Arg Pro Thr Ala Giu Arg Leu Leu tHk m- 

265 270 
Val Ala Gin Pro Gly Leu Ser Arg Ala Leu Ala Va^ ri. - 

280 285 

Lys v,l p„ 



300 

Asp Asp Phe Glu Pro His Ala He He Ara Hie; tk.- . 
305 -..n ^ ^^^^ -^^^ Ser Thr 

■^-^^ 320 

Asn Arg Asn Ala Arg Ala Glu Arg Thr Ala S^r ri,, - . 

325 ^ ^^'^ --'-e Asn Phe Asd 

335 

.ys Leu cm Pje CIu P.o P.o .eu Lys Clu Th. CIu' Ala A., .sp 

Giu Met Giy Leu Ser Ser Asp Pro Asn Phe Met Leu 

36 

Phe Val Asp Gly Ala Asn Thr Gl 



355 ""^"^ -"''^ "^'^ Gin Trp Asn Pro 

365 



370 „c ^h"^ Ser Lys Arg Ala He 



375 

Pro Pro Pro Leu Pro Pro Lys Pro Ara Ilf. c:<=^ c; 

385 ^ ^^'^ Ser Tyr Pro Glu Asp 

400 



380 

395 



Asn Phe Pro Asp Glu Glu Lys Ala Ser Thr He Lys H.s Cys Pro Asp 



415 



Ser Glu Ser Arg Al 



a Pro Gin He Leu Arg Arg Gin 



420 425 



Ser Ser Pro Se 



430 



Cy= Giv P.O V.X r.r S„ S„ XI, Oly I.. 

Ser Lys Leu Met Ser Glu Asn Thr Glu Glv Ser Al^ rm ^, o 

450 .irc; ^ A-l-a Pro Gin 

460 

Leu Pro Arg Lys Asn Asp Lys Ara Asd Phf. p^^ t 

4 65 ^ ^ ^® ^"^^ Ala He Asn 
Gly Leu Pro Pro Thr Pro Lys Val Leu Met Gly Al 



485 ,on 

4 95 

Val Phe Asp Gly Cys Pro Leu Lys He Asn rw<= ;m ^. 

500 ^^i^ Tnr Ser Trp He 

i05 510 

His Pro Asp Thr Lys Asp Gin Tyr He He Phe -Iv Th^ r^ . 

515 con '^-^ Asp Gly 

525 

II. Ty. T.. I.,. .eu J,„ Clu Hi. =1„ 

^-^^ 540 
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Leu Phe Pro Arg Lys Cys Thr Trp Leu Tvr Val lie Asn Asn Thr Leu 

5"*^ 550 555 560 

Met Ser Leu Ser Glu Giy Lys Thr Phe Gin Leu Tyr Ser His Asn Leu 
565 570 575 

lie Ala Leu Phe Glu His Ala Lys Lys Pro Gly Leu Ala Ala His lie 
580 585 590 

Gin Thr His Arg Phe Pro Asp Arg lie Leu Pro Arg Lys Phe Ala Leu 
595 600 605 

Thr Thr Lys lie Pro Asp Thr Lys Gly Cys His Lys Cys Cys lie Val 
610 615 620 

Arg Asn Pro Tyr Thr Gly His Lys Tyr Leu Cys Gly Ala Leu Gin Ser 
625 630 635 640 

Gly He Val Leu Leu Gin Trp Tyr Glu Pro Met Gin Lys Phe Met Leu 
645 650 655 

He Lys His Phe Asp Phe Pro Leu Pro Ser Pro Leu Asn Val Phe Glu 
660 665 670 

Met Leu Val He Pro Glu Gin Glu Tyr Pro Met Val Cys Val Ala He 
675 680 685 

Ser Lys Gly Thr Glu Ser Asn Gin Val Val Gin Phe Glu Thr He Asn 
690 695 700 

Leu Asn Ser Ala Ser Ser Trp Phe Thr Giu He Gly Ala Gly Ser Gin 
"705 710 715 720 

Gin Leu Asp Ser He His Val Thr Gin Leu Glu Arg Asp Thr Val Leu 
725 730 735 

Val Cys Leu Asp Lys Phe Val Lys He Val Asn Leu Gin Gly Lys Leu 
"7^0 745 750 

Lys Ser Ser Lys Lys Leu Ala Ser Glu Leu Ser Phe Asp Phe Arg He 
755 760 765 

Glu Ser Val Val Cys Leu Gin Asp Ser Val Leu Ala Phe Trp Lys His 
"770 775 780 

Gly Met Gin Giy Lys Ser Phe Lys Ser Asp Glu Val Thr Gin Glu He 
765 790 795 800 

Ser Asp Glu Thr Arg Val Phe Arg Leu Leu Gly Ser Asp Arg Val Val 
805 810 ^ 815 

Val Leu Glu Ser Arg Pro Thr Glu Asn Pro Thr Ala His Ser Asn Leu 
820 825 830 

Tyr He Leu Ala Gly His Glu Asn' Ser . Tyr 
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840 



<210> 92 
<211> 911 
<212> PRT 
<213> LOK m 



<400> 92 

Met Ala Phe Ala Asn Phe Arg Arg He Leu Arg Leu Ser Thr Phe Glu 



10 15 



Lys Arg Lys Ser Arg Glu Tyr Glu His Val Arg Arg Asp Leu Asp Pro 

25 30 

Asn Asp Val Trp Glu He Val Gly Glu Leu Gly Asp Gly Ala Phe Gly 
Lys val Tyr Lys Ala Lys Asn Lys Glu Thr Gly Ala Leu Ala Ala Ala 



60 



Lys Val lie Glu Thr Lys Ser Glu Glu Glu Leu Glu Aso Tyr He Val 

75 " 80 

Glu He Glu He Leu Ala Thr Cys Asp His Pro Tyr He Val Lys Leu 



85 90 



95 



Leu Gly Ala Tyr Tyr Tyr Asp Gly Lys Leu Trp He Met He Glu 



100 



105 



Phe 



110 



Cys Pro Gly Gly Ala Val Asp Ala He Met Leu Glu Leu Asp Arg Gly 



125 

Ala 



Leu Thr Glu Pro Gin He Gin Val Val Cys Arg Gin Met Leu Glu 

Leu Asn Phe Leu His Gly Lys Arg He He His Arg Asp Leu Lys Al 

150 155 

Gly Asn val Leu Met Thr Leu Glu Gly Asp He Arg Leu Ala Asp Phe 



a 
60 



175 



Gly val Ser Ala Lys Asn Leu Lys Thr Leu Gin Lys Arg Asp Ser Phe 

185 190 

He Gly Thr Pro Tyr Trp Met Ala Pro Glu Val Val Leu Cys Glu Thr 

200 205 



Met Lys Asp Ala Pro Tyr Asp Tyr Lys Ala Asp He Trp Ser Leu Gly 

220 

lie Thr Leu He Glu Met Ala Gin He Glu Pro Pro His His- Glu Leu 



230 



235 



240 
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Asn Pro Met Arg Val Leu Leu Lys lie Ala Lys Ser Asp Pro Pro Thr 
245 250 255 

Leu Leu Thr Pro Ser Lys Trp Ser Val Glu Phe Arg Asp Phe Leu Lys 
260 265 270 

lie Ala Leu Asp Lys Asn Pro Glu Thr Arg Pro Ser Ala Ala Gin Leu 
275 280 285 

Leu Gin His Pro Phe Val Ser Arg Val Thr Ser Asn Lys Ala Leu Arg 
290 295 300 

Glu Leu Val Ala Glu Ala Lys Ala Glu Val Met Glu Glu lie Glu Asp 
305 310 315 320 

Gly Arg Glu Asp Gly Glu Glu Glu Asp Ala Val Asp Ala Val Pro Pro 
325 330 ' 335 



Leu Val Asn His Thr Gin Asp Ser Ala Asn Val Thr Gin Pro Ser Leu 
340 345 350 

Asp Ser Asn Lys Leu Leu Gin Asp Ser Ser Thr Pro Leu Pro Pro Ser 
355 360 365 

Gin Pro Gin Glu Pro Val Asn Gly Pro Cys Ser Gin Pro Ser Gly Asp 
370 375 380 

Gly Pro Leu Gin Thr Thr Ser Pro Ala Asp Gly Leu Ser Lys Asn Asp 
385 390 395 400 

Asn Asp Leu Lys Val Pro Val Pro Leu Arg Lys Ser Arg Pro Leu Ser 
405 410 415 

Mez Asp Ala Arg lie Gin Met Asp Glu Glu Lys Gin He Pro Asp Gin 
420 425 430 

Asp Glu Asn Pro Ser Pro Ala Ala Ser Lys Ser Gin Lys Ala Asn Gin • 
435 440 445 

Ser Arg Pro Asn Ser Ser Ala Leu Glu Thr Leu Gly Gly Glu Ala Leu 
450 455 460 

Thr Asn Gly Gly Leu Glu Leu Pro Ser Ser Val Thr Pro Ser His Ser 
465 470 475 480 

Lys Arg Ala Ser Asp Cys Ser Asn Leu Ser Thr Ser Glu Ser Met Asp 
485 490 495 

Tyr Gly Thr Ser Leu Ser Ala Asp Leu Ser Leu Asn Lys Glu Thr Glv 
500 505 ' 510 

Ser Leu Ser Leu Lys Gly Ser Lys Leu His Asn Lys Thr Leu Lys Arg 
515 520 525 
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Thr Arg Arg Phe Val Val Asp Gly Val Glu Val Ser He Thr th^ se- 
Lys lie lie Ser Glu Asp Glu Lys Lys Asp Glu Glu Mer Arg Phe Leu 



545 550 555 



560 



Arg Arg Gin Glu Leu Arg Glu Leu Arg Leu Leu Gin Lys Glu Glu His 
555 570 

Arg Asn Gin Thr Gin Leu Ser Ser Lys His Glu Leu Gin Leu Glu Gin 
580 585 590 

Met His Lys Arg Phe Glu Gin Glu He Asn Ala Lys Lys Lys Phe Tyr 
595 600 605 

Asp Val Glu Leu Glu Asn Leu Glu Arg Gin Gin Lys Gin Gin Val Glu 



615 



620 



Lys Met Glu Gin Asp His Ser Val Arg Arg Lys Glu Glu Ala Lys Arg 

"° "5 640 

He Arg Leu Glu Gin Asp Arg Asp Tyr Ala Lys Phe Gin Glu Gin Leu 
645 650 655 

Lys Gin Met Lys Lys Glu Val Lys Ser Glu Val Glu Lys Leu Pro Arq 
660 665 670 

Gin Gin Arg Lys Glu Ser Met Lys Gin Lys Met Glu Glu His Ser Gin 
675 680 685 

Lys Lys Gin Arg Leu Asp Arg Asp Phe Val Ala Lys Gin Lys Glu Asp 

695 700 

Leu Glu Leu Ala Met Arg Lys Leu Thr Thr Glu Asn Arg Arg Glu He 

715 

Cys Asp Lys Glu Arg Asp Cys Leu Ser Lys Lys Gin Glu Leu Leu Arg 

730 735 

Asp Arg Glu Ala Ala Leu Trp Glu Met Glu Glu His Gin Leu Gin Glu 

745 750 

Arg His Gin Leu Val Lys Gin Gin Leu Lys Asp Gin Tyr Phe Leu Gin 

760 765 

Arg His Asp Leu Leu Arg Lys His Glu Lys Glu Arg Glu Gin Met Gin 

Arg Tyr Asn Gin Arg Met Met Glu Gin Leu Lys Val Arg Gin Gin Gin 

795 800 

Glu Lys Ala Arg Leu Pro Lys He Gin Arg Ser Asp Glv Glu Thr Arq 
805 810 " 



815 



Met Ala Met Tyr Lys Lys Ser Leu His He Asn Gly Ala Gly Ser Ala 
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100 



820 



825 



830 



Ser Giu Gin Arg Giu Lys lie Lys Gin Phe Ser Gin Gin Glu Giu Lys 

835 840 845 

Arg Gin Lys Ala Glu Arg Leu Gin Gin Gin Gin Lys His Giu His Gin 

850 855 860 

Met Arg Asp Met Vai Ala Gin Cys Giu Ser Asn Met Ser Giu Leu Gin 

865 870 875 880 



Gin Leu Gin Asn Giu Lys Cys Tyr Leu Leu Vai Glu His Glu Thr Gin 
885 890 895 



Lys Leu Lys Ala Leu Asp Giu Ser His Asn Gin Ser Leu Lys Giu 
900 905 910 



<210> 93 

<2il> 545 

<212> PRT 

<213> PAKi_h 

<400> 93 

Met Ser Asn' Asn Gly Leu Asp lie Gin Asp Lys Pro Pro Ala Pro Pro 
15 10 15 



Met Arg Asn Thr 
20 

Thr Leu Asn His 
35 

Lys Lys Lys Asp 
50 

Asn Lys Lys Lys 
65 

Phe Giu His Thr 



Thr Gly Met Pro 
100 

Thr Lys Ser Giu 
115 

Giu Phe Tyr Asn 
130 



Ser Thr Met lie 



Gly Ser Lys Pro 
40 

Arg Phe Tyr Arg 
55 

Giu Lys Glu Arg 
70 

lie His Vai Gly 
85 

Glu Gin Trp Ala 



Gin Lys Lys Asn 
120 

Ser Lys Lys Thr 
135 



Gly Ala Gly Ser 
25 

Leu Pro Pro Asn 



Ser lie Leu Pro 
60 

Pro Giu lie Ser 
75 

Phe Asp Ala Vai 
90 

Arg Leu Leu Gin 
105 

Pro Gin Ala Vai 



Ser Asn Ser Gin 
140 



Lys Asp Ala Gly 
30 

Pro Giu Glu Lys 
45 

Gly Asp Lys Thr 



Leu Pro Ser Asp 
80 

Thr Gly Giu Phe 
95 

Thr Ser Asn lie 
110 

Leu Asp Vai Leu 
125 

Lys Tyr Met Ser 
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Phe Thr Asp Lys Ser Ala Glu Asp Tyr Asn Ser c;^^ a., - 
145 

160 

Val Lys Ala Val Ser Glu Thr Pro Ala Val Pro Pro Val Ser Glu Asp 

1^0 175 

Glu Asp ASP Asp Asp Asp Asp Ala Thr Pro Pro Pro Val He Ala Pro 

185 

Arg Pro Glu His Thr Lys Ser Val Tyr Thr Arg Ser Val He Glu Pro 

200 205 

Leu Pro val Thr Pro Thr Arg Asp Val Ala Thr Ser Pro He Ser Pro 

^■^^ 220 
Thr Glu Asn Asn Thr Thr Pro Pro Ac:o zii = i 

225 230 ^^^^ 

235 240 

Lys Gin Lys -Lys Lys Pro Lys Met Ser Asp Glu Glu He Leu Glu Lys 

250 255 

Leu Arg Ser He Val Ser Val Glv Asn Prn t wc: t 

260 ^ '^^^ Arg 

265 270 

Phe Glu Lys He Gly Gin Gly Ala Ser Gly Thr Val Tyr Thr Ala Met 

285 

Asp val Ala Thr Gly Gin Glu Val Ala He Lys Gin Met Asn Leu Gin 



300 

Cln Gin Pro Lys Lys Glu Leu He He Asn Glu He Leu Val Met Arg 

320 

Asn He Val Asn Tyr Leu A<.n q^>- t.,^ t ... 

325 



Glu Asn Lys Asn Pro Asn He Val Asn Tyr Leu Asp Ser Tyr Leu Val 

330 

Gly ASP Glu Leu Trp Val Val Met Glu Tyr Leu Ala Gly Gly Ser Leu 

350 

Thr ASP Val Val Thr Glu Thr Cys Met Asp Glu Gly Gin He Ala Ala 

365 

Val cys Arg Glu Cys Leu Gin Ala Leu Glu Ser Leu His Ser Asn Gin 

380 

val He His Arg Asp He Lys Ser Asp Asn He Leu Leu Gly Met Aso 



380 

Lys Ser Asp Asn lie 

390 

Gly Ser Val Lys Leu Thr Asp Phe Gly Phe Cys Ala Gin II 



405 



410 



395 400 



e Thr Pro 



415 



Glu Gin S 



r Lys Arg Ser Thr Met Val Gly Thr Pro Tyr Trp Met Ala 

430 

Pro Glu val val Thr Arg Lys Ala Tyr Gly Pro Lys Val Asp He Tro 

" 440 
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Ser Leu Gly He 
450 

Leu Asn Glu Asn 
465 

Thr Pro Glu Leu 



Phe Leu Asn Arg 
500 

Lys Glu Leu Leu 
515 

Ser Leu Thr Pro 
530 

His ' 
545 



Met Ala He Glu 
455 

Pro Leu Arg Ala 
470 

Gin Asn Pro Glu 
485 

Cys Leu Glu Met 



Gin His Gin Phe 
520 

Leu He Ala Ala 
535 



Met He Glu Gly 
460 

Leu Tyr Leu He 
475 

Lys Leu Ser Ala 
490 

Asp Val Glu Lys 
505 

Leu Lys He Ala 



Ala Lys Glu Ala 
540 



Glu Pro Pro Tyr 



Ala Thr Asn Gly 
480 

He Phe Arg Asp 
495 

Arg Gly Ser Ala 
510 

Lys Pro Leu Ser 
525 

Thr Lys Asn Asn 



<210> 94 
<211> 506 
<212> PRT 
<213> PAK65_h 

<400> 94 



Met Glu Glu Thr 
1 

His Ser Leu Lys 
20 

His Lys He He 
35 

Lys Glu Lys Glu 
50 

Thr He His Val 
65 

Pro Glu Gin Trp 



Glu Gin Lys Lys 
100 

Asp Ser Asn Thr 
115 



Gin Gin Lys Ser 
5 

Pro Leu Pro Ser 



Ser He Phe Ser 
40 

Arg Pro Glu He 
55 

Gly Phe Asp Thr 
70 

Ala Arg Leu Leu 
85 

Asn Pro Gin Ala 



Val Lys Gin Lys 
120 



Asn Leu Glu Leu 
10 

Val Pro Glu Glu 

25 

Gly Thr Glu Lys 



Ser Pro Pro Ser 
60 

Val Thr Gly Glu 
75 

Gin Thr Ser Asn 
90 

Val Leu Asp Val 
105 

Tyr Leu Ser Phe' 



Leu Ser Ala Asn 
15 

Lys Lys Pro Arg 
30 

Gly Ser Lys Lys 
45 

Asp Phe Glu His 



Phe Thr Gly Met 
80 

He Thr Lys Leu 
95 

Leu Lys Phe Tyr 
110 

Thr Pro Pro Glu 
125 
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Lys Asp Giy Phe Pro Ser Gly Thr Pro Ala Leu Asn Ala Lys Gly Thr 
130 135 

Glu Ala Pro Ala Val Val Thr Glu Glu Glu Asp Asp Asp Glu Glu Thr 

150 

Ala Pro Pro Vai He Ala Pro Arg Pro Asp His Thr Lys Ser He Tyr 
165 170 

Thr Arg Ser Val He Asp Pro Val Pro Ala Pro Val Gly Asp Ser His 



180 185 



190 



val Asp Gly Ala Ala Lys Ser Leu Asp Lys Gin Lys Lys Lys Thr Lys 
195 200 205 

Met Thr Asp Glu Glu lie Met Glu Lys Leu Arg Thr lie Val Ser He 
210 215 220 

Gly. Asp Pro Lys Lys Lys Tyr Thr Arg Tyr Glu Lys He Gly Gin Gly 

235 240 

Ala Ser Gly Thr Val Phe Thr Ala Thr Asp Val Ala Leu Gly Gin Glu 
245 250 255 

val Ala He Lys Gin He Asn Leu Gin Lys Gin Pro Lys Lys Glu Leu 
260 265 270 

He He Asn Glu He Leu Val Met Lys Glu Leu 



275 280 



Lys Asn Pro Asn He 
285 



val Asn Phe Leu Asp Ser Tyr Leu Val Gly Asp Glu Leu Phe Val Val 



290 295 



300 



Met Glu Tyr Leu Ala Gly Arg Ser Leu Thr Asp Val Val Thr Glu Thr 
Cys Met Asp Glu Ala Gin He Al 



325 



a Ala Val Cys Arg Glu Cys Leu Gin 

335 

Ala Leu Glu Phe Leu His Ala Asn Gin Val He His Arg Asp He Lys 
340 345 

ser Asp Asn Val Leu Leu Gly Met Glu Gly Ser Val Lys Leu Thr Asp 
355 360 

Phe Gly Phe Cys Ala Gin He Thr P 



370 



375 



'ro Glu Gin Ser Lys Arg Ser Th 



r 



380 



Met val Gly Thr Pro Tyr Trp Met Ala Pro Glu Val Val Thr Arg Lys 

390 395 400 



Ala Tyr Gly Pro Lys Val Asp He Trp Ser Leu Glv He Met Ala 11- 

410 - 

Glu Met Val Glu Gly Glu Pro d^o Tvt t<=,- a=„ ^-t 

y -i-^ - J. -I ._o lyr Leu Asn Glu Asn Pro Leu Arc 
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420 

Ala Leu Tyr Leu 
435 

Glu Lys Leu Ser 
450 

Met Asp Val Glu 
465 

Phe Leu Lys Leu 



Ala Ala Lys Glu 
500 



lie Ala Thr Asn 
440 

Pro lie Phe Arg 
455 

Lys Arg Gly Ser 
470 

Ala Lys Pro Leu 
485 

Ala Met Lys Ser 



104 

425 

Gly Thr Pro Glu 



Asp Phe Leu Asn 
460 

Ala Lys Glu Leu 
475 

Ser Ser Leu Thr 
490 

Asn Arg 
505 



430 

Leu Gin Asn Pro 
445 

Arg Cys Leu Glu 



Leu Gin His Pro 
480 

Pro Leu lie Met 
495 



<210> 95 
<211> 544 
<212> PRT 
<213> PAK3_m 

<400> 95 

Met Ser Asp Ser Leu Asp Asn Glu Glu Lys Pro Pro Ala Pro Pro Leu 
15 10 15 

Arg Met Asn Ser Asn Asn Arg Asp Ser Ser Ala Leu Asn His Ser Ser 
20 25 30 

. Lys Pro Leu Pro Met Ala Pro Glu Glu Lys Asn Lys Lys Ala Arg Leu 
35 40 45 

Arg Ser lie Phe Pro Gly Gly. Gly Asp Lys Thr Asn Lys Lys Lys Glu 
^0 . 55 60 

Lys Glu Arg Pro Glu He Ser Leu Pro Ser Asp Phe Glu His Thr He 

70 75 80 

His Val Gly Phe Asp Ala Val Thr Gly Glu Phe Thr Gly He Pro Glu 
85 90 95 

Gin Trp Ala Arg Leu Leu Gin Thr Ser Asn He Thr Lys Leu Glu Gin 
100 105 110 

Lys Lys Asn Pro Gin Ala Val Leu Asp Val Leu Lys Phe" Tyr Asp Ser 
115 120 125 

Lys Glu Thr Val Asn Asn Gin Lys Tyr Met Ser Phe Thr Ser Gly Asp 
1^0 135 140 

Lys Ser Ala His Gly Tyr He Ala Ala His Gin Ser Asn Thr Lys Thr 

150 155 160 
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Giy Ser Glu Pro Pre Leu Ala Pro Pro Val Ser Glu Glu Glu Asp Giu 
165 170 175 

Glu Glu Glu Glu Giu Glu Asp Asp Asn Glu Pro Pro Pro Val lie Ala 
1.80 185 190 

Pro Arg Pro Glu His Thr Lys Ser He Tyr Thr Arg Ser Val Val Glu 
195 200 205 

Ser He Ala Ser Pro Ala Ala Pro Asn Lys Glu Asp He Pro Pro Ser 
210 215 220 

Ala Glu Asn Ala Asn Ser Thr Thr Leu Tyr Arg Asn Thr Asp Arg Gin 
225 230 235 240 

Arg Lys Lys Ser Lys Met Thr Asp Glu Giu lie Leu Glu Lys Leu Arg 
245 250 255 

Ser lie Val Ser Val Giy Asp Pro Lys Lys Lys Tyr Thr Arg Leu Glu 
260 265 270 

Lys He Giy Gin Giy Ala Ser Giy Thr Val Tyr Thr Ala Leu Asp He 
275 280 285 

Ala Thr Giy Gin Glu Val Ala lie Lys Gin Met Asn Leu Gin Gin Gin 
290 295 300 

Pro Lys Lys Giu Leu lie lie Asn Glu He Leu Val Met Arg Glu Asn 
305 310 315 320 

Lys Asn Pro Asn lie Val Asn Tyr Leu Asp Ser Tyr Leu Val Giy Asp 
325 330 335 

Glu Leu Trp Vai Val Met Glu Tyr Leu Ala Giy Giy Ser Leu Thr Asp 
340 345 350 

Val Vai Thr Giu Thr Cys Met Asp Vai Giy Gin He Ala Ala Vai Cys 
355 360 365 

Arg Giu Cys Leu Gin Ala Leu Asp Phe Leu His Ser Asn Gin Val He 
370 375 380 

His Arg Asp lie Lys Ser Asp Asn He Leu Leu Giy Met Asp Giy Ser 
385 390 395 400 

Vai Lys Leu Thr Asp Phe Giy Phe Cys Ala Gin lie Thr Pro Giu Gin 
405 410 415 

Ser Lys Arg Ser Thr Met Val Giy Thr Pro Tyr Trp Met Ala Pro Glu 
420 425 430 

Val Val Thr Arg Lys Ala Tyr Giy Pro Lys Val Asp He Trp Ser Leu 
435 440 445 

Giy He Met Ala He Glu Met Val Giu Giy Glu Pro Pro Tyr Leu Asn 
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45C 455 460 

Glu Asn Pro Leu Arg Ala Leu Tyr Leu lie Ala Thr Asn Gly Thr Pro 
465 470 475 480 

Glu Leu Gin Asn Pro Glu Arg Leu Ser Ala Val Phe His Asp Phe Leu 
485 490 495 

Asn Arg Cys Leu Glu Met Asp Val Asp Arg Arg Gly Ser Ala Lys Glu 
500 505 510 

Leu Leu Gin His Pro Phe Leu Lys Leu Ala Lys Pro Leu Ser Ser Leu 
515 520 525 

Thr Pro Leu He lie Ala Ala Lys Glu Ala He Lys Asn Ser Ser Arg 



<210> 96 
<211> 2110 
<212> DNA 

<213> Full Length Mammalian (Huinan) STLK5 
<400> 96 

ggccaagacg gtcggggctg cttgctaact ccaggaacag gtttaagttt ttgaaactga 60 

agtaggtcta cacagtagga actcatgtca tttcttgtaa gtaaaccaga gcgaatcagg 120 

cggtgggtct cggaaaagtt cattgttgag ggcttaagag atttggaact atttggagac 180 

caatgatgcg agctcagagt caatagcatc cttctctaaa caggaggtca tgagtagctt 240 

tctgccagag ggagggtgtt acgagctgct cactgtgata ggcaaaggat ttgaggacct 300 
gatgactgtg aatctagcaa ggtacaaacc aacaggagag tacgtgactg tacggaggat ■ ■ 360 

taacctagaa gcttgttcca atgagatggt: aacattcttg cagggcgagc tgcatgtctc 420 

caaactcttc aaccatccca atatcgtgcc atatcgagcc acttttattg cagacaatga 480 

gctgtgggrt gtcacatcat tcatggcata cggttctgca aaagatctca cctgtacaca 540 

cttcatggat ggcatgaatg agctggcgat tgcttacatc ctgcaggggg tgctgaaggc 600 

cctcgactac atccaccaca tgggatatgt acacaggagt gtcaaagcca gccacatcct 660 

gatctctgtg gatgggaagg tctacctgtc tggtttgcgc agcaacctca gcatgataag 720 

ccatgggcag cggcagcgag tggtccacga ttttcccaag tacagtgtca aggttctgcc 780 

gtggctcagc cccgaggtcc tccagcagaa tctccagggt tatgatgcca agtctgacat 840 

ctacagtgtg ggaatcacag cctgrgaact ggccaacggc catgtcccct ttaaggatat 900 

gcctgccacc cagatgctgc tagagaaact gaacggcaca gtgccctgcc tgttggatac 960 

cagcaccatc cccgctgagg agctgaccat gagcccttcg cgctcagtgg ccaactctgg 1020 

cctgagtgac agcctgacca ccagcacccc ccggccctcc aacggtgact cgccctccca 1030 

cccctaccac cgaaccttct ccccccactt ccaccacttt gtggagcagt gccttcagcg 1140 

caacccggat gccaggccca grgccagcac cctcctgaac cactctttct tcaagcagat 1200 

caagcgacgr gcctcagagg ctttgcccga attgcttcgt cctgtcaccc ccatcaccaa 1260 

ttttgagggc agccagtctc aggaccacag tggaatcttt ggcctggtaa caaacctgga 1320 

agagctggag gtggacgatt gggagttc-g agcctctgca aactgtgcgc attctccagc 1380 

cagggatgca gaggccaccc agaggcccct cctgagggcc ggccacattc ccgccctcct 1440 

gggcagattg ggtagaaagg acattcttcc aggaaagttg actgctgact gattgggaaa 1500 

gaaaatcctg gagagatact tcactgctcc aaggcttttg agacacaagg gaatcccaac 1560 

aaccagggat caggagggtc caaagccgac atrcccagtc ctgrgagcuc aggtgacctc 1620 
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ctccgcagaa 
ctcct taaac 
aactt tgctc 
ctgcctccct 
taagcacaga 
acctaagtct 
ttagcgcctc 
cttgagggct 
agagacgctg 



gagagatgct 
ccgaggaccg 
tcagatgcct 
tccct cagac 
ctaaggctgg 
ttcccacggt 
ctgccagctg 
taagtcctca 



gctctggccc 
ccacctcttc 
cagatgcta r 
ctctccctca 
aacagtccat 
ttatgtgtgt 
ccctggt get 
tgctgtgtta 



tgggagctga 
ccagtgct tg 
aggtcagtga 
taar tccaga 
ccttatccct 
gcctcattcc 
ttctccaagg 
gtttcgttgt 



atrccaagcc 
cgaccagcct 
aagggcaagu 
gaagggcat t 
cttctggctt 
tttcccacca 
gccatcagtg 
cagaacaaat 



cagggtttgg 
cattctattt 
agtaagctgc 
tctgtctttt 
gggccctgac 
agaatccatc 
tcttgcctag 
taaaattttc 



1680 
1740 
1800 
1360 
1920 
1980 
2040 
2100 
2110 



<210> 97 
<21i> 373 
<212> PRT 

<213> Full Length Mammalian (Human) STLK 5 
<400> -97 



Met Ser Ser Phe Leu Pro Glu Gly Gly Cys Tyr Glu Leu Leu Thr Val 
1 5 10 15 

lie Gly Lys Gly Phe Glu Asp Leu Met Thr Val Asn Leu Ala Arg Tyr 
20 25 30 

Lys Pro Thr Gly Glu Tyr Val Thr Val Arg Arg lie Asn Leu Glu Ala 
35 40 45 

Cys Ser Asn Glu Met Val Thr Phe Leu Gin Gly Glu Leu His Val Ser 
50 55 60 



Lys Leu Phe Asn 
65 

Ala Asp Asn Glu 



Ala Lys Asp Leu 
100 

Ala lie Ala Tyr 
115 

His His Met Gly 
130 

lie Ser Val Asp 
145 

Ser Met lie Ser 



Lys Tyr Ser Val 
180 



His Pro Asn lie 
70 

Leu Trp Val Val 
85 

lie Cys Thr His 



lie Leu Gin Gly 
120 

Tyr Val His Arg 
135 

Gly Lys Val Tyr 
150 

His Gly Gin Arg 
165 

Lys Val Leu Pro 



Val Pro Tyr Arg 
75 

Thr Ser Phe Met 
90 

Phe Met Asp Gly 
105 

Val Leu Lys Ala 



Ser Val Lys Ala 
140 

Leu Ser Gly Leu 
155 

Gin Arg Val Val 
170 

Trp Leu Ser Pro 
185 



Ala Thr Phe lie 
80 

Ala Tyr Gly Ser 
95 

Met Asn Glu Leu 
110 

Leu Asp Tyr lie 
125 

Ser His lie Leu 



Arg Ser Asn Leu 
160 

His Asp Phe Pro 
175 

Glu Val Leu Gin 
190 
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Gin Asn Leu Gin Gly Tyr Asp Ala Lys Ser Asp He Tyr Ser Vai Giy 
195 200 '"205 

He Thr Ala Cys Glu Leu Ala Asn Gly His Val Pro Phe Lys Asp Met 
210 215 220 

Pro Ala Thr Gin Met Leu Leu Glu Lys Leu Asn Gly Thr Val Pro Cys 
225 230 235 240 

Leu Leu Asp Thr Ser Thr He Pro Ala Glu Glu Leu Thr Met Ser Pro 
245 250 255 

Ser Arg Ser Val Ala Asn Ser Giy Leu Ser Asp Ser Leu Thr Thr Ser 
260 ■ 265 270 

Thr Pro Arg Pro Ser Asn Gly Asp Ser Pro' Ser His Pro Tyr His Arg 
275 280 285 

Thr Phe Ser Pro His Phe His His Phe Val Glu Gin Cys Leu Gin Arg 
290 295 300 

Asn Pro Asp Ala Arg Pro Ser Ala Ser Thr Leu Leu Asn His Ser Phe 
305 310 315 320 

Phe Lys Gin He Lys Arg Arg Ala Ser Glu Ala Leu Pro Glu Leu Leu 
325 330 335 

Arg Pro Val Thr Pro He Thr Asn Phe Glu Gly Ser Gin Ser Gin Asp 
340 345 350 

His Ser Gly He Phe Gly Leu Val Thr Asn Leu Glu Glu Leu Glu Val 
355 ■ 360 365 

Asp Asp Trp Glu Phe 
370 



<210> 98 
<211> 2001 
<212> DNA 

<213> Mammalian (Human) STLK6 
<400> 98 



aaggaagata aaacaaaagc cttctttgga atagatggat ttttgtcact ttctgtgtga 

actaaagtga ttcaargtct cctttggact gcttctgcac ttcaagaaca caagttgaat 120 

cactcagacc tgaaaaacag tctgaaacca gtatccatca atacttggtt gatgagccaa 

ccctttccrg gtcacgtcca tccactagag ccagtgaagt actatgttcc accaacgttt 240 

ctcactatga gctccaagta gaaacaggaa gaggarttga caacttgacc tctgtccatc 300 

ttgcacggca tactcccacg ggaacactgg taactataaa aattacaaat crggaaaact 360 

gcaatgaaga acgcctgaaa gctttacaga aagccgtgat tctarcccac ttttcccggc 420 
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atcccaazaz tacaacttat tggacagttt tcactgttgg cagctggctt tgggttattc 480 

ctccatttat ggcctatggt tcagcaagtc aactcttgag gacctatttt cctgaaggaa 540 

cgagtgaaac tttaataaga aacattctct ttggagccgr gagagggttg aactatctgc 600 

accaaaatgg ctgtattcac aggagtatta aagccagcca tatcctcatt tctggtgatg 660 

gcctagtgac cctctctggc ctttcccatc tgcatagttt ggttaagcat ggacagaggc 720 

atagggcrgt gtatgatttc ccacagttca gcacatcagt gcagccgtgg ctgagtccag 780 

aactactgag acaggattta catgggtata atgrgaagtc agatatttac agrgttggga 840 

ttacagcatg tgaattagcc agtgggcagg tgcctttcca ggacatgcat agaactcaga 900 

tgctgttaca gaaactgaaa ggtcctcctt atagcccatt ggatatcagt attttccctc 960 

aatcagaatc cagaatgaaa aatrcccagt caggtgtaga ctctgggatt ggagaaagtg 1020 

tgcttgtctc cagtggaact cacacagtaa atagtgaccg attacacaca ccatcctcaa 1080 

aaactttctc tcctgccttc tttagcttgg tacagctctg tttgcaacaa gatcctgaga 1140 

aaaggccatc agcaagcagt ttattgtccc atgttttctt caaacagatg aaagaagaaa 1200 

gccaggattc aatactttca ctgttgccrc ctgcttataa caagccatca atatcatrac 1260 

ctccagtgtt accttggact gagccagaac gtgattttcc tgatgaaaaa gactcatact 1320 

gggaattcta gggctgccaa atcattttat gtcctatata cttgacactt tctccttgct 1380 

gctttttctt ctgtatttct aggtacaaat accagaatta tacttgaaaa tacagttggt 1440 

gcactggaga atctactatt taaaaccact ctgttcaaag gggcaccagt ttgtagtccc 1500 

tc-tgtttcgc acagagtact atgacaagga aacatcagaa ttactaatct agctagtgtc 1560 

atttattctg gaattttttt ctaagctgtg actaactctt tttatctctc aatataattt 1620 

ttgagccagt taattttttr cagtattttg ctgtcccttg ggaatgggcc ctcagaggac 1680 

agtgcttcca agtacatcrt ctcccagatt ctctggcctt tttaatgagc tattgttaaa 1740 

ccaacaggct agtttatctt acatcagacc cttttctggt agagggaaaa tgtttgtgct 1800 

ttcccttttt cttctgttaa tacttatggt aacacctaac tgagcctcac tcacattaaa i860 

tgattcactr gaaatatata cagaaattgt aatttgcttt tttttaaaaa agggggctaa 1920 

agtaacactt tcctacttat gtaaattata gatcctaaat tcacgcaccc cgtgggagct 1980 

caataaagat ttactgaatt g 2001 



<210> 99 
<211> 373 
<212> PRT 

<213> Mammalian (Human) STLK6 
<400> 99 



lie Gly Lys Gly Phe Glu Asp Leu Met Thr Val Asn Leu Ala Arq Tvr 
20 25 30 

Lys Pro Thr Gly Glu Tyr Val Thr Val Arg Arg lie Asn Leu Glu Ala 

35 40 45 

Cys Ser Asn Glu Mer Val Thr Phe Leu Gin Gly Glu Leu His Val Ser 

^0 55 60 

Lys Leu Phe Asn His Pro Asn He Val Pro Tyr Arg Ala Thr Phe He 

70 75 80 

Ala Asp Asn Glu Leu Trp Val Val Thr Ser Phe Met Ala Tyr Gly Ser 
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Ala Lys Asp Leu lie Cys Thr His Phe Met Asp Gly Met Asn Giu Leu 
100 105 110 

Ala lie Ala Tyr lie Leu Gin Gly Val Leu Lys Ala Leu Asp Tyr lie 
115 120 125 

Kis His Met Gly Tyr Val His Arg Ser Val Lys Ala Ser His lie Leu 
130 135 140 

lie Ser Val Asp Gly Lys Val Tyr Leu Ser Gly Leu Arg Ser Asn Leu 
145 ' 150 155 160 

Ser Met lie Ser His Gly Gin Arg Gin Arg Val Val His Asp Phe Pro 
165 170 175 

Lys Tyr Ser Val Lys Val Leu Pro Trp Leu Ser Pro Glu Val Leu Gin 
180 185 190 

Gin Asn Leu Gin Gly Tyr Asp Ala Lys Ser Asp lie Tyr Ser Val 
195 200 * 205 

lie Thr Ala Cys Glu Leu Ala Asn Gly His Val Pro Phe Lys Asp Met 
210 215 220 

Pro Ala Thr Gin Met Leu Leu Glu Lys Leu Asn Gly Thr Val Pro 
225 230 235 240 

Leu Leu Asp Thr Ser Thr lie Pro Ala Glu Glu Leu Thr Met Ser Pro 
245 . 250 255 

Ser Arg Ser Val Ala Asn Ser Gly Leu Ser Asp Ser Leu Thr Thr 
260 265 270 

Thr Pro Arg Pro Ser Asn Gly Asp Ser Pro Ser His Pro Tyr His Arg 
275 280 285 

Thr Phe Ser Pro His Phe His His Phe Val Glu Gin Cys Leu Gin 
290 295 300 

Asn Pro Asp Ala Arg Pro Ser Ala Ser Thr Leu Leu Asn His Ser Phe 
305 310 315 320 

Phe Lys Gin lie Lys Arg Arg Ala Ser Glu Ala Leu Pro Glu Leu 
325 330 335 

Arg Pro Val Thr Pro lie Thr Asn Phe Glu Gly Ser Gin Ser Gin Asp 
340 345 350 

His Ser Gly lie Phe Gly Leu Val Thr Asn Leu Glu Glu Leu Glu 
355 360 365 

Asp Asp Trp Glu Phe 

370 
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<210> 100 
<211> 311 
<212> DNA 

<213> Mammalian (Human) STLK7 
<400> 100 



la":-???: ^^^^ ^..a.caa. .ct..a..cc 

aatgtcaaac tagcatggat gaactc^tgj aS^aS^^; ^^^^^ggata aaccttgaga 
atcctaatat cgtatcttac ^ac^cS?? ^Jl.^^r agccatgagt caatgccatc iso 
tgaagctgct aagtggaggt cc^gt'c^gg ^tlStJr "^^'^^^^^^^ tggcttgtca 240 
aacacaaaag t gT:tctgg atattattaa gcacattgtg gcaaaagggg 300 

311 



<210> 101 
<211> 103 
<212> PRT 

<213> Mammalian (Human) STLK7 
<400> 101 

Asn A.g ASP ASP Tyr Glu Leu Gin Glu Val He Gly Ser Gly Ala 

^ 10 15 

Ala val Val Gin Ala Ala Tyr Cys Ala Pro Lys Lys Glu Lys Val 

2^ 30 
lie Lys Arg He Asn Leu Glu Lys Cys Gin Thr Ser Met Asp Glu Leu 

45 

,y, 0,u „e „ , 3.r Oln Cys His Hi. v.i 



Ala 



" 60 

ser Tyr Tyr Thr Ser Phe Val Val Lys Asp Glu Leu Trp Leu Val Met 

75 

Lys Leu Leu Ser Gly Gly Ser Val Leu Asp He He Lys His He Val 



90 95 



Ala Lys Gly Glu His Lvs Ser 
100 



<210> 102 
<211> 2806 
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<212> DNA 

<213> Full Length Mammalian (Human) PAK5 
<40C> 102 



cgggagtgrc cgcggtggtg gcggtgcaag agagctgaag gaggcgcgag ggcgcggagt 60 

tccaggccga gcagttaggc cgcgagcgac tgcggcgccg agccgatgag taacccgaag 120 

cccctagagg agtggtcacc tgcctgaggg cacttctgtc ccaccagcat cagaccaggc 180 

cgcaccgagt ccccggcacc atgtttggga agaggaagaa gcgggtggag atctccgcgc 240 

cgtccaactt cgagcaccgc gtgcacacgg gcttcgacca gcacgagcag aagttcacgg 300 

ggctgccccg ccagtggcag agcctgatcg aggagtcggc tcgccggccc aagcccctcg 360 

tcgaccccgc ctgcatcacc tccatccagc ccggggcccc caagaccatc gtgcggggca 420 

gcaaaggtgc caaagatggg gccctcacgc tgctgctgga cgagtttgag aacatgtcgg 480 

rgacacgctc caactccctg cggagagaca gcccgccgcc gcccgcccgt gcccgccagg 540 

aaaatgggat gccagaggag ccggccacca cggccagagg gggcccaggg aaggcaggca 600 

gccgaggccg gttcgccggt cacagcgagg cgggtggcgg cagtggtgac aggcgacggg 660 

cggggccaga gaagaggccc aagtcttcca gggagggctc agggggtccc caggagtcct 720 

cccgggacaa acgccccctc tccgggcctg atgrcggcac cccccagcct gctggtctgg 780 

ccagtggggc gaaactggca gctggccggc cctttaacac ctacccgagg gctgacacgg 840 

accacccatc ccggggtgcc cagggggagc ctcatgacgt ggcccctaac gggccatcag 900 

cggggggcct ggccarcccc cagtcctcct cctcctcctc ccggcctccc acccgagccc 960 

gaggtgcccc cagccctgga gtgctgggac cccacgcctc agagccccag ctggcccctc 1020 

cagcctgcac ccccgccgcc cctgctgttc ctgggccccc tggcccccgc tcaccacagc 1080 

gggagccaca gcgagtatcc catgagcagt tccgggctgc cctgcagctg gtggtggacc 1140 

caggcgaccc ccgctcctac ctggacaact tcatcaagar tggcgagggc- tccacgggca 1200 

tcgtgtgcat cgccaccgtg cgcagctcgg gcaagctggt ggccgtcaag aagatggacc 1260 

tgcgcaagca gcagaggcgc gagctgctct tcaacgaggt ggtaatcatg agggactacc 1320 

agcacgagaa tgtggtggag atgtacaaca gctacctggt gggggacgag ctctgggtgg 1380 

tcatggagtt cctggaagga ggcgccctca ccgacatcgt cacccacacc aggatgaacg 1440 

aggagcagat cgcggccgtg tgccttgcag tgctgcaggc cctgtcggtg ctccacgccc 1500 

agggcgtcat ccaccgggac atcaagagcg actcgatcct gctgacccat gatggcaggg 1560 

tgaagctgtc agactttggg ttctgcgccc aggtgagcaa ggaagtgccc cgaaggaagt 1620 

cgctggtcgg cacgccctac tggatggccc cagagctcat ctcccgcctt ccctacgggc 1680 

cagaggtaga catctggtcg ctggggataa tggtgattga gatggtggac ggagagcccc 1740 

cccacttcaa cgagccaccc ctcaaagcca rgaagatgat tcgggacaac ctgccacccc 1800 

gactgaagaa cctgcacaag gtgtcgccat ccctgaaggg cttcctggac cgcctgctgg 1860 

tgcgagaccc tgcccagcgg gccacggcag ccgagctgct gaagcaccca ttcctggcca 1920 

aggcagggcc gcctgccagc atcgtgcccc tcatgcgcca gaaccgcacc agatgaggcc 1980 

cagcgccctt cccctcaacc aaagagcccc cccgggtcac ccccgcccca ctgaggccag 2040 

tagggggcca ggcctcccac tcctcccagc ccgggagatg ctccgcgtgg caccaccctc 2100 

cttgctgggg gtagatgaga ccctactact gaactccagt tttgatctcg tgacttttag 2160 

aaaaacacag ggactcgtgg gagcaagcga ggctcccagg acccccaccc tctgggacag 2220 

gccctccccc atgttcttct gtctccagga agggcagcgg ccctcccatc actggaagtc 2280 

tgcagtgggg gtcgctgggg gtggagagaa cactaagagg tgaacatgta tgagtgtgtg 2340 

cacgcgtgtg agtgtgcatg tgtgtgtgtg tgcaaaggtc cagccacccc gtcctccagc 2400 

ccgcaagggg tgtctggcgc crtgcctgac acccagcccc ctctccccct gagccattgt 2460 

gggggtcgat cacgaatgtc cgaagagtgg ccttttcccg tagccctgcg ccccctttct 2520 

gtggctggat ggggagacag grcagggccc cccaccctct ccagcccctg cagcaaatga 2580 

ctactgcacc tggacagcct cctcttttct agaagtctat ttatattgtc attttataac 2640 

actctagccc ctgcccttat tgggggacag atggrccctg tcctgcgggg tggccccggc 2700 

agaaccactg cctgaagaac caggttcctg cccggtcagc gcagccccag cccgcccacc 2760 

cctgcctcga gttagtttta caattaaaac actgrcttgt tttgtg 2806 
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<210> 103 
<211> 591 
<212> PRT 

<213> Full Length Mammalian (Human) PAK5hu 
<400> 103 

Met Phe Gly Lys Arg Lys Lys Arg Val Glu lie Ser Ala Pro Ser Asn 
1 5 10-15 

Phe Glu His Arg Val His Thr Gly Phe Asp Gin His Glu Gin Lys Phe 
20 25 30 

Thr Gly Leu Pro Arg Gin Trp Gin Ser Leu lie Glu Glu Ser Ala Arg 
35 40 45 

Arg Pro Lys .Pro Leu Val Asp Pro Ala Cys lie Thr Ser lie Gin Pro 
50 55 60 

Gly Ala Pro Lys Thr He Val Arg Gly Ser Lys Gly Ala Lys Asp Gly 
65 70 75 80 

Ala Leu Thr Leu Leu Leu Asp Glu Phe Glu Asn Met Ser Val Thr Arg 
85 90 95 

Ser Asn Ser Leu Arg Arg Asp Ser Pro Pro Pro Pro Ala Arg Ala Arg 
100 105 110 

Gin Glu Asn Gly Met Pro Glu Glu Pro Ala Thr Thr Ala Arg Glv Gly 
lis 120 125 

Pro Gly Lys Ala Gly Ser Arg Gly Arg Phe Ala Gly His Ser Glu Ala 
130 135 140 

Gly Gly Gly Ser Gly Asp Arg Arg Arg Ala Gly Pro Glu Lys Arg Pro 

150 155 160 

Lys Ser Ser Arg Glu Gly Ser Gly Gly Pro Gin Glu Ser Ser Arg Asp 
165 170 175 

Lys Arg Pro Leu Ser Gly Pro Asp Val Gly Thr Pro Gin Pro Ala Gly 
180 185 190 

Leu Ala Ser Gly Ala Lys Leu Ala Ala Gly Arg Pro Phe Asn Thr Tyr 
195 200 205 

Pro Arg Ala Asp Thr Asp His Pro Ser Arg Gly Ala Gin Gly Glu Pro 
210 215 220 

His Asp Val Ala Pro Asn Gly Pro Ser Ala Gly Gly Leu Ala He Pro 
225 230 235 240 

Gin Ser Ser Ser Ser Ser Ser Arg Pro Pro Thr Arg Ala Arg Gly Ala 
245 250 255 
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Pro Ser Pro Gly Val Leu Gly Pro His Ala Ser Glu Pro Gin Leu Ala 
260 265 270 

Pro Pro Ala Cys Thr Pro Ala Ala Pro Ala Val Pro Gly Pro Pro Gly 
275 280 285 

Pro Arg Ser Pro Gin Arg Glu Pro Gin Arg Val Ser His Glu Gin Phe 
290 295 300 

Arg Ala Ala Leu Gin Leu Val Val Asp Pro Gly Asp Pro Arg Ser Tyr 
305 310 315 320 

Leu Asp Asn Phe lie Lys lie Gly Glu Gly Ser Thr Gly lie Val Cys 
325 330 335 

lie Ala Thr Val Arg Ser Ser Gly Lys Leu Val Ala Val Lys Lys Met 
340 345 350 

Asp Leu Arg Lys Gin Gin Arg Arg Glu Leu Leu Phe Asn Glu Val Val 
355 360 365 

lie Met Arg Asp Tyr Gin His Glu Asn Val Val Glu Met Tyr Asn Ser 
370 375 380 

Tyr Leu Val Gly Asp Glu Leu Trp Val Val Met Glu Phe Leu Glu Gly 
385 390 395 400 

Gly Ala Leu Thr Asp lie Val Thr His Thr Arg Met Asn Glu Glu Gin 
405 410 415 

lie Ala Ala Val Cys Leu Ala Val Leu Gin Ala Leu Ser Val Leu His 
420 425 430 

Ala Gin Gly Val lie His Arg Asp He Lys Ser Asp Ser He Leu Leu 
435 440 445 

Thr His Asp Gly Arg Val Lys Leu Ser Asp Phe Gly Phe Cys Ala Gin- 
450 455 460 

Val Ser Lys Glu Val Pro Arg Arg Lys Ser Leu Val Gly Thr Pro Tyr 
465 470 475 480 

Trp Met Ala Pro Glu Leu He Ser Arg Leu Pro Tyr Gly Pro Glu Val 
485 490 495 

Asp He Trp Ser Leu Gly He Met Val He Glu Met Val Asd Gly Glu 
500 505 510 

Pro Pro Tyr Phe Asn Glu Pro Pro Leu Lys Ala Met Lys Met He Arg 
515 520 525 



Asp Asn Leu Pro Pro Arg Leu Lys Asn Leu His Lys Val Ser Pro Ser 
530 535 540 
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Leu Lys Gly Phe Leu Asp Arg Leu Leu Val Arg Asp Pro Ala Gin Arg 

550 555 

Ala Thr Ala Ala Glu Leu Leu Lys His Pro Phe Leu Ala Lys Ala Gly 
565 570 575 

Pro Pro Ala Ser He Val Pro Leu Met Arg Gin Asn Arg Thr Arg 

585 590 



<210> 104 

<211> 3684 

<212> DNA 

<213> Full Length Mammalian (Huinan) ZC4 

<400> 104 



atggcgggac ctgggggctg gagggacagg gaggtcacgg atctgggcca cctgccqqat fin 

tt^oaac'r ^^"^^^^^^ agataaaacc attggccttg gtact?!tgg cagaatcXt 120 

ttgggacttc atgagaagac tggtgcattt acagctgtta aagtgatgaa cgctcqtaao IRn 

a::a;ta;:? Tr^'r^ .actgaactc aaccttctga ggaagta^tc tttccac::a 240 

aacattgtgt ccttctatgg agcatttttc aagctgagtc cccctggtca gcggcaccaa 300 

ctttggatgg tgatggagtt atgtgcagca ggttcggtca ctgatgtagt gagaatgacc 360 

agtaatcaga gtttaaaaga agattggatt gcttatatct gccgagaaat cc^tcagggc 420 

lln. : ttcacgcaca ccgagtaatt caccgggaca tcaaaggtca gaatgtg^tg 480 

^c^aat^f ' -tgctgaagt aaaactggtt gattttggag tgagtgccca ggtgagcag! 54^ 

aa;?aJa^ta T"""""^"'' tttcattggg acaccatact ggatggcacc tgaggtga^t loo 

ItTJl ^^^^'^^"^^ acgctcctat gattacagaa gtgatgtgtg gtctgtggga 660 

octcSr" "g^-tggc tgaaggagcc cctcctctgt gtaaccttca acccttggaa 7I0 

gctctcttcg ttattttgcg ggaatctgct cccacagtca aatccagcgg atggtcccgt 780 

gcaaa^"" -"tcatgga aaagtgtacg ataaaaaatt tcctgttt^g tcc^acttct 840 

taTttZlt: atttgttcgg gatataaaaa atgaacgaca tgttgttgag 900 

tcattaacaa ggcatcttac tggaatcatt aaaaaaagac agaaaaaaga acaggcacgg 960 

TolrrT"^ tactctgagg caagcactgg caaaaagact atcaccaaag io|S 

aggttcaggg caaagtcatc atggagacct gaaaagcttg aactctcgga tttagaagcc 1080 

cgcaggcaaa ggcgccaacg cagatgggaa gatatcttta atcagcatga ggaagaa^tg iLo 
agacaagttg ataaagacaa agaagatgaa tcatcagaca atgaJgaagt ^^ttcaJtcg ' 1200 

olaTJT^" ""^^'^"^^^ agagccattg aagccataca tttcaaatcc taaaaaaat? 1260 

ttltllt "^^5^^=^^=^ ttctgtgcct aacaaccagg atcatgcaca tcatgtcaag 1320 

ttctcttcaa gcgttcctca gcggtctctt ttggaacaag ctcagaagcc cattgacatc 1380 

aa:a";:?a': f ^^^^^^^^ tcgtcaaaat tggctggcag catcaVga ttc^LgLc 440 

tt^T I caggcaaaac acagagctac tgtttaacaa tttatatttc agaagtcaaq 1500 

aaagaagaat ttcaagaagg aatgaatcaa aagtgtcagg gagcccaaot agga^tagga 1560 

^-^tgaaggcc attgtatttg gcaattgggt gaatcttctt ctgaggaaga aaSccJg^g isto 

actggaagga ggtctcagtc atcaccacct tattctacta ttga^cagaa gSgctggt? 1680 

gacatccarg ttccagatgg atttaaagta ggaaaaatat caccccc^gt atacttgaca 1740 
gcaccS'a tgcactctct gaaatcttcc ggaatgat^g gttaac^ccg 
qctqaJaer^ ttcagccacc tgaagaggar ggtgattatg ttgaactcta tgatgccag? 
caJo^.r^ atggtgatga tgatgatgag tctaatgata cttttgaaga tacctatgat 
gaccatq^t'o cttggataac caggttgatc aggctaatga tgtttgtaaa 
a-rlnr^ ^ ^^^"^ taagtttgtt gatgatgtaa ataataatta ttatgaggcg 
oallatT." caagggcaag ctatggcaga gatggaagct gcaagcaaga tggttatgat 
otraT.n ^ gaaaagagga agcctacaga ggctatggaa gccatacagc caatagaagc 

qccaa"a?!o ^'^"^f^""^ tgaggacaat gcagccattg gagatcagga agaacatgca ..20 

gccaa.a.ag gcagtgaaag aagaggcagt gagggtgacg gaggtaaggg agtcgttcga 228^ 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
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accagtgaag agagtggagc ccttggaccc aatggagaag aaaattgctc agagacagat 2340 

ggtccaggat tgaagagacc tgcgccccag gactttgaat atctacagga ggagccaggt 2400 

ggtggaaatg aggcctcaaa tgccattgac tcaggtgctg caccgtcagc acctgarcar 24 60 

gagagtgaca ataaggacat atcagaatca tcaacacaat cagatttttc tgccaatcac 2520 

tcatctcctt ccaaaggttc tgggatgtct gctgatgcta actttgccag tgccatctta 2580 

tacgctggat tcgtagaagt acctgaggaa tcacctaagc aaccctctga agtcaatgtt 2640 

aacccactct angtctctcc tgcatgtaaa aaaccactaa tccacatgta tgaaaaggag 2700 

ttcacttctg agatctgctg tggttcttrg tggggagtca atttgctgtt gggaacccga 2760 

tctaatcrac atctgatgga cagaagtgga aaggctgaca ttactaaact tataaggcga 2820 

agaccattcc gccagattca agtcttagag ccactcaatt tgcrgattac catctcaggt 2880 

cataagaaca gacttcgggt gtatcatctg acctggttga ggaacaagat tttgaataat 2940 

gatccagaaa gtaaaagaag gcaagaagaa atgctgaaga cagaggaagc ctgcaaagcr 3000 

attgataagt taacaggctg tgaacacttc agtgtcctcc aacatgaaga aacaacatat 3060 

attgcaatzg ctttgaaatc atcaartcac ctttatgcat gggcaccaaa gtcctttgat 3120 

gaaagcactg ctattaaagt atttccaaca cttgatcata agccagtgac agttgacctg 3180 

gctattggTit ctgaaaaaag actaaagatt ttcttcagct cagcagatgg atatcacctc 3240 

arcgatgcag aatctgaggt tatgtctgat gtgaccctgc caaagaatcc cctggaaatc 3300 

attataccac agaatatcat catttnacct gattgcttgg gaattggcat gatgctcacc 3360 

ttcaatgctg aagccctctc tgtggaagca aatgaacaac tcttcaagaa garccttgaa 3420 

atgtggaaag acataccatc tt ctatagct ' t ttgaatgta cacagcgaac cacaggatgg 3480 

ggccaaaagg ccattgaagr gcgctctttg caarccaggg ttctggaaag tgagccgaag 3540 

cgcaggrcaa ttaagaagct gagattcctg tgcacccggg gtgacaagct gtt'cttracc 3600 

rctaccctgc gcaatcacca cagccgggtt tacttcatga cacttggaaa acttgaagag 3660 

ctccaaagca attatgatgt ctaa 3684 



<210> 105 
<211> 1227 
<212> PRT 

<213> Full Length Mammalian (Human) ZC4 



<400> 105 



Met Ala Gly Pro 
1 

His Leu Pro Asp 
20 

Leu Gly Thr Tyr 
35 

Ala Phe Thr Ala 
50 

Asp Leu Arg Thr 
65 

Asn lie Val Ser 



Gin Arg His Gin 
100 



Gly Gly Trp Arg 



Pro Thr Gly lie 



Gly Arg lie Tyr 
40 

Val Lys Val Met 
55 

Glu Leu Asn Leu 
70 

Phe Tyr Gly Ala 
85 

Leu Trp Met Val 



Asp Arg Glu Val 
10 

Phe Ser Leu Asp 
25 

Leu Gly Leu His 



Asn Ala Arg Lys 
60 

Leu Arg Lys Tyr 
75 

Phe Phe Lys Leu 

; 90 

Met Glu Leu Cys 
105 



Thr Asp Leu Gly 
15 

Lys Thr lie Gly 
30 

Glu Lys Thr Gly 
45 

Asp Glu Glu Glu 



Ser Phe His Lys 
80 

Ser Pro Pro Gly 
95 

Ala Ala Gly Ser 
110 
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Val Thr Asp Val Val Arg Met Thr S 
115 120 



117 

er Asn Gin Ser Leu Lys Glu Asp 
125 



Trp lie Ala Tyr He Cys Arg Glu He Leu Gin Gly Leu Ala His Leu 

135 



His Ala His Arg Val lie His Arg Asp lie Lys Gly Gin Asn Val Leu 

160 



145 150 155 



Leu Thr His Asn Ala Glu Val Lys Leu Val Asp Phe Gly Val Ser Ala 
165 170 

Gin Val Ser Arg Thr Asn Gly Arg Arg Asn Ser Phe He Gly Thr Pro 
180 185 , 190 

Tyr Trp Met Ala Pro Glu Val He Asp Cys Asp Glu Asp Pro Arg Aro 
195 200 205 

Ser Tyr Asp Tyr Arg Ser Asp Val Trp Ser Val Gly He Thr Ala He 
210 215 220 

Glu Met Ala Glu Gly Ala Pro Pro Leu Cys Asn Leu Gin Pro Leu Glu 

230 235 240 

Ala Leu Phe Val He Leu Arg Glu Ser Ala Pro Thr Val Lys Ser Ser 
245 250 255 

Gly Trp Ser Arg Lys Phe His Asn Phe Met Glu Lys Cys Thr He Lvs 
260 265 270 

Asn Phe Leu Phe Arg Pro Thr Ser Ala Asn Met Leu Gin His P>-o Phe 
275 280 285 

Val Arg Asp He Lys Asn Glu Arg His Val Val Glu Ser Leu Thr Arg 

295 300 

His Leu Thr Gly He He Lys Lys Arg Gin Lys Lys Glu Gin Ala Arg 

310 315 320 

Glu Lys Lys Ser Lys Val Ser Thr Leu Arg Gin Ala Leu Ala Lys Arg 
325 330 

Leu Ser Pro Lys Arg Phe Arg Ala Lys Ser Ser Tro Arg Pro Glu Lvs 
340 "350 

Leu Glu Leu Ser Asp Leu Glu Ala Arg Arg Gin Arg Arg Gin Arg Arg 
355 360 365 



Trp Glu Asp He Phe Asn Gin His Glu Glu Glu Leu Arg Gin Val A 
° 375 380 

Lys Asp Lys Glu Asp Glu Ser Ser Asp Asn Asp Glu Val Phe His Ser 



390 395 



400 
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lie Gin Ala Glu Val Gin lie Giu Pro Leu Lys Pro Tyr lie Ser Asn 
405 410 415 

Pro Lys Lys lie Glu Val Gin Glu Arg Ser Pro Ser Val Pro Asn Asn 
420 425 430 

Gin Asp His Ala His His Val Lys Phe Ser Ser Ser Val Pro Gin Arg 
435 440 445 

Ser Leu Leu Glu Gin Ala Gin Lys Pro lie Asp lie Arg Gin Arg Ser 
450 455 460 

Ser Gin Asn Arg Gin Asn Trp Leu Ala Ala Ser Gly Asp Ser Lys His 
465 470 475 480 

Lys lie Leu Ala Gly Lys Thr Gin Ser Tyr Cys Leu Thr lie Tyr lie 
485 490 495 

Ser Glu Val Lys Lys Glu Glu Phe Gin Glu. Gly Met Asn Gin Lys Cys 
500 505 510 

Gin Gly Ala Gin Val Gly Leu Gly Pro Giu Gly His Cys lie Trp Gin 
515 520 525 

Leu Gly Glu Ser Ser Ser Glu Glu Glu Ser Pro Val Thr Gly Arg Arg 
530 535 540 

Ser Gin Ser Ser Pro Pro Tyr Ser Thr lie Asp Gin Lys Leu Leu Val 
545 550 555 560 

Asp lie His Val Pro Asp Gly Phe Lys Val Gly Lys lie Ser Pro Pro 
565 570 575 

Val Tyr Leu Thr Asn Glu Trp Val Gly Tyr Asn Ala Leu Ser Glu lie 
580 585 590 

Phe Arg Asn Asp Trp Leu Thr Pro Ala Pro Val lie Gin Pro Pro Glu 
595 600 605 

Glu Asp Gly Asp Tyr Val Glu Leu Tyr Asp Ala Ser Ala Asp Thr Asp 
610 615 620 

Gly Asp Asp Asp Asp Glu Ser Asn Asp Thr Phe Glu Asp Thr Tyr Asp 
625 630 635 640 

His Ala Asn Gly Asn Asp Asp Leu Asp Asn Gin Val Asp Gin Ala Asn 
645 650 655 



Asp Val Cys Lys Asp His Asp Asp Asp Asn Asn Lys Phe Val A.sp Asp 
660 665 670 



Val Asn Asn Asn Tyr Tyr Giu Ala Pro Ser Cys Pro Arg Ala Ser Tyr 
675 . 630 685 
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Gly Arg Asp Gly Ser Cys Lys Gin Asp Gly Tyr Asp Gly Ser Arg Gly 
690 695 700 

Lys Glu Glu Ala Tyr Arg Gly Tyr Gly Ser His Thr Ala Asn Arg Ser 
705 710 715 720 

His Gly Gly Ser Ala Ala Ser Glu Asp Asn Ala Ala lie Gly Asp Gin 
725 730 735 

Glu Glu His Ala Ala Asn lie Gly Ser Glu Arg Arg Gly Ser Glu Gly 
740 745 ' 750 

Asp Gly Gly Lys Gly Val Val Arg Thr Ser Glu Glu Ser Gly Ala Leu 
755 760 765 

Gly Leu Asn Gly Glu Glu Asn Cys Ser Glu Thr Asp Gly Pro Gly Leu 
770 775 780 

Lys Arg Pro AJ..a Ser Gin Asp Phe Glu Tyr .Leu Gin Glu Glu Pro Gly 
785 790 795 800 

Gly Gly Asn Glu Ala Ser Asn Ala lie Asp Ser Gly Ala Ala Pro Ser 
805 810 815 

Ala Pro Asp His Glu Ser Asp Asn Lys Asp lie Ser Glu Ser Ser Thr 
820 825 830 

Gin Ser Asp Phe Ser Ala Asn His Ser Ser Pro Ser Lys Gly Ser Gly 
835 840 845 

Met Ser Ala Asp Ala Asn Phe Ala Ser Ala lie Leu Tyr Ala Gly Phe 
850 855 860 

Val Glu Val Pro Glu Glu Ser Pro Lys Gin Pro Ser Glu Val Asn Val 
865 870 875 880 

Asn Pro Leu Tyr Val Ser Pro Ala Cys Lys Lys Pro Leu lie His Met 
885 890 895 

Tyr Glu Lys Glu Phe Thr Ser Glu lie Cys Cys Gly Ser Leu Trp Gly 
900 905 910 

Val Asn Leu Leu Leu Gly Thr Arg Ser Asn Leu Tyr Leu Met Asp Arg 
915 920 925 

Ser Gly Lys Ala Asp lie Thr Lys Leu lie Arg Arg Arg Pro Phe Arg 
930 935 940 

Gin lie Gin Val Leu Glu Pro Leu Asn Leu Leu lie Thr lie Ser Gly 
^45 950 955 960 

His Lys Asn Arg Leu Arg Val Tyr His Leu Thr Trp Leu Arg Asn Lys 
965 970 " 975 

lie Leu Asn Asn Asp Pro Glu Ser Lys Arg Arg Gin Glu Glu Met Leu 
980 985 990 
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Lys Thr Glu Glu Ala Cys Lys Ala lie Asp Lys Leu Thr Gly Cys Glu 
995 1000 1005 

His ?he Ser Val Leu Gin His Glu Glu Thr Thr Tyr lie Ala lie Ala 
1010 1015 1020 

Leu Lys Ser Ser lie His Leu Tyr Ala Trp Ala Pro Lys Ser Phe Asp 
1025 1030 1035 1040 

Glu Ser Thr Ala lie Lys Val Phe Pro Thr Leu Asp His Lys Pro Val 
1045 1050 1055 

Thr Val Asp Leu Ala lie Gly Ser Glu Lys Arg Leu Lys lie Phe Phe 
1060 1065 1070 

Ser Ser Ala Asp Gly Tyr His Leu He Asp Ala Glu Ser Glu Val Met 
1075 1080 1085 

Ser Asp Val Thr Leu Pro Lys Asn Pro Leu Glu He He He Pro Gin 
1090 1095 1100 

Asn He He He Leu Pro Asp Cys Leu Gly He Gly Met Met Leu Thr 
1105 1110 1115 1120 

Phe Asn Ala Glu Ala Leu Ser Val Glu Ala Asn Glu Gin Leu Phe Lys 
1125 1130 1135 

Lys He Leu Glu Met Trp Lys Asp He Pro Ser Ser He Ala Phe Glu 
1140 1145 1150 

Cys Thr Gin Arg Thr Thr Gly Trp Gly Gin Lys Ala He Glu Val Arg 
1155 1160 1165 

Ser Leu Gin Ser Arg Val Leu Glu Ser Glu Leu Lys Arg Arg Ser He 
1170 1175 1180 

Lys Lys Leu Arg Phe Leu Cys Thr Arg Gly Asp Lys Leu Phe Phe Thr 
1185 1190 1195 1200 

Ser Thr Leu Arg Asn His His Ser Arg Val Tyr Phe Met Thr Leu Gly 
1205 1210 1215 

Lys Leu Glu Glu Leu Gin Ser Asn Tyr Asp Val 
1220 1225 



<210> 106 
<211> 2962. 
<212> DNA 

<213> Full Length Mammalian (Human) GEK2 



<400> 106 
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cgaagccaca gcccgagccc gagcccgagc ccgagccggc gccaccgcgc ccccggccat 60 

ggcttttgcc aatttccgcc gcatcc-gcg cctgtcracc ttcgagaaga gaaagrcccg 120 

cgaatatgag cacgtccgcc gcgacctgga ccccaacgag gtgtgggaga tcgtgggcga 180 

gctgggcgac ggcgccttcg gcaaggttta caaggccaag aataaggaga cgggtgcttt 240 

ggctgcggcc aaagrcactg aaaccaagag tgaggaggag ctggaggact acatcgtgga 300 

gatrgagatc ccggccacct gcgaccaccc cracattgtg aagctcctgg gagcctacta 360 

tcacgacggg aagctgtgga tcatgattga gttctgtcca gggggagccg tggacgccat 420 

catgctggag ctggacagag gcctcacgga gccccagata caggtggttt gccgccagat 480 

gctagaagcc ctcaacttcc tgcacagcaa gaggatcatc caccgagatc tgaaagctgg 540 

caacgtgctg atgaccctcg agggagacat caggctggct gactttggrg tgtctgccaa 600 

gaatctgaag actctacaga aacgagattc cttcatcggc acgccttacr ggatggcccc 660 

cgaggtggtc atgtgtgaga ccatgaaaga cacgcccrac gactacaaag ccgacatctg 720 

gtccctgggc accacgctga ttgagarggc ccagatcgag ccgccacacc acgagctcaa 780 

ccccatgcgg gtcctgctaa agatcgccaa gtcggaccct cccacgctgc tcacgccctc 840 

caagtggtct gtagagtrcc gtgacrtcct gaagatagcc ctggataaga acccagaaac 900 

ccgacccagT: gccgcgcagc tgctggagca tcccttcgtc agcagcatca ccagtaacaa 960 

ggctctgcgg gagccggtgg ctgaggccaa ggccgaggtg atggaagaga tcgaagacgg 1020 

ccgggatgag ggggaagagg aggacgccgt ggatgccgcc tccaccctgg agaaccatac 1080 

tcagaactcc tctgaggrga gtccgccaag cctcaatgct gacaagcctc tcgaggagtc 1140 

accttccacc ccgcrggcac ccagccagrc tcaggacagt gtgaatgagc cctgcagcca 1200 

gccctctggg gacagatccc tccaaaccac cagtccccca gtcgtggccc ctggaaatga 1260 

gaacggcctg gcagtgcctg tgcccctgcg gaagtcccga cccgtgtcaa tggatgccag 1320 

aattcaggta gcccaggaga agcaagttgc tgagcagggt ggggacctca gcccagcagc 1380 

caacagatct caaaaggcca gccagagccg gcccaacagc agcgccctgg agaccttggg 1440 

tggggagaag ctggccaatg gcagcctgga gccacctgcc caggcagctc cagggccttc 1500 

caagagggac tcggactgca gcagcctctg cacctctgag agcatggact atggtaccaa 1560 

tctctccact gacctgtcgc tgaacaaaga gatgggctct ctgtccatca aggacccgaa 1620 

actgtacaaa aaaaccctca agcggacacg caaatttgtg gtggatggtg tggaggtgag 1680 

catcaccacc tccaagatca tcagcgaaga tgagaagaag gatgaggaga tgagatttct 1740 

caggcgccag gaactccgag agcttcggct gctccagaaa gaagagcatc ggaaccagac 1800 

ccagctgagt aacaagcatg agctgcagct ggagcaaatg cataaacgtt ttgaacagga i860 

aatcaacgcc aagaagaagt rctttgacac ggaattagag aacctggagc gtcagcaaaa 1920 

gcagcaagtg gagaagatgg agcaagacca tgccgtgcgc cgccgggagg aggccaggcg 1980 

gatccgcctg gagcaggatc gggactacac caggttccaa gagcagctca aaccgatgaa 2040 

gaaagaggtg aagaacgagg cggagaagct cccccgacag cagcggaagg aaagcatgaa 2100 

gcagaagatg gaggagcaca cgcagaaaaa gcagcttctt gaccgggact ttgtagccaa 2160 

gcagaaggag gacctggagc tggccatgaa gaggctcacc accgacaaca ggcgggagat 2220 

ctgtgacaag gagcgcgagt gcctcatgaa gaagcaggag ctccttcgag accgggaagc 2280 

agccctgtgg gagatggaag agcaccagct gcaggagagg caccagctgg tgaagcagca 2340 

gctcaaagac cagtacttcc tccagcggca cgagctgctg cgcaagcatg agaaggagcg 2400 

ggagcagarg cagcgctaca accagcgcat gatagagcag ctgaaggtgc ggcagcaaca 2460 

ggaaaaggcg cggcrgccca agatccagag gagtgagggc aagacgcgca tggccatgta 2520 

caagaagagc ctccacatca acggcggggg cagcgcagct gagcagcgtg agaagatcaa 2580 

gcagttctcc cagcaggagg agaagaggca gaagrcggag cggctgcagc aacagcagaa 2640 

acacgagaac cagatgcggg acatgctggc gcagtgcgag agcaacatga gcgagctgca 2700 

gcagctgcag aacgaaaagt gccacctcct ggtagagcac gaaacccaga aactgaaggc 2760 

cctggatgag agccataacc agaacctgaa ggaatggcgg gacaagcttc ggccgcgcaa 2820 

gaaggctctg gaagaggatc tgaaccagaa gaagcgggag caggagatgt tcttcaagcr 2880 

gagcgaggag gcggagtgcc caaacccctc caccccaagc aaggccgcca agttcttccc 2940 

ctacagctct ggggatgctt cc 2962 



BNSDOCID: <WO 9953036A2J_> 



wo 99/53036 



PCT/US99/08150 



122 



<210> 107 

<211> 968 

<212> PRT 

<213> Full Length Mammalian (Human) GEK2 

<400> 107 

Met Ala Phe Ala Asn Phe Arg Arg lie Leu Arg Leu Ser Thr Phe Glu 
1 5 10 15 

Lys Arg Lys Ser Arg Glu Tyr Glu His Val Arg Arg Asp Leu Asp Pro 
20 25 30 

Asn Glu Val Trp Glu lie Val Giy Glu Leu Gly Asp Gly Ala Phe Gly 
35 40 45 

Lys Val Tyr Lys Ala Lys Asn Lys Glu Thr Gly Ala Leu Ala Ala Ala 
.50 55 60 

Lys Val lie Glu Thr Lys Ser Glu Glu Glu Leu Glu Asp Tyr lie Val 
65 70 75 80 

Glu lie Glu lie Leu Ala Thr Cys Asp His Pro Tyr lie Val Lys Leu 
85 90 95 

Leu Gly Ala Tyr Tyr His Asp Gly Lys Leu Trp lie Met lie Glu Phe 
100 105 110 

Cys Pro Giy Gly Ala Val Asp Ala lie Met Leu Glu Leu Asp Arg Gly 
115 120 125 

Leu Thr Glu Pro Gin lie Gin Val Val Cys Arg Gin Met Leu Glu Ala 
130 135 140 

Leu Asn Phe Leu His Ser Lys Arg lie lie His Arg Asp Leu Lys Ala 
145 150 155 160 

Gly Asn Val Leu Met Thr Leu Glu Gly Asp lie Arg Leu Ala Asp Phe 
165 170 175 

Gly Val Ser Ala Lys Asn Leu Lys Thr Leu Gin Lys Arg Asp Ser Phe 
180 185 ' 190 

lie Gly Thr Pro Tyr Trp Met Ala Pro Glu Val Val Met ,Cys Glu Thr 
195 200 205 

Met Lys Asp Thr Pro Tyr Asp Tyr Lys Ala Asp lie Trp Ser Leu Gly 
210 215 220 

lie Thr Leu lie Glu Met Ala Gin lie Glu Pro Pro His His Glu Leu 
225 230 235 240 

Asn Pro Met Arg Val Leu Leu Lys lie Ala Lys Ser Asp Pro Pro Thr 
245 250 255 
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Leu Leu Thr Pro Ser Lys Trp Ser Val Giu Phe Arg Asp Phe Leu Lvs 
260 265 270 

He Ala Leu Asp Lys Asn Pro Giu Thr Arg ■ Pro Ser Ala Ala Gin Leu 
275 280 285 

Leu Giu His Pro Phe Val Ser Ser He Thr Ser Asn Lys Ala Leu Ara 
290 295 300 

Giu Leu Val Ala Giu Ala Lys Ala Giu Val Met Giu Giu He Giu Asp 
305 310 -JT c 

■^■^u 315 • 320 

Gly Arg Asp Giu Gly Giu Giu Giu Asp Ala Val Asd Ala Ala Ser Thr 
325 330 

Leu Giu Asn His Thr Gin Asn Ser Ser Giu Val Ser Pro Pro Ser Leu 
340 345 

Asn. Ala Asp Lys Pro Leu Giu Giu Ser Pro Ser Thr Pro Leu Ala Pro 
355 360 365 



Ser Gin Ser Gin Asp Ser Val Asn Giu Pro Cys Ser Gin Pro Ser Gl 

380 



' '•■^^ v-yo otir ^j.n fro aer Gly 

370 375. ^ 



Asp Arg Ser Leu Gin Thr Thr Ser Pro Pro Val Val Ala Pro Gly Asn 

390 395 400 

Giu Asn Gly Leu Ala Val Pro Val Pro Leu Arg Lys Ser Arg Pro Val 
405 410 

Ser Met Asp Ala Arg lie Gin Val Ala Gin Giu Lys Gin Val Ala Giu 
420 425 430 

Gin Gly Gly Asp Leu Ser Pro Ala Ala Asn Arg Ser Gin Lys Ala Ser 
435 440 

Gin Ser Arg Pro Asn Ser Ser Ala Leu Giu Thr Leu Gly Gly Giu Lys 
4^0 455 

Leu Ala Asn Gly Ser Leu Giu Pro Pro Ala Gin Ala Ala Pro Gly Pro 

475 480 



Ser Lys Arg Asp Ser Asp Cys Ser S 
485 

Asp Tyr Gly Thr Asn Leu Ser Thr Asp Leu Ser Leu Asn Lys Giu Met 



er Leu Cys Thr Ser Giu Ser Met 
490 495 



500 505 



510 



Gly Ser Leu Ser lie Lys Asp Pro Lys Leu Tyr Lys - Lvs Thr Leu Lys 
515 520 525 

Arg Thr Arg Lys Phe Val Val Asp Gly Val Giu Val Ser He Thr Thr 
=30 535 

Ser Lys He He Ser Giu Asp Giu Lys Lys Asp Giu Giu Met Arg Phe 

550 555 
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Leu Arg Arg Gin Glu Leu Arg Giu Leu Arg Leu Leu Gin Lys Giu Giu 
565 570 575 

His Arg Asn Gin Thr Gin Leu Ser Asn Lys His Giu Leu Gin Leu Giu 
580 585 590 

Gin Met His Lys Arg Phe Giu Gin Giu lie Asn Aia Lys Lys Lys Phe 
595 600 605 

Phe Asp Thr Giu Leu Giu Asn Leu Giu Arg Gin Gin Lys Gin Gin Vai 
6iO 6i5 620 

Giu Lys Met Giu Gin Asp His Aia Vai Arg Arg Arg Giu Giu Aia Arg 
625 630 635 640 

Arg lie Arg Leu Giu Gin Asp Arg Asp Tyr Thr Arg Phe Gin Giu Gin 
645 650 655 

Leu Lys Leu Met Lys Lys Giu Vai Lys Asn Giu Vai Glu Lys Leu Pro 
660 665 670 

Arg Gin Gin Arg Lys Giu Ser Met Lys Gin Lys Met Giu Giu His Thr 
675 680 685 

Gin Lys Lys Gin Leu Leu Asp Arg Asp Phe Vai Aia Lys Gin Lys Giu 
690 695 700 

Asp Leu Giu Leu Aia Met Lys Arg Leu Thr Th • Asp Asn Arg Arg Giu 
705 710 71o ' 720 

lie Cys Asp Lys Giu Arg Giu Cys Leu Mer Lys Lys Gin Giu Leu Leu 
725 730 735 

Arg Asp Arg Giu Aia Aia Leu Trp Giu Met Giu Giu His Gin Leu Gin 
740 745 750 

Giu Arg His Gin Leu Vai Lys Gin Gin Leu Lys Asp Gin Tyr Phe Leu 
755 760 765 

Gin Arg His Giu Leu Leu Arg Lys His Glu Lys Giu Arg Glu Gin Met 
770 775 780 

Gin Arg Tyr Asn Gin Arg Met lie Giu Gin Leu Lys Vai Arg Gin Gin 
"785 790 795 800 

Gin Giu Lys Aia Arg Leu Pro Lys lie Gin Arg Ser Giu Giy Lys Thr 
805 810 815 

Arg Met Aia Met Tyr Lys Lys Ser Leu His lie Asn Giy Giy Giy Ser 
820 825 630 

Ala Aia Giu Gin Arg Giu Lys lie Lys Gin Phe Ser Gin Gin Glu Giu 
835 840 ^ 845 
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Lys Arg Gin Lys Ser Glu Arg Leu Gin Gin Gin Gin Lys His Glu Asn 

855 



Gin Met Arg Asp Met Leu Ala Gin Cys Glu Ser Asn Met Ser Gl 
865 870 875 



u Leu 
880 



Gin Gin Leu Gin Asn Glu Lys Cys His Leu Leu 
885 



Val Glu His Glu Thr 

890 



Gin Lys Leu Lys Ala Leu Asp Glu Ser His Asn Gin Asn Leu Lys Glu 
900 905 

Trp Arg Asp Lys Leu Arg Pro Arg. Lys Lys Ala Leu Glu Glu Asp Leu 
915 920 925 

Asn Gin Lys Lys Arg Glu Gin Glu Met Phe Phe Lys Leu Ser Glu Glu 
930 935 



940 



Ala Glu Cys Pro Asn Pro Ser Thr Pr 
945 



o Ser Lys Ala Ala Lys Phe Phe 
955 960 



Pro Tyr Ser Ser Gly Asp Ala Ser 
965 



<210> 108 
<211> 11 
<212> PRT 

<213> Mammalian (human) 2C1 and/or ZC3 
<220> 

<223> "Xaa can stand for any amino acid 
<400> 108 

Xaa Leu Xaa Leu Xaa Leu Xaa Leu Xaa Leu 
5 10 



<210> 109 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<4Q0> 109 
gcagcaagtg gagaagatgg 



20 



<210> 110 
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<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 110 

ggaagcatcc ccagagctgt ag 22 



<210> 111 
<211> 12 
<212> PRT 

<213> Mammalian (Human) STLK2 
<400> 111 

Glu Lys Phe Gin Lys Cys Ser Ala Asp Glu Ser Pro 
15 10 



<210> 112 
<211> 16 
<212> PRT 

<213> Mammalian (Human) STLK4 
<400> 112 

Ser lie Ser Asn Ser Glu Leu Phe Pro Thr Thr Asp Pro Vai Gly Thr 
15 10 15 



<210> 113 
<211> 9 
<212> PRT 

<213> Mammalian (Human) SULUl 

<400> 113 

Leu Asp Phe Pro Lys Glu Asp Tyr Arg 
1 5 



<210> 114 
<211> 12 
<2i2> PRT 

<213> Mammalian (Human) SULUl 
<400> 114 
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His Giy Asp Pro Arg Pro Glu Pro Arg Pro Thr Gin 
^ 5 10 



<210> 115 
<211> 14 
<212> PRT 

<213> Mammalian (Human) SULU3 
<400> 115 

Pro Ser Thr Asn Arg Ala Gly Ser Leu Lys Asp Pro Glu Cvs 
1 5 10 



<210> 116 
<211> 19 
<212> PRT 

<213> Mammalian (Human) SULU3 
<400> 116 

Asp Pro Arg Thr Arg Ala Ser Asp Pro Gin Ser Pro Pro Gin Val Ser 
5 10 

Arg His Lys 



<210> 117 
<211> 16 
<212> PRT 

<213> Mammalian (Human) PAK4 
<400> 117 

Cys Leu Val Pro Leu He Gin Leu Tyr Arg Lys Gin Thr Ser Thr Cys 
5 10 ^5 



<210> 118 
<211> 9 
<212> PRT 

<213> Mammalian (Human) PAK5 
<400> 118 

Pro Leu Met Arg Gin Asn Arg Thr Ara 
1 5 
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<210> 119 
<211> 16 
<212> PRT 

<213> Mammalian (Human) PAK5 
<400> 119 

Ser Giy Asp Arg Arg Arg Ala Gly Pro Glu Lys Arg Pro Lys Ser Ser 
15 10 15 



<210> 120 
<211> 16 
<212> PRT 

<213> Mammalian (Human) PAK5 
<400> 120 

Cys Arg Arg Lys Ser Leu Vai Gly Thr Pro Tyr Trp Met Ala Pro Glu 
15 10 15 



<210> 121 
<211> 20 
<212> PRT. 

<213> Mammalian (Human) ZCl 
<400> 121 

Arg Thr Val Gly Arg Arg Asn Thr Phe lie Gly Thr Pro Pro Tyr Trp 
15 10 15 

Met Ala Pro Glu 
20 



<210> 122 
<211> 15 
<212> PRT 

<213> Mammalian (Human) KHSl 
<400> 122 

Lys Arg Lys Ser Phe lie Giy Thr Pro Tyr Trp Met Ala Pro Glu 
15 10 15 
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<210> 123 
<211> 15 
<212> PRT 

<213> Mammalian (Human) STLK2 
<400> 123 

Lys Arg Asn Thr Phe Val Gly Thr Pro- Phe Trp Met Ala Pro Glu 
15 10 15 



<210> 124 
<211> 15 
<212> PRT 

<213> Mammalian (Human) SULUl 
<400> 124 

Pro Ala Asn Ser Phe Val Gly Thr Pro Tyr Trp Met Ala Pro Glu 
1 5 10 15 



<210> 125 
<211> 15 
<212> PRT 

<213> Mammalian (Human) ZCl 
<400> 125 



Arg Arg Asn Thr Phe lie Gly Thr Pro Tyr Trp Met Ala Pro Glu 



1 5 



10 15 



<210> 126 

<211> 15 

<212> PRT 

<213> Mammalian (Human) ZCl 

<400> 126 

Arg Arg Asn Thr Phe He Gly Thr Pro Tyr Trp Met Ala Pro Glu 
1 5 10 15 



<210> 127 
<211> 18 
<212> PRT 

<213> Mammalian (HuiTtan) STLK4 
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<400> 127 

Arg Asn Lys Val Arg Lys Thr Phe Val 
1 5 

Pro Glu 



<210> 128 
<211> 22 
<212> DNA 

<213> Mammalian (Human) STLK3 



<4 00> 128 
ctcccatttc ctagcaaaat ca 



<210> 129 
<211> 22 
<212> DNA 

<213> Mammalian (Human) STLK3 
<400> 129 
agaggcagta ttgtcagatg ta 



<210> 130 
<211> 22 
<212> DNA 

<213> Mammalian (Human) 3TLK4 
<400> 130 
ccacacatgc gtatctctgt tg 



<210> 131 
<211> 24 
<212> DNA 

<213> Mammalian (Human) STLK4 
<400> 
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<210> 132 
<211> 22 
<212> DNA 

<2i3> Mammalian (Human) ZCl 
<4 00> 1:32 
- ^ - at ccG t^ oacactgctt ct 



<210> 133 
<211> 22 
_ <212> DNA 
<213> Mammalian (Human) ZCl 

<400> 133 

caaggtgttc tttgcctctg tt 



: v<210> 134 

- <211> 21 

<^12> DNA 

- -'^^^^^ Mammalian (Human) ZC2 

. " ■<400> 134 

agatggactg tactgggagg g 2 
<212> DNA 

<213> Mammalian (Human) ZC2 
<400> 135 

agaagagcac ttggcactta tc 2; 



<210> 136 
<211> 20 
<212> DNA 

<213> Mammalian (Human) ZC3 
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<400> 136 
catcatgaac cggtgacggg 



<210> 137 
<211> 23 
<212> DNA 
<213> Mammalian (Human) 2C3 

<400> 137 
ccagtgaaat caaaccagta aaa 



<210> 138 

<211> 24 

<212> DNA 

<213> Mammalian (Human) SULUl 

<400> 138 

caaaacctgg ccgtctcttc tatt 



<210> 139 

<211> 23 

<212> DNA 

<213> Mammalian (Human) SULUl 

<400> 139 

atttgtgcta ctgggattct gtg 



<210> 140 
<211> 24 
<212> DNA 
<213> Mammalian (Human) KHS2 

<400> 140 
gaatagcggt accatgatag aata 



<210> 141 
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133 



<211> 23 
<212> DNA 

<213> Marrjnalian (Human) KHS2 
<400> 141 
taccaaaaag agccaaaagt gtg 



23 



<210> 142 
<211> 23 
<212> D^3A 

<213> iyiammalian (Human) PAK4 
<400> 142 
ctcagtattc tctccaaaga ttg 



23 



<210> 143 
<211> 23 
<212> DNA 

<213> Mammalian (Human) PAK4 
<400> 143 
gatgttctct ccattctgra aag ' 



23 



<210> 144 
<211> 21 
<212> DNA 

<213> Mammalian (Human) PAK5 
<400> 144 
catcactgga agtctgcagt g 



<210> 145 
<211> 20 
<212> DNA 

<213> Mammalian (Human) PAK5 
<400> 145 
:aggtgcagi: agtcattcgc 



20 
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134 

<210> 146 
<211> 20 
<212> DNA 
<213> T7 phage 

<400> 146 

ggagctgtcg tattccagtc 20 

<210> 147 
<211> 21 
<212> DNA 
<213> T7 phage 

<400> 147 

aacccctcaa gacccgctta g 21 
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STE2C- RELATED PROTEIN KINASES 

RELATED APPLICATIONS 
The presenc application claims priority to U.S. 
Provisional Patent Application Serial No. 60/051,734 bv 
Plowman and Martinez, entitled STE20-Related Protein 
kinases, filed April 14, 1998 (Lyon & Lyon Docket No. 
232/279) , hereby incorporated by reference herein in its 
entirety, including any drawings, tables, or figures. 

FIELD OF THE INVENTION 
The present invention relates to novel kinase 
polypeptides, nucleotide sequences encoding the novel kinase 
polypeptides, as well as various products and methods useful 
for the diagnosis and treatment of various kinase- related 
diseases and conditions. 

BACKGROUND OF THE INVENTION 
The following description of the background of the 
invention is provided to aid in understanding the invention, 
but is not admitted to be or to describe prior art to the 
invention. 

Cellular signal transduction is a fundamental mechanism 
whereby external stimuli that regulate diverse cellular 
processes are relayed to the interior of cells. One of the 
key biochemical mechanisms of signal transduction involves 
the reversible phosphorylation of proteins, which enables 
regulacion cf the activity of mature proteins by altering 
their structure and functic-n. 

The best characterized protein kinases m eukaryoces 
phosphcrylate proteins on the hydroxy! moiety of serine, 
threonine and tyrosine residues. These kinases largely fall 
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inco c'.vG groups, chose specific for phosphoryiac: ing serines 
and. chrec.nines, and chose specific for phosphorylacing 
cyrosmes. Some kinases, referred to as "dual specificity^' 
kinases, are able to phosphorylate on cyrosine as v.-eil as 
serine,''threonine residues. 

Protein kinases can also be characterized by their 
locac^on within the cell. Some kinases are transmembrane 
receptor-type proteins capable of directly alcering their 
cataiycic activity m response to the external environmenc 
such as the binding of a ligand. Others are non- receptor- 
type proceins lacking any ■ transmembrane domain. Thev can be 
found m a variecy of cellular comparrmencs from the inner 
surrace of the cell m.embrane to the nucleus. 

-Many kinases are involved in regulatory cascades 
1^ v/herein their subsrrates may include other kinases whose 

accivicies are regulated by their phosphorylation state. 
Ultimately the activity of some downstream effector is 
modulated by phosphorylation resulting from activation of 
such a pathway. 

Protein kinases are one of the largest families of 
eukaryotic proteins with several hundred known m.embers. 
These proteins share a 250-300 ammo acid domain that can be 
subdivided into 12 distinct subdomams that comprise the 
com.m.on catalytic core structure. These conserved protein 
motifs have recently been exploited using PCR-based cloning 
strategies leading to a significant expansion of the known 
kinases . 

Multiple aiignmient of the sequences m the catalytic 
aom,ain of protein kinases and subsequent parsimony analysis 
psrm.ir-s the segregation of related kinases into distinct 
branches or subfamilies including: tyrosine kinases, 
cyclic -nuclectide-dependent kinases, cal cium/ calmodul in 
kinases, cycl in-dependent kinases and -M^vP - kinases , serme- 



20 
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Kinase rececc 



and several other less define-:; 



suDi arr.i 



lies . 
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20 



25 



Through che use of a targeted FCR cicning scracegy and 
of a ''motif extraccion" bioinf ormac ics scripc , mammalian 
m.embers of the STE20-kina3e family have been identified as ■ 
part: of the presenc invention. Mulciple alignment and 
parsimony analysis of the cacalycic domain of all of chese 
STE20- family members reveals that these proteins cluscer ■ 
into J? distinct subgroups. Classification in this manner 
has proven highly accurate noz only in predicting motifs 
presenc in the rem^aining non- ca taiyt ic portion of each 
protein, but also in their regulation, subsrraces, and 
signaling pathv/ays. The present invention includes the 
parcial or comipleie sequence of new memtbers of the STE2 0- 
family, their classification, predicted or deduced protein 
structure, and a strategy for elucidating their biologic and 
therapeutic relevance. 

Thus, a first aspect of the invention features an 
isolated, enriched; or purified nucleic acid molecule 
encoding a kinase polypeptide selected from the group 
consisting of STLK2 , STLK3 , STLK4 , STLKS , STLK6 , STLK7 , ZCl , 
ZC2, ZC3, ZC4 , KHS2, .SULUl , SULU3 , GEK2 , PAK4 , and PAK5 . 

. Bv "isolated" in reference to nucleic acid is m.eant a 
polymer of nucleotides conjugated to each other, including 
DNA and RNA, chat is isolated from, a nacural source or that 
is synthesized. The isolated nucleic acid of the presenc 
invention is unique in the sense that ic is not found in a 
pure or separated state in nature. Use of the term 
"isolated" indicates that a naturally occurring sequence has 
been removed from, its normial cellular (i.e., chrom.osomal ; 
environment. Thus, the sequence m.ay be in a cell -free 
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solution or placed m a different cellular environ-enc. Tne 
term does not x^ply chat -he sequence is zhe only nucleotide 
Cham presenc, but chac ic is essenrially free (aboue 90 - 
95% pure at least) of non-nucleotide material nacurally 
associated with it, and thus is distinguished from isolaced 
chromosomes . 

3y the use of the term "enriched" in reference co 
nucleic acid is meant thac the specific DMA or RNA sequence 
ccnscicutes a significantly higher fraction ?2 - 5 fold) of 
Che tocal DIMA or RMA presenc in the cells or solucion of 
interes- than in normal or diseased cells or in che cells 
from which Che sequence was taken. This could be caused bv 
a person by preferential reduction m the amounc of other 
DMA or RMA present, or by a preferential increase in che 
amounc of the specific DMA or RMA sequence, or by a 
comiinacicn of the cwo. However, it should be noced that 
enriched does not imply thac there are no other DMA or RMA 
sequences presenc, just thac the relacive amount of che 
sequence of incerest has been signif icancly increased. The 
cerm. "significant" is used to indicate that the level of 
increase is useful to the person m.aking such an increase, 
and generally m.eans an increase relacive co other nucleic 
acids of about at least 2 fold, more preferably at least 5 
to IC fold or even more. The term also does not im.piy that 
there is no DMA or RMA from other sources. The other source 
DMA may, for exam.ple, comprise DMA from a yeast or bacterial 
genom.e, or a cloning veccor such as pUCi3. This term 
discmguishes from nacurally occurring evencs, such as viral 
infeccion, or tumor cype growchs, in which the level of one 
mRMA may be nacurally increased relative cc other species of 
nfiRNA. That is, the cerm is meant to cover only chose 
situations m. which a person has incervened co elevace che 
proportion of the desire-d nucleic acid. 
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Iz is also advantageous for some purposes zhat 

nucleoside sequence be in purified form. The cerm 
"purified'' in reference to nucleic acid does noc require 
absoluce purity (such as a homogeneous preparation) . 
5 Instead, ic represents an indication thac the sequence is 

relatively more pure than in the natural, environm.ent 
(com.pared zo the natural level this level should be at least 
2-5 fold gx^eacer, e.g., in terms of mg/m.L) . Individual 
clones isolated from a cDNA library may be purified to 

10 elec trophoret ic homiogenei ty . The claimed DMA molecules 

obtained from these clones could be obtained directly from 
totcLl DMA or from tocal RMA . The cDNA clones are noc 
nacurally occurring , buc racher are preferably obtained via 
manipulation of a partially purified naturally occurring 

15 substance (messenger RNA} . The conscruccicn of a cDNA 

library from m^RNA involves the creation of a synchecic 
substance (cDNA) and pure individual cDNA clones can be 
isolated from: the synthetic library by clonal selection of 
the cells carrying the cDNA library. Thus, the process 

20 which includes the construction of a cDNA library frcm m.RNA 

and isolation of distinct cDNA clones yields an 
approximately IC^-fold purification of the native m.essage . 
Thus, purification of at least one order of magnitude, 
preferably tv/o or three orders, and more preferably four or 

25 five orders of m^agnitude is expressly contemplated, ■ 

By a "kinase polypeptide" is meant 3 2 (preferably 40, 
m.ore preferably 45, most preferably 55) or m.ore contiguous 
amino acids set forth in the amino acid sequence of SEQ ID 
NO: 5, SEQ ID MC : .6 , or SEQ ID NO : 7 , or the corresponding 

?tJ full-length amino acid sequence; 250 (preferably 255, more 

preferably 250, most preferably 270) or mo^re contiguous 
am.ino acids set forth in the "amdno acid sequence SEQ ID 
MO: 13, SEQ ID NO: 14, SEQ ID MO: 15, or SEQ ID NO: 105, or the 

5 
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corresponding full -length amino acid seauence ; 2 7 
(preferably 30, more preferably 40, most preferably 45; or 
more contiguous amino acids set forch in the am:ino acid 
sequence SEQ ID NO: IS; 16 (preferably 20, m.ore preferablv 
^ 25, m-ost preferably 35) or more contiguous amino acids sec 

forth in the amtino acid seauence SEQ ID NO: 22, SEQ ID NO: 23, 
SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID NO : 3 1 , or SEQ ID NO : 1 0 3 
or the corresponding full-length am:inc acid sequence; o 
(preferably 10, more preferably 15, most preferably 25) or 

10 more contiguous amino acids set forth in the amino acid 

sequence of SEQ ID NO: 97 or SEQ ID NO: 99, 22 (preferablv 30, 
more preferably 35, mosc preferably 45) or more conciguous 
ammo acids set forch in the amino acid sequence of SEQ ID 
NO: 101, or che corresponding full-length amino acid 

15 sequence; 73 (preferably 80, more preferably 35, raosz 

preferably 90) or more ccnciguous amtino acids set forth in 
the amino acid sequence SEQ ID NO: 107 or functional 
derivatives thereof as described herein. For sequences for 
which the full-length sequence is not given, the rem.aining 

20 sequences can be determdned using methods v.-eil -known co 

those in the art and are intended to be included m the 
invention. In certain aspects, polypeptides of 100, 200, 
3 00 or miore am.ino acids are preferred. The kinase 
polypeptide can be encoded by a full-length nucleic acid 

25 sequence or any portion of the full-length nucleic acid 

sequence, so long as' a functional activity of the 
polypeptide is retained, not to include fragm^ents containing 
only amino acids 1-22 of SEQ ID NO: 13 or onlv amino acids 1- 
33 of SEQ ID NO : 107 . 

-'0 -"he am.ino acid sequence v/ill be substantially sim.ilar 

CO the sequence shown in SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID- 
NO : 7 , SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO : 1 5 , SEQ ID 
NO: IS. SEQ ID NO: 22, SEQ ID NO : 2 3 , SEQ ID NO : 2 4 , SEQ ID 
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NO:29, SEQ ID NO : 3 1 , SEQ ID MC : 9 7 , SEQ ID NO : 9 9 , SEQ ID 
MO: 100, SEQ ID NO: 103, SEQ ID NO: 105, or SEQ ID NO: 107, or 
che corresponding full-length amino acid sequence, or 
fragn^.entis thereof, noc to include fragments consisting onl-/ 
5 of the amino acid sequences 1-22 of SEQ ID NO: 13 or 1-33 of 

SEQ ID NO: 107. A sequence that is subst anc i al ly similar co 
-he sequence of SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID NO : 7 , SEQ 
ID NO: 13, SEQ ID NO: 14, SEQ ID NO : 1 5 , SEQ ID NO : 1 S , SEQ ID 
NO: 22, SEQ ID NO : 2 3 , SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 

10 N0:31, SEQ ID NO:97, SEQ ID NO : 9 9 , SEQ ID NO : 1 0 1 , SEQ ID 

NO: 103, SEQ ID NO : 1 0 5 , or SEQ ID NO: 107 v;ill preferably have 
at: least 90% identity (mLore preferably ac lease 95% and mosc 
preferably 99-100%) to the sequence of SEQ ID NO : 5 , SEQ ID 
NO: 6, SEQ ID NO : 7 , SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, 

15 SEQ ID NO:lS, SEQ ID NO:22, SEQ ID NO : 2 3 , SEQ ID NO:24, SEQ 

ID NO: 29, SEQ ID NO : 3 1 , SEQ ID NO.: 97, SEQ ID NO::99, SEQ ID 
N0:101, SEQ ID NO:lC3, SEQ ID NO : 1 0 5 , or SEQ ID NO:lC7. 

By "identity" is meanc a property of sequences chat 
measures cheir similarity or relationship. Identity is 

20 m.easured by dividing the nuTiber of identical residues by the 

total numjoer of residues and gaps and multiplying the 
product by 100. "Gaps" are spaces in an alignment: that are 
che result of additions or deletions of amino acids. Thus;, 
cv/o copies of exactly the sam.e sequence have 100% identity, 

25 but sequences that are less highly conserved, and have 

deletions, addicions, or replacements, may have a lower 
degree of identity. Those skilled in the art will recognize 
that several com.pucer programts are available for determining 
sequence identity using standard parameters, for example 

30 Blase (Al-schul, ec ai . (1997) Nucleic Acids Res. 25:3339- 

3402;, Blasc2 (Altschul, ec al.' (1990) J. mol . biol . 
215:403-410), and Smi th- Wat erman (Smich, ec ai . (1931) J. 
Mcl . B-icl , 147:195-197). 
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In preferred embodimencs, the invention features 
isolated, enriched, or purified nucleic acid molecules 
encoding a kinase polypeptide comprising a nucleotide 
sequence that: (a) encodes a polypeptide having the arrdno 
acid sequence set forth in SEQ ID NO : 5 , SEQ ID NO : S , SEQ ID 
NO:7, SEQ ID MO : 1 3 , SEQ ID NO:14, SEQ ID NO:l5, SEQ ID 
MO:iS, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID 
NO:29, SEQ ID NO : 3 1 , SEQ ID NO:97, SEQ ID N0:5S, SEQ ID 
NO: 101, SEQ ID MO: 103, SEQ ID WO: 105, or SEQ ID NO: 107; (b> 
is the complement of the nucleotide sequence of {a} ,- (c) 
hybridizes under highly stringent conditions to the 
nucleotide n^olecule of (a) and encodes a naturally occurrinc 
kinase polypeptide; (di encodes a kinase polypeptide having 
the amino acid sequence of ' SEQ ID NO : 5 , SEQ ID NO: 6., SEQ ID 
15 NO:-, SEQ ID N0:13, SEQ ID NO:14, SEQ ID NO : 1 5 , SEQ ID 

NO: IS, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID 
NO: 29, SEQ ID NO : 3 i , SEQ ID NO: 97, SEQ ID NO: 99, SEQ ID NO: 
103, SEQ ID NO: 105, or SEQ ID NO: 107, except that it lacks 
cne or rrore, but not all, of the following segments of amino 
20 acid residues: i-21, 22-274, or 275-416 of SEQ ID NO : 5 , 1 

31, 32-303, 3G9-4S9 or 490-516 of SEQ ID NO : 5 , 1-178 or 179- 
414 of SEQ ID N0:7, 1-22, 23-239, 290-526, 527-640, 641-S96, 
cr 397-1239 of SEQ ID NO:13, 1-255, 256-442, 443-626, 627- 
554, or 955-1297 of SEQ ID N0:14, 1-255, 256-476, 477-6S0, 
25 6S1-9S3, or 9S4-1326 of SEQ ID NO:15, 1-13, 14-273, 274-346, 

347-534, or 535-894 of SEQ ID NO:lS, 1-21, 22-277, 278-427, 
42S-637, 638-751, or 752-89S of SEQ ID NO:22, 1-65, 67-215, 
216-425, 425-539, 540-786, or 787-887 of SEQ ID NO:22, 1-25, 
26-273, 274-422, 423-632, or 633-74S of SEQ ID NO:24, 1-51, 
30 52-224, 225-393, 394-553, or 659-6S1 of SEQ ID NO:2S, 1-25, 

26-2;:.l, 284-430, 431-540, 641-754, 755-901, or 902-1001 of 
SEQ ID NO: 31, 1-10, 11-321, or 322-373 of SEQ ID NO: 97, 1- 
57, 58-359, or 370-418 of SEQ ID NO:99, 1-52, 53-173, 174_ 
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307, 3CS-572, or 575-591 of 3EQ ID NC:103, 1-24, 25-289, 
290-357, 393-523, 629-372, or 373-1227 of SEQ ID NC:1C5, or 
1-33, 34-294, 295-337, 33S-472, 473-724, or 725-963 of SEQ 
ID NO: 107; (e) is the complement of zhe nucleotide sequence 
5 of (d) ; (f ) encodes a polypeptide having tiie arr.mo acid 

sequence sec forch in SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID NO : 7 , 
SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO : 1 5 , SEQ ID NO: IS, SEQ 
ID NO: 22, SEQ ID NO : 2 3 , SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 
NO: 31; SEQ ID NO: 97, SEQ ID NO: 99, SEQ ID NO: 103, SEQ ID 
10 NO:lC5, or SEQ ID NO: 107 from amino acid residues 1-21, 22- 

274, or 275-416 of SEQ ID NO : 5 , 1-31, 32-308, 309-439, or 
490-515 of SEQ ID NO : 6 , 1-178 or 179-414 of SEQ ID. N0:7, 23- 
289, 290-526, 527-640, 641-895, or S97-1239 of SEQ ID N0:13, 

I- 255, 255-442, 443-526,- 627-954, or 955-1297 of SEQ ID 

15 NO:14, 1-255, 256-476, 477-680, 681-983, or 984-1326 of SEQ 

ID NO:15, 1-13, 14-273, 274-346, 347-534, or 535-394 of SEQ 
ID N0:1S, 1-21 22 -277 , 278-427, 423-63':', 638-751, or 752 - 
8 93 of SEQ ID NO :22, 1-66, 67-215, 216-425, 425-539, 54 0- 
78 6, or 78 7 -88 7 of SEQ ID NO:23, 1-25, 26-273, -274-422, 423- 

20 632; or 633-748 of SEQ ID NO:24, 1-51, 52-224, 225-393, 3 9.4- 

658, or 659-581 of SEQ ID NO : 2 9 , 1-25, 26-281, 282-430, 431- 
640, 641-754, 755-901, or 902-1001 of SEQ ID NO : 3 1 , 1-10, 

II- 321, or 322-373 of SEQ ID NO: 97, 1-57, 58-359, or 370-418 
of SEQ ID NO:99, 1-52, 53 -173, 1 74 - 3 0 7 , 3 0 3 - 5 7 2 , or 573-591 

25 of SEQ ID NO:103, 1-24, 25-289, 290-397, 3 9 8 - 6 2 £ , 6 2 9 - 8 7 2 , 

or 873-1227 of SEQ ID NO:105,or 1-33, 34-294, 295-337, 338- 
472, 473-724, or 725-963 of SEQ ID NO:107; (g) is che 
con-.plemenc of che nucleotide sequence of (f ) ; (h) encodes a 
polypeptide having the amino acid sequence sec forch in SEQ 

30 ID NO: 5, SEQ ID NO : 5 , SEQ ID NO : 7 , SEQ ID NO : 1 3 , SEQ ID 

NO:14, SEQ ID NO : 1 5 , SEQ ID N0:13, SEQ ID NO : 2 2 , SEQ ID 
NO: 23, SEQ ID NO : 2 4 , SEQ ID NO : 2 9 , SEQ ID NO : 3 1 , SEQ ID 
NO: 97, SEQ ID NO : 9 9 , SEQ ID NO: 101, SEQ ID NO : 1 0 3 , SEQ ID 
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NO:10., or SEQ ID IvO:107, excepc chat ic lacks one or more ' ' 
of the domains selected from the group consisting of a K- 
cermmal domain, a catalytic domain, a C-terminal domain, a 
coiled-coil scructure region, a proiine-rich region, a 
spacer region, an insert, and a C-term.inal tail; or (i) is 
the com.plement of the nucleoside sequence of (h) . 

The term "complement" refers to cwo nucleotides that 
can form multiple favorable interactions with one another. 
For example, adenine is complementary co thymine as they can 
form cwc hydrogen bonds. Similarly, guanine and cytosme 
are complem.entary since they can form three hydrogen bonds. 

A nucleotide sequence is che comiplement of another 
nucleocide sequence if all of the nucleotides of the firsc 
sequence are complem.entary to all of che nucleotides of che 
second sequence . 

The -erm "domain" refers to a region of a polypeptide 
which contains a particular function. For instance, N- 
terminal or C-terminal domains of signal transduction 
pro-eins can serve functions including, but not limited to, 
binding molecules that localize the signal transduction 
m.olecule to different regions of the cell or binding other 
signaling molecules directly responsible for propagating a 
particular cellular signal. Some dom.ains can be expressed 
separately from the rest of the protein and function by 
■themselves, while ethers must remain part of the intact • 
protein to retain function. The latter are term.ed 
functional regions of proteins and also relate to domains. 

The term "N-terminal domain" refers to the 
extracatalytic region located betv.-een the initiator 
methionine and the catalytic domain of the protein kinase. 
The N_ terminal dom.ain can be identified following a Smith- 
Waterm.an alignm^ent of the protein sequence against the non- 
redundant protein database to define the N-termmal boundarv 
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of the cacaiytic dcTiain. Depending on its length, che N- 
termmai domain may or m.ay not play a regula::ory role in 
kinase function. An example of a protein kinase whose N- 
terminal domain has been shown to play a regulatory role is 
5 PAK6 5, v;hich contains a CRIB motif used for Cdc4 2 and rac 

binding (Burbelc, P.D. ec al . (1995) J. Biol. Chem. 270, 
2 9G71 -2 90740) . 

The N- terminal domain spans amino acid residues 1-21 of 
the sequence set forth m SEQ ID NO : 5 , amino acid residues 

10 1-3 1 of the sequence set forth m SEQ ID NO : 6 , amino acid 

residues 1-22 of the sequence sen forth in SEQ ID HO: 13, 
amino acid residues 1-13 of the sequence sec forth m SEQ ID 
NO: IS, amiino acid residues 1-21 of the sequence sec forth in 
SEQ ID NO: 22, amiino acid residues 1-25 of che sequence set 

15 forch in SEQ ID NO: 24, amino acid residues 1-51 of the 

sequence set forth in SEQ ID NO : 2 9 , amino acrd residues 1-25 
■ of the sequence set forch in SEQ ID NO: 31, am.ino acid 
residues 1-57 of the sequence set forch xn SEQ ID NO: 99, 
amino acid residues 1-52 of the sequence, sec fcrch m SEQ ID 

20 N0:1G3, amino acid residues 1-24 of the sequence sec forth 

in -SEQ ID NO: 105, or ammo acid residues 1-33 of che 
sequence sec forch m SEQ ID NO: 107. 

The cerm "catalytic domain" refers co a region of the 
protein kinase uhat is typically 25-3 00 am.ino acids long and 

25 is responsible for carrying out the phosphate transfer 

reaction fromi a high-energy phosphate donor molecule such as 
ATP or GTP to icself (autophosphorylat ion) or co -other 
proteins (exogenous phosphorylac ion) . The CcLcalycic domain 
of procein kinases is made up of 12 subdom.ains that contain 

30 highly conserved ammo acid residues, and are responsible 

. for proper polypepcide folding and for caccilysis . The 
cacaiytic domain can be identified following a Smich- 
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Waterman alignment of zhe protein sequence against the non- 
redundan- protein database. 

The catalyric dom.ain spans amino acid residues 22-274 
of the sequence set forth in SEQ ID NO : 5 , residues 32-30S of 
zhe sequence set forth in SEQ ID MO: 6, residues 1-178 of che 
sequence set forth in SEQ ID NO : 7 , residues 23-2S9 of the 
sequence set forth in SEQ ID NO: 13, residues 1-2 55 of the 
sequence set forth in SEQ ID NO: 14, residues 1-255 of the 
sequence sec forch in SEQ ID NO: 15, residues 14-273 of the 
sequence set forth in SEQ ID NO: 18, residues 22-277 of che 
sequence set forch in SEQ. ID NO: 22, residues 1-66 of che 
sequence set forth in SEQ ID NO: 23, residues 25-273 of the 
sequence set forth in SEQ ID NO: 24, residues 394-65S of the 
sequence sec forch in SEQ ID NO: 29, residues 26-231 of che 
sequence set forth in SEQ ID NO:31, residues 1-273 of the 
sequence set forth in SEQ ID NO:97, residues 58-369 of the 
sequence set forth in SEQ ID NO: 99, residues 1-103 of the 
sequence set forth in SEQ ID NO: 101, residues 30S-572 of the 
sequence set forth m SEQ ID NO: 103, residues 25-239 of the 
sequence sec forth in SEQ ID NO: 105, or residues 34-294 of 
the sequence set forth in SEQ ID NO: 107. 

The cerm "catalytic activicy" , as used herein, defines 
the rare at which a kinase cataiycic domain phosphorylates a 
subscrace. Cacalytic activity can be m.easured, for example, 
by determining the am:Ount of a subscrace converted co a 
phosphorylated produce as a function of time. Catalytic 
ac- ivity can be measured by methods of the invention by 
holding cime constant and determining che concencrac ion of a 
phosphorylated subscrace after a fi.-ed period of time. 
Phosphorylation of a substrate occurs ac the accive-sice of 
. a pro-em kinase. The active-site is normally a cavity in 
which the substrate binds to the protein kinase and is 
phosphorylated . 
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The cernr, ''subscrace" as used herein refers zo a 
molecule phosphorylat ed by a kinase of che invention. 
Kinases phosphoryla te subscraces on serine/threonine or 
tyrosine amino acids. The molecule may be another protein 
5 or a polypeptide. 

The term ''C- terminal dom.ain" refers to the region 
located between the catalytic domain or the last ; located 
closest to the C-term.inus) functional domain and the 
carboxy- terminal amino acid residue of the pi'otein kinase. 

10 By "functional" domain is meant any region of the 

polypeptide that may play a regulatory or catalytic role as 
predicted from amino acid sequence homology to other 
proteins or by the presence of am.mo acid sequences that may 
give rise to specific structural conformations (i.e. coiled- 

15 coils) . The C-terminal domain can be identified by using a 

Smith-Waterman alignm.ent of the protein sequence against the 
non- redundant protein database to define the C-terminal 
boundary of the catalytic domain or of any functional C- 
term.inal extracatalyt ic domtain. Depending on its length and 

20 amino acid composition, the C-terminal domain may or may not 

play a regulatory role in kinase function. An examiple of a 
protein kinase v/hose C-terminal domain m^ay play a regulatory 
role is ?AK3 v;hich contains a heterot rim^eric G. subunit- 
binding site near its -C-termtinus (Leeuv/, T. et al (1998) 

25 Nature, 391, 191-195). 
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or the sequence ser forth in SEQ id N0:22, residues 275-330 
of -he sequence sec forch in SEQ ID NO: 97, residues 370-418 
of cne sequence set forth in SEQ ID NO: 99, or residues S73- 
1227 of the sequence set forth in SEQ ID NO: 105. 

The term "signal transduction pathway" refers co the 
rr.olecules thac propagate an extracellular signal through the 
cell membrane to become an intracellular signal. This 
signal can then s-in-.ulace a cellular response. The 
polypeptide m.clecules involved in signal transduction 
processes are. typically receptor and non-receptor protein 
tyrosine kinases, receptor and non- receptor protein 
phosphatases, SRC homology 2 and 3 domains, phosphotyrosi.ne 
binding proteins (SRC homology 2 (SH2) and phosphotyrosine 
binding (?TB and FH^ domain containing proteins), prolme- 
rich binding proteins {SH3 dom.ain containing protei.ns) , 
nucleotide exchange factors, and transcription factors. 

The term "coiled-coil structure region" as used herein, 
refers to a polypeptide sequence that has a high probabilitv 
of adopting a coiled-coil structure as predicted oy computer 
algorithms such as COILS (Lupas, A. (1996) Meth. Enzvmologv 
265:513-525). Coiled-coils are formed by two or three 
amphipatnic «-heiices m parallel. Coiled-coils can bind to 
coiled-coil domains of other polypeptides resulting m hom>o- 
or heterodimers (Lupas, A. (1991) Science 252 : 1162 - 1164 ) . 
Coiled-coil-depsndent ol igom.eri zat ion has been shown to be 
necessary for protein function including catalytic activity 
of serine/threonine kinases (Roe, J. et aJ . (1997) J. Biol' 
Chem. 272 : 5833-5845 . 

The coiied-coil structure region spans amino acid 
residues 290-525 of the sequence set forth in SSQ ID NO: 13, 
residues 255-442 of the sequence set forth in SSQ id NO: 14,' 
.residues 256-475 of tne sequence set forth in SEQ ID. NO: 15,' 
residues 423-637 of the sequence set forth in SEQ ID NO:22! 

H 
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residues 216-425 or 540-7S6 of che sequence set forch in SEC 
ID NO: 23, residues 423-632 of the sequence sec forth in SEQ 
ID NO:24, residues 431-640 or 755-901 of the sequence sen 
forth in SEQ ID NO : 3 1 , residues 291-398 or 629-668 of the 
5 sequence set forch in SEQ ID NO: 105, or residues 473-724 or 

725-968 of the sequence sec forth in SEQ ID NO: 107. 

The term "prol ine - rich region" as used herein, refers 
to a region of a protein kinase whose proline content over a 
given amiino acid length is higher chan the average concent 

10 of this amino acid found in prot eins { i . e . , >10%) . Proline- 

rich regions are easily discernable by visual inspection of 
amino acid sequences and quanticated by scandard compucer 
sequence analysis programs such as che DNAStar program 
EditSeq. ?roline-rich regions have been demonstraced co 

15 parcicipace in regulatory protein -procein interact ions . 

Among chese interactions, chose chac are m.ost relevant to 
this invention involve the "PxxP" proline rich motif found 
in cerca:Ln procein kinases (i.e., human PAKl ) and the SH3 
domain of the adaptor molecule Nek (Galisteo/ M.L. ec al . 

20 (1996) J. Biol. Chem.. 2 71:20997-21000): Other regulacory - 

inceraccions involving "Pxx?" proline-rich motifs include 
the WW domain (Sudol, M. (1996) Prog. Biochys . Mol . Bio. 
65 : 113-132; . 

The proline-rich region spans amino acid residues 527- 
25 640 of the sequence set forth in SEQ ID NO: 13, residues 443- 

626 of che sequence set forth in SEQ ID NO: 14, residues 477- 
680 of the sequence set forth m SEQ ID MO: 15,- residues 347- 
534 of the sequence sec forth in SEQ ID MO : 18 , residues 3 98- 
62 3 of the sequence sec forch m SEQ ID NO: 105, or residues 
30 333-4 72 of the sequence set forth m SEQ ID NO: 107. 

The term "spacer region" as used herein, refers co a 
region of the protein kinase located becween predicced 
functional dom.ains. The spacer region has no deteccable 

16" 
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hcrnology to any amine acid sequence in the dacabase 
be identified by using a Smi th- Waterman alignment of the 
procein sequence against the ncn - redundant prccein database 
to define che C- and N-tsrminai boundaries of che flanking 
5 functional domains. Spacer regions may or may not play a 

fundamental role in protein kinase function. Precedence for 
the regulacory role of spacer regions in kinase function is 
provided by the role of che src kinase spacer m inter- 
domam interactions (Xu, W. st al. (1997) Nature 335:595- 
10 602) . 

The spacer region spans amine acid residues 64 1-896 of 
the sequence se- forth inSEQ ID N0:13, residues 627-954 of 
che sequence set forth m SEQ ID NO: 14, residues 5S1-983 of 
the sequence sec forth in SEQ ID NO: 15, residues 274-346 of 
the sequence set forth in SEQ ID NO: IS, residues 278-427 or 
■538-751 of the sequence set forth in SEQ ID NO: 22, residues 
S~-215 or 426-539 cf the sequence set forth in SEQ ID NO: 23, 
residues 274-422 or 633-748 of the sequence sec forth m SEQ 
ID NO:24, residues 225-393 of the sequence set forth in SEQ 
ID NO:29, residues 2S2-43 0 or 641-754 of the sequence set 
. forth in SEQ ID NO: 31, residues 174-307 of che sequence set 
forth in SEQ ID NO: 103, residues 569-872 of the sequence set 
forth in SEQ ID NO: 10-5, or residues 295-337 of the sequence 
set forth in SEQ ID NO: 107. 

The term "insert" as used herein refers to a portion of 
a protein kinase that is absent from a close hom.olog. 
Inserts may or may not by the product alternative splicing 
of exons. Inserts can be identified by using a Sm.ith- 
Waterman sequence alignm.ent of the protein sequence against 
the non-redundant protein database, or by means of a 
multiple sequence alignment of homologous sequences usmq 
the DNAStar program .Megalign. Inserts may play a functional 
role by presenting a nev.^ interface for protein-protein 
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interactions, or by interfering with such interactions. 

Insercs span arr:ino acid residues 52-224 of che seauenze seu 
forch m SEQ ID NO: 29 or residues 53-173 of the sequence set 
forch in SEQ ID NO: 103. 
5 The term "C- terminal call" as used herein, refers co a 

C- terminal domain of a protein kinase, than by homology 
extends or protrudes past the C- terminal amino acid of its 
closest homolog . C- terminal tails can be idencified by 
using a Smi th- VJacerman sequence alignm.ent of che protein 

10 sequence against the non- redundant protein database, or by 

means of a multiple sequence alignm.ent of hom.clogous 
sequences using the DNAStar program. Megalign. Depending on 
its length, a C-terminal tail may or may not play a 
regulatory role in kinase function. 

15 The C-terminal tail spans amino acid residues 490-516. 

of the sequence set forth m SEQ ID NO : 6 , residues 737-337 
of the sequence set forth in SEQ ID NO; 23, residues 659-681 
of the sequence set forth in SEQ ID NO: 29, residues 994-1093 
of the sequence set forth in SEQ ID NO: 31, or residues 573- 

20 591 of the sequence set forth in SEQ ID NO: 103. 

Various low or high stringency hybridization conditions 
Tiay be used depending upon the specificity and selectivity 
desired. These conditions are well-knov;n to those skill^ed 
in the art. Under stringent hybridization conditions only 

25 highly com^plem.entary nucleic acid sequences hybridize. 

Preferably, such conditions prevent hybridization of nucleic 
acids having more than 1 or 2 mismatches out of 20 
contiguous nucleotides, mere preferably, such conditions 
prevent hybridization of nucleic acids having m.ore than 1 or 

30 2 mismatches out of 50 contiguous nucleotides, most 

preferably, such conditions prevent hybridization of nucleic, 
acids having m.ore than 1 or 2 mismatches out of 100 
contiguous nucleotides. In som^e instances, the conditions 

\1 
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may- prevent hybridization of nucleic acids having more than 
5 mismacrhes in the full-length sequence. 

By scringent hybridization assay conditions is w.sanz 
hybridi-acion assay conditions at least as stringent as the 
following: hybridization in 50% formamide, 3X SSC, 50 miM 
NaK.PO,, pH €.8, 0.5% SDS , 0.1 mg/mL sonicated salrr^.on sperm 
DNA, and 5X Denhart solution at 42 °C overnighc; washing with 
2X SSC, 0.1% SDS a- 45 °C ; and washing v/ith 0.2X SSC, 0.1% 
SDS ac 45 °C . Under some of the most stringenc hybridization 
assay conditions, -he second wash can be done with 0 . IX SSC 
ac a temperature up to 70 °C (Berger et al . (1987) Guide co 
Molecular Cloninc T^chn^n^^^^ pg 421, hereby incorporaced by 
reference herein including .any figure?, tables, or 
drawings.). Hov.'ever, other applications may require zhe use 
of conditions falling between these sets of conditions. 
Methods of _ determining the conditions required to achieve 
desired hybridizations are well-known to those with ordinary 

art, and are based on several factors, 
including but not liirated to, the sequences to be hybridized 
and the samples to be tested. 

In other preferred embodiraents , the invention features 
isolated, enriched, or purified nucleic acid molecules 
encoding kinase polypeptides, further Comprising a vector or 
promoter effective to initiate transcription in a host cell. 
2^ The invention also features recombinant nucleic acid, 

preferably in a cell or an organism. The recombinant 
nucieic acid may contain a sequence set forth in SSQ ID 
NO:l, SEC ID NO: 2, SEQ ID NO : 3 , SEQ ID NO : 9 , SSQ ID NO: 10, 
SEQ ID NG:11, SEQ ID NO : 1 7 , SEQ ID NO : 1 9 , SSQ ID NO : 2 0 , SEQ 
30 ID NO:21, SEQ ID NO : 2 7 , SEQ ID NO: 96, SEQ ID NO : 98 , SEQ ID 

NO: 100, SEQ ID NO: 102, SEQ ID NO: 104, or SEQ ID NO: 106, or a 
functional derivative thereof and a vector or a promoter 
errective to initiate transcription m a host cell. The 



20 
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reccnTLbinant nucleic acid can alternatively concain a 

cranscriptional iniciacion region functional in a ceil, a 
sequence complementary co an RNA sequence encoding a kinase 
polypeptide and a transcriptional terminacion region 
5 funccional in a cell. Specific vectors and host cell 

combinations are discussed herein. 

The term "vector" relates to a single or double- 
scranded circular nucleic acid molecule that can be 
cransfected into cells and replicated v/ithin cr 

10 independently of a cell genome. A circular double-stranded 

nucleic acid molecule can be cut and thereby linearized upon 
treatment v;ich rescriction enzymes. An assorcm;ent of 
nucleic acid vectors, restriction enzymes, and the knowledge 
of the nucleouide sequences cut by restriction enzymes are 

15 readily available to those skilled in the art. A nucleic 

acid m.olecule encoding a kinase can be inserted inco a 
vector by cutting uhe vector with restriction enzym^es and 
ligazing the tvjo pieces together. 

The term: " zransf ect ing" defines a number of methods eo 

20 insert a nucleic acid veczor or other nucleic acid miclecules 

into a cellular organism^. These miethods involve a variety 
of techniques, such as treating the cells with high 
concent razions of salt, an electric field, detergenz , or 
DiMSO CO render the outer mem^brane or v/all of the ceils 

25 permeable to nucleic acid molecules of interest or use of 

various viral transduction strategies. 

The termi "promoter" as used herein, refers to nucleic 
acid secuence needed for qene sequence expression. PromiOter 
regions vary fromi organism to organism, but are well known 

30 zo oerscns skilled in the art for different organisms. For 

example, m prokaryotes, the promioter region contains both 
zhe prom.ozer (which directs the iniziation of RNA 
transcripzion) as well as the DMA sequences whicii, v;hen 
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.n^ciacion 



NO: 24, SEQ ID NO 
NO : 99 , SEQ ID NO 



Such regions v/iii normally include those 5 ' -non- codino 
sequences involved with initiacion of cranscripcion and 
' translation, such as the TATA box, capping seauence, CA^^.T 
5 sequence, and the like. 

In preferred erribodiment s , the isolated nucleic acid 
comprises, consists essentially of, or consists of a nucleic 
acid sequence sec forth in SEQ ID NO : 1 , SEQ IB NO : 2 , SEQ ID 
N0:3, SEQ ID NO : 9 , SEQ ID NO : 1 0 , SEQ ID NO : 1 1 , SEQ ID NO : 1 7 ; 
10 SEQ ID NO: 19, SEQ ID NO : 2 0 , SEQ ID NO : 2 1 , SEQ ID NO : 2 7 , SEQ 

ID NO: 96, SEQ ID NO: 98, SEQ ID NO:iOO SEQ ID NO: 102, SEQ ID 
NO: 104, or SEQ ID NO: 106, or che corresponding full-length 
sequence, encodes che amino acid sequence of SEQ ID NO : 5 , 
•SEQ ID NO: 6, SEQ ID NO : 7 , SEQ ID NO: 13, SEQ ID NO : 14 , SEQ ID 
15 N0:15, SEQ ID NO:lS, SEQ ID NO:22, SEQ ID NO:-22,, SEQ ID 

:29, SEQ ID NO : 3 1 , SEQ ID NO: 97, SEQ ID 
.01, SEQ ID NO: 103, SEQ ID NO: ICS, or SEQ 
ID NO: 107, or che corresponding full-length amino acid 
sequence, a functional derivative cherecf , or at least 40, 
45, 50, 60, 100, 200, or 300 contiguous amino acids of SEQ 
ID NO: 5, SEQ ID NO : 6 , or SEQ ID NO : 7 , or of the 
corresponding full-lengch ammo acid sequence; ac least 250, 
255, 275, 300, or 400 contiguous am.ino acids of SEQ ID 
NO: 13, SEQ ID NO:14, SEQ ID NO : 1 5 , or of the corresponding 
2:> full-length am.ino acid sequence; at lease 27, 30, 35, 40, 

50, 100, -200, or 300 contiguous am.ino acids of SEQ ID NO: 18; 
ar least 16, 25, 35, 50, 100, 200, or 300 contiguous amino 
acids of SEQ ID NO : 2 2 , SEQ ID NO : 2 3 , SEQ ID NO: 24, SEQ ID 
NO:29, SEQ ID NO : 3 1 , or SEQ ID NO : 1 0 3 , or of the 
30 corresponding full-lengch amino acid sequence; 6 (preferablv 

10, m.ore preferably iS, mcsc preferably 25) or more 
conciguous ammo acids set forth in the amino acid sequence 
of SEQ ID NO: 97 or SEQ ID NO: 99, or che ccrrespondino full- 
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lengrh amino acid sequence; 22 (preferably 30, more 
preferably 35, most: preferably 45) or mere ccnciguous amino 
acids set forch m the amino acid sequence of SEQ ID NO: 101, 
or the corresponding full-length am.ino acid sequence; c-r ac 
5 least SO, 85, 90, 100, 200, or 300 contiguous amino acids of 

SEQ ID NO: 107, or functional derivatives thereof. The 
kinase polypeptides, selected from: the group consiscing of 
STLK2 , STLK3 , STLK4 , STLK5 , STLK6 , STLK7 , 2C1 , ZC2 , ZC3 , 
ZC4 , KHS2 , SULUl , SULU3 , GEK2 , ?AK4 , and FAK5 , comprise , 

10 consist essentially of, or consist of ac least at least 40, 

45, 50, 60, 100, 200, or -300 contiguous am.ino acids of SEQ 
ID NO: 5, SEQ ID MO : 6 , or SEQ ID NO : 7 ; at least 250, 255, 
275, 300, or 400 contiguous amino acids of SEQ ID NO: 13, SEQ 
ID NO:14, SEQ ID NO :■ 1 5 , or SEQ ID NO:105; at least 27, 30, 

15 35, 40, 50, lOG, 200, or 300 contiguous amino acids of SEQ 

ID NO:I3; at least 35, 40, 45, 50, 100, 200, or 300 
contiguous amino acids of SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID 
NQ:24, SEQ ID NO : 2 9 , SEQ ID NO : 3 1 or SEQ ID NO : 1 0 3 ; 6 
(preferably 10, more preferably 15, most preferably 25) or 

20 more contiguous amino acids sec forth in che arrdno acid 

sequence of SEQ ID NO: 97 or SEQ ID NO: 99; 22 (preferably 30, 
m.ore preferably 35, m.ost preferably 4 5) or more contiguous 
amtino acids set forth in zhe am.ino acid seauencs of SEQ ID 
NOrlOl; or at least 80, 85, 90, 100, 200, or 300 contiguous 

25 am.ino acids of SEQ ID NO: 107, or the corresponding full- 

.lengch sequences or derivatives thereof. The nucleic acid 
m.ay be isolated from a natural source by cDNA cloning or by 
subtract ive hybridization. The nacural source rr^ay be 
mammialian, preferably hum.an, blood, semten, or tissue, and 

30 the nucleic acid may be sync hesi zed by the criescer miethod 

or by using an autom.ated DNA synchesizer. 

The termi ''miammial" refers preferably to such organism.s 
as m.ice,. racs, rabbits, guinea pigs, sheep, and goacs, m.ore 
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preferably co cars, docs, monkeys, and apes, and most 
preferably co humans .- 

In yet other preferred embodimencs, the nucleic acid is 
a conserved or unique region, for example those useful for: 
the design of hybridization probes to facilitate 
Identification and cloning of additional polypeptides, che 
design of PGR probes to facilitate cloning of addicicnal 
polypep-ides, obtaining antibodies to polypeptide regions, 
and designing antisense ol igonucieocides . 

By "conserved nucleic acid regions", are meant regions 
present on two or rr.ore nucleic acids encoding a kinase 
polypeptide,. CO which a particular nucleic acid sequence can 
hybridize under lower stringency conditions. Examples of 
lower stringency conditions suitable for screening fcr 
nucleic acid encoding kinase polypepcides are provided in 
Abe, St ai. (J. Biol. Chem. 19:13361-13363, 19S2), hereby 
incorporated by reference herein in ins entirety, including 
any. drawings, figures, or tables. Preferably, conserved 
regions differ by no more than 5 out of 2 0 nucleotides, even 
rr.ore preferably 2 ouc of 2 0 nucleotides or most preferably 1 
out of 2 0 nucleotides. 

By "unique nucleic acid region" is m.eant a sequence 
present in a nucleic acid coding for a kinase polypeptide 
that is not present in a sequence coding for any other 
naturally occurring polypepcide. Such regions preferably 
encode 32 (preferably 40, m.ore preferably 45, most 
preferably 55) or more contiguous amino acids set forth m 
the amino acid sequence of SEQ ID NO : 5 , SEQ ID MO : 6 , or 3EQ 
ID b70:7,,or the corresponding full-length amino acid 
sequence; 2 50 (preferably 2 55, m.ore preferably 2 60, most 
preferably 270) or m.ore contiguous amino acids set forth m 
the am.inc acid sequence SEQ ID NO:13, SEQ ID NO: 14, or SEQ 
ID NO:15, or SEQ ID NO:105, or t.he corresponding full-length 
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amino acid sequence; 27 (p>-ef erabl\' 3 0 , more preferably 40, 
mosc preferably 45) or more concigucus amino acids ser fc-r"h 
in the amino acid sequence SEQ ID NO: IS; 16 (preferably 20, 
more preferably 25, mosc preferably 35) or more concicruous 
5 ammo acids sec forth in the amino acid sequence SEQ ID 

NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 
NO: 31, or SEQ ID NO: 103, or the corresponding full-iengch 
amino acid sequence; 6 (preferably 10, m.ore preferably 15, 
most preferably 25) or more contiguous amdno acids set: forth 

10 in the am.ino acid sequence of SEQ ID NO: 97 or SEQ ID NO: 99, 

22 (preferably 30, more preferably 35, most preferably 45) 
or more contiguous amino acids set forth in the amino acid 
sequence of SEQ ID NO: 101, or the corresponding full-lencrth 
am.ino acid sequence; or 7S .(pref erablv 30, more preferablv 

1^ S5, most preferably 90; or more contiguous amirno acids sec 

forth in the am.ino acid sequence SEQ ID NO: 107, or 
functional derivatives thereof. In particular, a unique : 
nucleic acid region is preferably of miammalian origin. 

A second aspect of the invention features a nucleic 

20 acid probe for the detection of nucleic acid encoding a 

kinase polypeptide in a sample, wherein said polypeptide is 
selected from the group consisting of STLK2 , STLK3 , STLK4 , 
. STLK5, STLK6, STLK7 , ZCl , ZC2 , ZC3 , ZC4 , KKS2 , SULUl , SULU3 , 
GEK2 , PAK4 , and ?AK5 . . Preferably, the nucleic acid probe 

25 encodes a kinase polypeptide that is a fragment of the 

protein encoded by the am.ino acid sequence set forth in SEQ 
ID NO: 5", SEQ ID NO : 6 , SEQ ID NO : 7 , SEQ ID NO: 13., SEQ ID 
NO: 14, SEQ ID NO: 15, SEQ ID NO : 1 5 , SEQ ID NO : 2 2 , SEQ ID 
NO: 23, SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID NO : 3 1 , SEQ ID 

30 NO: 97, SEQ ID NO: 99, SEQ ID NO: 101, SEQ ID NO: 103, SEQ ID 

N0^:105, or SEQ ID NO: 107, or the cor respondino full-length 
ammo .acid sequences, no~ to include fragments consisting 
only of amino acids 1-22 of SEQ ID NO: 13 or amiino 5;cids 1-3j 



iL3 
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of SEQ ID NO: 107. The nucleic acid probe conca.ns a 
nucleotide base sequence chac will hybridize co a secuence 
set forth in SEQ ID NO : 1 , SEQ ID MO : 2 , SEQ ID MO : 2 , SEQ ID 
NO: 9, SEQ ID MO: 10, SEQ ID NO: 11, SEQ ID NO : 1 7 , SEQ ID 
N0:i9, SEQ ID NO:20, SEQ ID N0:2i, SEQ ID NO:27, SEQ ID 
NO: 96, SEQ ID NO: 98, SEQ ID NO: 100, SEQ ID NO: 102, SEQ ID 
N0:104, or SEQ ID NO:106, or the corresponding full-length 
sequence, or a functional derivative thereof. 

In preferred embodiments, the nucleic acid probe 
hybridizes to nucleic acid encoding at least 6, 12, 75, 90, 
105, 120, 150, 200, 250, 300 or 350 contiguous amino acids 
or Che sequence set forth in SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ 
ID NO: 7, SEQ ID NO : 1 3 , SEQ ID NO: 14, SEQ ID NO : 1 5 , SEQ ID 
NO: IS, SEQ ID NO : 22 , SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID 
NO: 29, SEQ ID NO : 3 1 SEQ ID NO: 97, SEQ ID NO: 99, SEQ ID 
NO: 101, SEQ ID NO: 103, SEQ ID NO: 105, or SEQ ID NO: 107, or 
the corresponding full-length amino acid sequence, or 
functional derivatives thereof. 

Methods for using the probes include detecting the 
20 presence or amount of kinase RNA in a sample by contacting 

the sample v/ich a nucleic acid probe under conditions such 
that hybridization occurs and detecting the presence or 
amount of the probe bound to kinase RNA. The nucleic acid 
duplex formed between the probe and a nucleic acid sequence 
25 coding for a kinase polypeptide m.ay be used in the 

identification of the sequence of the nucleic acid detected 
(Nelson et al . , in Ncnisotopic DNA Probe Techniques, 
Academ.ic Press, San Diego, Kricka, ed . , p. 275, 1992, herebv 
incorporated by reference herein in i ts ' en- ire ty , including" 

any drawinas, zia.:r--s ,-.r- r-=>,-i^-N v---^ --- _ . . 

- — ^' -^'-ieo,/ . .■\j._s i.or perrorm.in.Q such 

methods m.ay be constructed to include a container mean- 
having disposed therein a nucleic acid orcbe . 
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In a -hird aspecc, the invention describes a 

recorr±;-inant cell or cissue comprising a nucleic acid 
mclecuie encoding a kinase polypeptide selecced from che 
group consisting of 3TLK2 , STLK3 , STLK4 , STLK5 , STLK6 , 
5 STLK7, ZCl, ZC2 , ZC3 , ZC4 , KHS2 , SULUl , SULU3 GEK2 , FAK4 , 

and PAK5 . In such cells, the nucleic acid may be under the 
control of the genomic regulatory elements, or may be under 
the control of exogenous regulatory elem.ents including an 
exogenous promoter. By "exogenous*' it is m.eant a oromiocer 
10 that IS not normally coupled in v^ivo transcriptionally co 

the coding sequence for the kinase polypeptides. 

The polypeptide is preferably a fragment of the protein 
encoded by the amiino acid sequence set forth in SEQ ID MO : 5 , 



SEQ I 



D NO: 6, SEQ ID NO : 7 , SEQ ID NO: 13, SEQ. ID NC 



15 NO: 15, SEQ ID NO:13, SEQ ID NO:22, SEQ ID NO : 2 3 , SEQ ID • 

-NO: 24, SEQ ID NO : 2 9 , SEQ ID NO : 3 1 , SEQ ID NO: 97, SEQ ID 
NO: 99, SEQ ID NO : 1 0 1 , SEQ ID NO: 103, SEQ ID NO: 105, or SEQ 
ID NO: 107, or the corresponding full-length amino acid 
sequence, not to include fragm^ents consisting only of amino 

20 acids 1-22 of SEQ ID N0:13 or amino acids 1-33 of .SEQ ID 

N0:1C7. By "fragment," is m^eant an am.ino acid sequence 
present in a kinase polypeptide. Preferably, such. a 
.sequence comprises at least 32, 45, 50, 60, 100, 2 00, or 300 
contiguous amino acids of SEQ ID NO : 5 , SEQ ID NO: 6,- or SEQ 

25 ID NO : 7 , or of the corx-esponding full-length amino .acid 

sequence; at least 250, 255, 275, 300, or 400 contiguous 
amino acids of SEQ ID NO: 13, SEQ ID NO: 14, SEQ . ID NO : 1 5 , OR 
SEQ ID NO: 10 5, or of the corresponding full-length am.ino 
acid sequence; at least 27, 30, 35,. 40, 50, 100, 200, or 300 

30 contiguous amino acids of SEQ ID NO: 13; at least 16, 25, 35, 

50,-100, 200, or 300 contiguous am.ino acids of SEQ ID NO:22, 
SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID NO : 3 1 or 
SEQ ID NO: 103, or of the corresponding full-length amino 

/J 
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acid sequence; 6 (preferably 10, mere preferabl'-/ 15. mosz 

preferably 25) or more contiguous amino acids sec forch ii- 

the amino acid sequence of SEQ ID NO: 97 or SEQ ID MO: 59, 2 2 



referably 30, more preferably 35, most preferablv 



/ c: 



5 more conciguous amino acids set forch in the amino acid 

sequence of SEQ ID MO: 101; at lease 73, 85, 90, 100, 200, or 
300 contiguous am.ino acids of SEQ ID MO: 107, or che 
corresponding full-length amino acid - sequence ; or a 
funccional derivative chereof . 
10 In a fourth aspect, che invention features an isoiaced, 

enriched, or purified kinase polypeptide selected "from the 
group consisting of STLK2 , STLK3 , STLK4 , STLK5 , STLK5 , 
STLK7, ZCl, ZC2, 2C3 , ZC4 , KHS2 , SULUl , SULU3 , GEK2 , PAK4 , 
and PAK5 . 

1^ By "isoiaced" in reference to a polypeptide is meanc a 

polymer of amino acids (2 or more am.ino acids) conjugated to 
each other, including polypeptides that are isolated from a 
natural source or that are synthesized. The isoiaced 
polypeptides of the present invention are unique in che 
20 sense chac chey are not found in a pure or separaced stace 

in nature. Use of the cermi "isolated" indicaces chac a 
nacurally occurring sequence has been removed from, ics 
normial cellular environm.ent . Thus, the seouence may be in a 
cell-free solution or placed in a different cellular 
25 environment . The term does not imply chat the sequence is 

che only amino acid chain present, but that it is 
essentially free (about 90 - 95% pure at lease) of non-am.ino 
acid miacerial nacurally associated v/ich ic. 

Sy the use of the cerm. "enriched'' in reference co a 
poiypepcide is meant chac che specific amine acid seauence 
consticuces a significantly higher fraccion (2 - 5 fold) of 
the tocal amiino acid sequences present in the cells or 
solution of interest chan in normial or diseased cells or in 
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- * the ceils from which the sequence was taken. This could be 

caused by a person by preferential reduction in the amount 
of other amino acid sequences present, or by a pref erent: ial 
increase in che amount of che specific arr;inc acid sequence 
5 of interest, or by a combination of che two. However, ic 

should be ncred chat enriched does not imply chat there are 
no other amino acid sequences present, just that the 
relative amount of the sequence of interest has been 
significantly increased. The term significant here is used 

10 to indicate that the level of increase is useful to the 

person making such an increase, and generally means an 
increase relative to other amino acid sequences of about at 
least 2-fcld, m-ore preferably at least 5- to 10-fold or even 
more. The term, also does not imply that there is no amino 

15 acid sequence from^ other sources. The other source of amino 

acid sequences may, for example, comprise amino acid 
sequence encoded by a yeast or bacterial genome, cr a 
cloning vector such as pUClS. The term, is meant to cover 
only those situations m which m.an has intervened to 

20 increase the proportion of the desired amine acid sequence.' 

It is also advantageous for somte purposes that an am.ino 
acid sequence be in purified form. The term, "purified" - in 
reference to a polypeptide does not require absolute Duriry 
(such as a homogeneous preparation) ; instead, it represents 

-5 an indication that the sequence is relatively purer than in 

the natural environment. Compared to the natural level this 
level should be at least 2-5 fold greater (e.g., in termis of 
m.g/mL) . Purification of at least one order of magnitude, 
preferably tv;o or three orders, and m.cre preferably four or 

?0 five orders of magnitude is expressl.y contemtplated . The 

substance is preferably free of cone am^inat ion at a 
functionally significant level, for example 90%, 95%, or 99% 
pure - 
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m preferred embodimencs, the kinase polvceocide is 
fragment: of che prccein encoded by the amino acid sequence 
set forrh in SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID NO : 7 , SEQ ID 
NO:13, SEQ ID N0:14, SEQ ID N0:15, SEQ ID NO : 1 3 , SEQ ID 
5 NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO : 2 9 , SEQ ID 

NO: 31. SEQ ID NO: 97, SEQ ID NO: 99, SEQ ID NO : 1 0 1 , SEQ ID 
NO: 103, SEQ ID NO: 105, or SEQ ID NO:lC7, or the 
corresponding full-lengch amino acid sequences, not to 
include fragments consisting only of amino acids 1-22 of SEQ 
10 ID NO: 13 or amino acids 1-33 of SEQ ID NO: 107. Preferablv, 

the :-cinase polypeptide contains at least 32, 45, SO, 60, 
100, 200, or 3 00 conciguous amino acids of SEQ ID NO : 5 , SEQ 
ID NO: 6, or SEQ ID NO : 7 , or che corresponding full-lengch 
amino acid sequence; at least 250, 255, 275, 300, or 400 
I:^ conciauous amino acids of SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID 

NO: 15, or SEQ ID NO: 105, or the corresponding full-length 
amino acid sequence; ac least 27, 30, 35, 40, 50, 100, 200, 
or 300 contiguous am.ino acids of SEQ ID NO: 18; at least 16, 
2d, j5, 50, 100, 200, or 300 contiguous amino acids of SEQ 
20 ID NO: 22, SEQ ID NO : 2 3 , SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 

NO:31, or SEQ ID NO:103, or the corresponding full-length 
ammo acid sequence ; 6 (preferably 10 , m.ore oref erablv 15 
most preferably 25) or m^ore contiguous am.ino acids set forth 
m the amino acid sequence of SEQ ID NO: 97 or SEQ ID NO: 99, 
22 (preferably. 30 , m.ore preferably 35, most preferably 45) 
or more contiguous amino acids set forth in the amano acid 
sequence of SEQ ID NO: 101, or the corresponding full-length 
am.ino acid sequence; or at least 7S, 85, 90, 100, 200, or 
300 contiguous amino acids of SEQ ID NO:107, or a functional 
3n derivative thereof . 

In preferred emLbodim.ent s , the kinase polypeptide 
com.prises an amiino acid sequence having (ai the am.ino acid 
sequence set forth m SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID NO : 7 , 

2S 
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SEQ ID NO: 15, SEQ ID NO: 14, SEQ ID NO : 1 5 , 5EQ ID NO : 1 S . SEQ 
ID NO: 22, SEQ ID NO : 2 3 , SEQ ID NO: 24, SEQ ID NO : 2 S , SEQ ID 
NO: 31, SEQ ID NO: 97, SEQ ID NO : 9 9 , SEQ ID NO: 101, SEQ ID 
NO: 103, SEQ ID NO: 105, or SEQ ID NO : 1 0 7 ,- (b) che araino acid 
5 sequence set forch in SEQ ID NO. :5, SEQ ID NO : 6 , SEQ ID 

NO: 7, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO : 1 5 , SEQ ID 
NO: IS, SEQ ID NO: 22, SEQ ID NO : 2 3 , SEQ ID NO : 2 4 , SEQ ID 
NO: 29, SEQ ID NO : 3 1 , SEQ ID NO: 97, SEQ ID NO: 99, SEQ ID 
NO: 103, SEQ ID NO: 105, or SEQ ID NO : 1 0 7 , excepc that ic 

10 lacks one or more, but noc all,, of the follov/ing segmen-s of 

amino acid residues: 1-21, 22-274, or 275-416 of SEQ ID 
NO:5, 1-31, 32-308, 309-489 or 490-516 of SEQ ID NO : 5 , 1-17S 
or 179-414 of SEQ ID NO : , 1-22, 23-2S9, 290-526, 527-640, 
641-895, or a9'"- 1239 of SEQ ID N0:13, 1-255, 256-442, 443- 

15 626, 627-954, or 955-1297 of SEQ ID NO : 14 , 1-255, 256-476, 

477-680, '681-9S3, or 984-1326 of SEQ ID N0:15, 1-13, 14-273, 
274-346, 347-534, or 535-894 of SEQ ID N0:18, 1-21, 22-277, 
278-427, 428-637, 533-751,- or 752-893 of SEQ ID N0:22, 1-65, 
• 67-215, 216-425, 426-539, 540-786, or 787-887 of SEQ ID 

20 NO:23, 1-25, 25-273, 274-422, 423-632, or 633-743 of SEQ ID 

NO:24, 1-51, 52-224, 225-393, 394-653, or, 659-681 of SEQ ID 
NO:29, 1-25, 26-281, 282-430, 431-640, 641-754, 755-901, or 
902-1001 of SEQ ID NO : 3 1 , 1-10, 11-321, or 322-373 of SEQ ID 
NO:97, 1-57, 58-369, or 370-418 of SEQ ID NO:99, 1-52, 53- 

25 173, 17.4-307, 308-572, or 573-591 of SEQ ID NO:103-, 1-24,''. 

25-289, 290-397/ 398-623, 629-663, 669-872, or. 373-1227 of 
SEQ ID NO:105, or 1-33, 34-294, 295-337, 333-472, 473-724, 
or 725-968 of SEQ ID NO : 1 0 7 ; (c) the amino a.cid sequence set 
forth xn SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID NO : 7 , SEQ ID 

30 NO: 13, SEQ ID NO : 14 , SEQ ID NO : 1 5 , SEQ ID NO : 1 8 , SEQ ID 

NO: 22, SEQ ID NO: 23, SEQ ID NO : 2 4 , SEQ ID NO : 2 9 , SEQ ID 
NO:97, SEQ. ID NO:99, SEQ ID NO:103, SEQ ID NO : 1 0 5 , or SEQ ID 
NO:107 from ammo acid residues 1-21, 22-274, or 275-416 of 
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SEQ ID NO: 5, 1-31, 32-30S, 309-48S, or 4S0-516 of SEQ ID 
NO: 5, 1-173 or 179-414 of SEQ ID NO : 1 , 23-289, 290-525, 527- 
640, 641-896, or 397-1239 of SEQ ID NO:13, 1 -255, 25-5-442, 
443-526, 627-954, or 955-1297 of SSQ ID NO : 14 , " 1 - 255 , 256- 
5 476, 477-630, 631-983, or 984-1326 of SSQ ID KO:15, 1-13, 

14-273, 274-346, 347-534, or 535-394 of SEQ ID NO:lS, 1-21, 
22-277, 273-427, 423-637, 638-751, or 752-398 of SEQ ID 
NO:22, 1-66, 57-215, 216-425, 426-539, 540-736, or 7S7-SS7 
Cf SEQ ID HO:23, 1-25, 26-273, 274-422, 423-632, or 533-74f 
10 of SEQ ID NO:24, 1-51, 52-224, 225-393, 394-658, or 659-6S1 

of SEQ ID NO:29, 1-25, 26-273, 274-422, 423-632, 633-746, 
747-993, or 994-1093 of SEQ ID N0:31, 1-10, 11-321, or 322- 
373 cf SEQ ID NO:97, 1-57, 58-369, or 370-418 of SEQ ID 
NO:99, 1-52, 53-173, 174-307, 30S-572, or 573-591 of. SEQ ID 
15 NO:103, 1-24, 25-289, 290-397, 398-62S, 629-66S, 669-372, or 

373-1227 of SEQ ID N0:105, or 1-33, 34-294, 295-337, 338- 
472, 473-724, or 725-963 of SEQ ID NO: 107; or ;d) che amino 
acid sequence sec forth in SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID 
• NO: 7, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID 
NO: IS, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SSQ ID 
NO:29, SEQ ID NO : 3 1 , SEQ ID NO:97, SEQ ID NO:99, SEQ ID 
NO: 101, SEQ ID NO: 103, SEQ ID NO: 105, or SEQ ID NO: 107, 
excep- that: ic lacks one or more, but not all, of the 
domarns selected from the group consisting of a C-terminal 
domain, a catalytic domain, an N-terminal dom.ain, a spacer 
region, a proline-rich region, a coiled-coil structure 
region, an insert, and a C-terminal tail. 

The polypeptide can be isolated from, a natural source 
by methods v/eil -known in the art. The natural source m.av be 
m^am.malian, preferably human, blood, semen, or tissue, and 
the pclypepcide m.ay be synthesized using an auuomiated 
polypeptide synthesizer. The isolated, enriched, or 
purified k:.nase polypeptide is preferably: a STLK2 , STL 
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STLK4 , STLK5, STLK5 , or STLK7 polypeptide; a ZCl , ZC2 , ZC3 , 

or ZC4 polypeptide; a KHSI' polypeptide; a SULUl or SULU3 
polypeptide; a GEK2 polypeptide; or a PAK4 or PAK5 
polypeptide . 

5 In some errLbcdimencs the invention includes a 

recombinant kinase polypeptide selected from the group 
consisting of STLK2 , STLK3 , STLK4 , STLK5 , STLK6 , 5TLK7 , 2Ci, 
ZC2, ZC3, ZC4, '?:HS2, SULUl, SULU3 , GEK2 , PAK4 , and PAK5 . B\ 
'■recombinant kinase polypeptide" is meant ci polypeptide 

10 produced by recombinant DNA techniques such that it is 

distinct from a naturally -occurring polypeptide either in 
its location \G,g., present in a different cell or tissue 
than found in nature), purity or structure. Generally, such 
a recomx>inant ' polypeptide will be present in a cell in an 

15 amount different from, that norm.ally observed in nature. 

In a fifth aspect, the invention features an antibody 
(e.g., a monoclonal or polyclonal antibody) having specific 
binding affinity to a kinase polypeptide or a kinase 
polypeptide dom.ain or fragm^ent where the polypeptide is 

20 selected from, the group consisting of STLK2 , STLK3 , STLK4 

STLK5, STL'KS, STLK7 , ZCl , ZC2 , ZC3 , ZC4 , KHS2 , SULUl, SULU3 , 
GEP:2 , PAK4 , and PAK5 , By "specific binding affinity" is 
m.eant that the antibody binds to the target kinase 
polypeptide v/ith greater affinity than it binds to other 

25 polypeptides under specified conditions. Antibodies or 

antibody fragments are polypeptides that contain regions 
that can bind other polypeptides. The term^ "specific 
binding affinity" describes an antibody that binds to a 
k i na s e co 1 vpe o tide v/i t h g re a t e r affinity than it b i nds t o 

30 other polypeptides under specified conditions. 

The term, "oolyclonal" refers to antibodies that are 
heterogenous populations of antibody molecules derived from, 
the sera of animals immuni"ed with an antigen or an 
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antigenic fun'ccional derivative thereof. For the prccucticn 
of polyclonal ancibodies, various host aniraals -tay be 
immunized by iniection with the antigen. Various adjuvant? 
may be used to increase the immunological response, 
depending on the host species. 

".Monoclonal antibodies" are substantially homogenous 
populations of antibodies to a particular antigen. They m.ay 
be obtained by any technique v/hich provides for the 
production of antibody molecules by continuous cell lines in 
culture. Monoclonal antibodies may be obtained by methods 
known to chose skilled in -the art (Kohler et al . , Nacure 
256:435-497, 1975, and U.S. Patent No. 4,376,110, both of 
which are hereby incorporated by reference herein in their 
entirety including any figures, tables, or drawings). 

The cerm "ancibody fragment" refers to a portion of an 
antibody, often the hyper variable region and porcions of 
rhe surrounding heavy and light chains, that displavs 
specific binding affinity for a particular molecule. A 
hyper variable region is a portion of an antibody chat 
physically binds to the polypeptide target. 

-Zi.ntibodies or antibody fragments having specific 
binding affinity to a kinase polypeptide of -he invention 
may be used in methods for detecting the presence ana/or 
amount of kinase polypeptide in a sample by probing the 
sample wirh che antibody under conditions suitable for 
.^cmase-antibody im.munocomplex form.ation and detecting the 
presence and/or amount of che antibody conjugated to the 
kinase polypeptide. Diagnos-ic kits for performing such 
m.ethcds may be constructed to include antibodies or-antibodv 
fragments specific for ths kinase as well as a ccnjugace of 
a binding parcner cf the antibodies or che antibodies 
themselves . 
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An anribody or antibody fragment: wich specific bindinq 
affmicy zo a kinase polypeptide of che invencion can be 
isolated, enriched, or purified from a prokaryotic or 
eukar\'otic organisn\. Routine methods kncv;n cc those skilled 
in the art enable production of antibodies or antibody 
fragments, in both prokaryotic and eukaryotic organisms. 
Purification, enrichment, and isolation of antibodies, v;hich 
are polypeptide miolecules, are described above. 

Antibodies having specific bindino affinitv to a kinase 



detecting the presence and/or amount of kinase polypeptide 
in a samp'le by concacting the sample with the antibody under 
conditions such that an im.mrano complex forms and detecting 
the presence and/or amount of the antibody conjugated to the 

15 kinase polypeptide. Ciagnostic kits for performing such 

m.ethods may be constructed to include a first container 
containing the antibody and a second container ha\-'ing a 
conjugate of a binding parcner of the antibody and a label, 
such as, for example, a radioisotope. The diagnostic kit 

20 may also include notification of an FDA approved use and 

instruct ions therefor . 

In a sixth aspect, the invention feci cures a hybridoma 
which produces an antibody having specific binding affinity 
to a kinase polypeptide or a kinase polypeptide domain, 

25 where the polypeptide is selected fromt the group consisting 

of STLK2, STLK3, STLK4, STLK5 , STLK6 , STLK7 , ZCl , ZC2 , ZC3 , 
ZC4 , KHS2/ SULUl, SULU3 , GEK2 , PAK4 , and ' PAK5 . By 
"hybridom.a" is meant an im.mor t al i zed cell line that is 
capable of secreting an antibody, for exam.ple an antibody to 

3^1 a kinase of the invention. In preferred emibod im.ent s , the 

antibody to the kinase comprises a sequence of ammo acids 
that is able .to specifically bind a kinase polypeptide of 
the invention. 
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In a sevenrh aspecc, che invention features a kinase 

pclypepride omding agent able co bind zo a kinase 
polypepride selected from the group consisting of STLK2 , 
STLK3, STLK4, STLK5 STLK7 , STLK5 , ZCl , ZC2 , ZC3 , ZC4 , KHS2 , 
SULUl, SULU3, GEK2, PAPC4 , and PAK5 . The binding agent is 
preferably a purified ancibody chat recognizes an epitope 
presenc on a kinase polypeptide of che invention. Other 
binding agencs include molecules chat bind to kinase 
polypepcides and analogous ir^olecules chac bind to a kinase 
polypeptide. Such binding agencs may be identified by using 
assays chac measure kinase binding partner activity, such as 
those chac measure PDGFR activitv. 

The invention also features a mechocl for screening for 
human cells concaining a kinase polypeptide of the invention 
or an equivalent sequence. The mechod involves identifving 
che novel pol^/peptide in human cells using techniques that 
are routine and scandard m the art, such as chose described 
herein for identifying che kinases of che invention (e.g., 
cloning, Souchern or Morthern blot analysis, in situ 
hybridizacion, PGR amplif icacion, etc.). 

In an eighth aspect, che invention feacures mechods for 
identifying a substance thac modulates kinase activitv 
comprising che steps of: (a) contacting a kinase polypepcide 
selecced from the group consisting of STL:K2 , STLK3-, STLK4 , 
STLK5, STLK6, STLK"? , ZCl, ZG2 , ZC3 , ZG4 , KHS2 , SULUl , SULU3 . 
GEK2, PAK4, and PAK5 v;ith a cest substance; (b) measuring 
the activity of said polypepcide; and (c) determ.inmg 
v;hecher said substance modulates che activicy of said 
polypepc ide . 

The term "m.odulates" refers to the abilicy of a 
compound to alter che function of a kinase of che invencion. 
A modulator preferably activaces or inhibits the activitv 
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of a kinase of che invention depending on the concent: r^.i: i or. 

of the compound exposed zo the kinase. 

The cerm ''activaces" refers to increasing the cellular 
activicy of the kinase. The terrri inhibit refers to 
5 decreasing the cellular activity of che kinase. Kinase 

activicy is preferably the interaction with a natural 
binding partner . 

The term "modulates" also refers to altering the 
function of kinases of the invention by increasing or 

10 decreasing the probability that a complex forms between the 

kinase and a natural binding partner. A modulator 
preferably increases the probability that such a com^plex 
forms between the kinase and the natural binding partner, 
more preferably increases or decreases the . probabili t\' that 

15 a complex forms becv/een the kinase and the natural binding 

partner depending on the concentration of the compound 
exposed to the kinase, and most preferably decreases the- 
probability that a com.plex forms between the kinase and the 
natural binding partner. 

20 The term, "comolex" refers to an assembly of at -least' 

t'wo moilecules bound to one a.nother. Signal transduction 
complexes often contain at least tv;o protein molecules bound 
, to one another. For instance, a protein tyrosine recep.tor 
protein kinase, GRB2 , SOS, RAF, and RAS assemible to- form, a 

25 signal transduction complex in response to a mitogenic 

- 1 igand . ■ " 

The term ''natural binding partner" refers to 
ool voept ides , lioids, smiall mO'lecules, or nucleic acids that 
bind to kinases in cells. A change in the interaction 

30 between a kinase and a natural binding partner can m^anifest 

Itself as an increased or decreased probability that che 
interaction form.s, or an increased or decreased 
concentration of kinase /natural binding partner complex. 
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The term "concacting" as used herein refers co mixir.g = 
solucion comprising the cesc compound with a liquid medium, 
ba-hing the ceils of the me c hods . The sclution comprising 
the ccm.pound mtay also comprise another com.ponenr , such as 
dim.echyl sulfoxide (DMSO) , which facilitates che upcake of 
the test compound or compounds into the cells of the 
m.ecnods. The solucion comprising the test compound mav be 
added co che m.edium bathing che cells by utili-ing a 
delivery apparacus, such as a pipet-based device or syringe- 
based device . 

In a ninth aspecc, che invention features mechods for 
identifying a substance chat modulaces kinase accivity in a 
cell comprising the steps of: (a) expressing a kinase 
polypeptide m a cell, v/herem said polypeptide is selecced 
from che group consiscing of 3TL?:2 , STLK3 , STLK4 , STLK5 , 
STLIvS, STLK7, ZCi , ZC2 , ZC3 , ZC4 , I-CHS2 , SULUl , SrJLU3 , GEK2 , 
?AK4, and PAK5 ; (b.^ adding a tesc substance to said cell; 
and (c) monicorinq a change in cell phenotype or che 
interaction between said polypeptide and a natural binding 
partner . 

Tne term, "expressing" as used herein refers co the 
prcduction of kinases of che invention from a nucleic acid 
veccor containing kinase genes within a cell. The nucleic 
acid vector is transfecced inco cells using well known 
techniques in che art as described herein. 

In a tenth aspecc, che invencion provides methods for 
creacing a disease by administering to a patient in need of 
such creatmient a subscance thac modulates the accivitv of a 
Kinase selected from che group consisting of STLK2 , 3TLP:3 . 
STLK4, 3TLK5, STLK6 , STLK^ , ZCI, ZC2 , ZC3 . ZC4 . KHS2 , SULUl, 
SULU3, GEK2, PAK-l , and PAKE . Preferably, che disease is 
seiecced from the group consisting of im.miune - relac ed 
diseases and disorders, organ cranspl ancac ion , myocardial 
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inf arct: ion . cardiovascular disease, stroke, renal failur-Z;. 

oxidative scress - related neurodegenerative disorders, and 
cancer. Mosc preferably, the immune - relaced diseases and 
disorders include, but are not limited to, rheumatoid 
5 arthritis, archerosclerosis , and autoim.mune disorders. 

In preferred embodiments, the invention provides 
mtethods for creating or preventing a disease or disorder by 
administering to a patient in need of such treatment a 
substance chat modulates the activicy of a kinase 
10 polypepcide selected from che group consisting of ZCl, ZC2 *, 

ZC3 , ZC4 , KKS2, PA?:4 , and PAK5 , Preferably, the disease or 
disorder is selected fromi the group consisting of rheumtacoid 
arthritis, archerosclerosis , autoim;m.une disoi'ders, and organ 
transplantation. The invention also features methoos of 
15 treating or preventing a disea.se or disorder by 

administering to a patient in need of such treatment a 
substance that m.cdulaces the activity of a kinase 
Dolvoeptide selected from the group consisting of 3TLK1, 
STLK2 , STLK3 , STLK4 , STLK5 , STLK6 ^ and STLK7 . Preferably 
20 the disease or disorder is selected fromt the group 

consisting of imtm.une-related diseases and disorders, 
myocardial infarction, cardiomyopathies, stroke, renal 
failure, and oxidative stress -related neurodegenerative., 
disorders. Most preferably, the immune - related diseases and 
25 disorders are selected from, the group consisting of 

rheumiatoid arthritis, chronic inflammatory bowel disease, 
chronic inflammatory pelvic disease, m.ultiple sclerosis, 
asthm.a , osteoarthritis , psoriasis , atherosclerosis , 
rhinitis, auto im^muni ty , and organ transplantation. 
30 The invention also features m.echods of creating or 

preventing a disease or disorder by admiinis t er mg to a 
patient in need of such treatment a subscance that m^cdulates 
the activicv of a kinase polypeptide selected from, che group 
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-cnsisring of ZCl, ZC2, ZC2 , and ZC4 . Preferablv che 
disease is seleccsd frorr. the group consisring of ir.,rr,une- 
related diseases and disorders, cardiovascular disease, and 
cancer. Mosc preferably, the immune -re la ted diseases and 
disorders are selected from the group consisting of 
rheumacoid arthritis, chronic inflammatory bov.-el disease, 
chronic inflammatory oelvic disease, m.ultiple sclerosis, 
asthma, osteoarthritis, psoriasis, atherosclerosis, 
rhinitis, autoim.munity, and organ transplantation. 

Substances useful for treatment of kinase-reiaced 
disorders or diseases preferably show pcsicive results in 
one or -nore in W assays for an activity correspond ino to 
treatmenc of the disease or disorder m quesc^on (Examples 
of such assays are provided in the references in section VI, 
belov..-; and m Example 7, herein). Examples of substances 
that can be screened for favorable activity are provided and 
referenced in section VI , below. The substances that 
modulare the activity of the kinases preferably include, but 
are not limited zc. antisense oligonucleccides and 
inhibitors of procein kinases, as determined by mechods and 
screens referenced in section VI and Example 7, below. 

The term "preventing" refers to decreasing the 
probability that an organism contracts or develops an 
abnormal condicion. 

The term "treanng" refers- to having a cherapeutic 
effect and at least partially alleviating or abrogating an 
abnorm.al condition in t.he organism. 

The term "therapeucic effect" refers zo the inhibition 
or actix-ation factors causing or contributing to the 
ab.normal condition. A therapeutic effect relieves to some 
extent one or more of the symptoms of the abnormal 
condition. In reference to the treatment of abnormal 
conditions, a therapeutic effect can refer to one or mere of 
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the following: (a) an increase in the proliferation, grov/th, 

and/or differentiation of oells; (b) inhibition (i.e., 
slowing or scooping) of cell death; (c) inhibition of 
degeneration; (d) relieving to some extent one or more of 
5 the symptoms associated with the abnormal condition; and (e) 

enhancing the function of the affected population of cells. 

Compounds dem.cns t ra t ing efficacy against abnormal . 
conditions can be identified as described herein. 

The term "abncrm.al condition" refers to a function m 
10 the cells or tissues of an organismi that deviates from, their 

normal functions in that organism. An abncrmial condition 
can relate to cell prolif eration, cell dif f erent iation, or 
cell survival . 

Abnvorm.al cell proliferative conditions include cancers 
15 such as fibrotic and miesangial disorders ^ abnormtal 

angiogenesis and vasculogenesis , wound healing, psoriasis, 
diabetes mellitus, and inf lamtmat ion . 

Abnormal differentiation conditions include, but are 
not lim.ited to neurodegenerative disorders slov; v/ound 
20 healing rates, and slow tissue grafting healing rates. 

. A.bnormal cell survival conditions relate to conditions 
in which program.med cell death (apoptosis) pathways are 
activated or abrogated. . A numiber of protein kinases are 
associated with the apoptosis pathways. Aberrations in the 
25 function of any one of the protein kinases could lead to 

cell im.mortality or premiature cell death. 

The term ''aberration", in conjunction v;ith the function 
of a kinase in a signal transduction process, refers to a 
kinase that is over- or under-expressed in an organism, 
30 mutated such that its catalytic activity is lower or higher 

than v/ild-type protein kinase activity, mutated such that it 
can no longer interact with a natural binding partner, is no 
longer modified by another protein kinase or protein 
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phosphacase, or no longer mteraczs wizh a natural binf-mc: 
parcner. 

The zerm "administering" relates to a metihcd of 
incorporating a compound into cells or tissues of an 
organism. The abnormal condition can be prevented or 
treated when the cells or tissues of the organisn^. exisc 
within the organism or outside of the organism. Cells 
exiscing outside the organism can be maintained or grov/n in 
cell culture dishes. For cells .harbored within zha 
organism, many techniques exist in che art zq admmis-er 
compounds, including (but not limited to) oral, parenteral, 
cermal, injection, and aerosol applications. For cells 
outside of rhe organism, m.ultipls techniques exist in the 
art to adminiscer the compounds, including {but not limited 
to) cell microiniection techniques, transf ormacion 
techniques, and carrier techniques. 

The abnormal condition can also be prevented or treated 
by adm.inistermg a compound to a group of cells having an 
aberration in a signal transduction pathway to an organism. 

The effect of admdnistering a com.pound on organism function 
can then be m.onicored. The organism is preferably a mouse, 
rat, rabbit, guinea pig, or goat, more preferably a monkey 
or ape, and most preferably a human. 

In an eleventh aspect, the invention features methods 
for detection of a kinase polypeptide m a sam.ple as a 
diagnostic tool for diseases or disorders, wherein the 
method comprises the steps of: (a) contacting che sam.ple 
with a nucleic acid probe which hybridizes under 
hybridization assay conditions to a nucleic acid target 
region of a kinase poiypepcide selected from the group 
consisting of STLK2 , STLK3 , 3TLK4 , STLK5 , ST-LK6 , STLK7 , ZCl , 
ZC2, ZC3, ZC4, .KHS2, SULUl , 3ULU3 , GEK2 , PAK4 , and PAK5 , 
said probe comprising the nucleic acid sequence encoding the 

4d 
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polypeptide, fragments thereof, and the ccmplenrient s of the 

sequences and fragments; and lb) detecting the presence or 
amounc of the probe: target region hybrid as an indication of 
the disease. 

5 In preferred embodiments of che invention, zhe disease 

or disorder is selected from the group consisting of 
rheumatoid arthritis , artherosclerosis , autoimmune 
disorders, organ transplantation, mtyocardial infarction, 
cardiomyopathies, stroke, renal failure, oxidati\''e stress- 

10 related neurodegenerative disorders, and cancer. In other' 

preferred embodiments, che kinase polypeptide is selected 
from the group consisting of PAK4 and PAKS , or the ■ 
polypepcide is selected fromt the group consisting of ZCl, 
ZC2 , ZC3 , and ZC4 , and the disease is cancer. 

15 The kinase ''target region" is the nucleotide base 

sequence set forth in SSQ ID NO : 1 , SEQ ID NO : 2 , SEQ ID NO : 3 , 
SEQ ID NO: 9, SEQ ID NO: 10, SSQ ID NO: 11, SEQ ID NO : 1 ^ , SEQ 
ID NO:19, SEQ ID NO : 2 0 , SEQ ID NO : 2 1 , SEQ ID NO : 2 7 , SEQ ID 
NO: 96, SEQ. ID NO: 98, SEQ ID NO: 100, SSQ ID NO: 102, SEQ ID 

20 N0:104, or SSQ ID NOrlGc, or the corresponding, full - length ' 

sequences,- a functional derivative "hereof, or a fragment 
thereof to v/hich the nucleic acid probe will specifically 
hybridize. Specific hybridization indicates chat in che ■ 
presence of other nucleic acids the. probe only hybridizes 

25 detectably with the kinase of the invention's target region. 

Putative target • regions can be idencified by m.ethods well 
knov/n in the art ccnsiscing of alignment and com.parison of 
the vr-.Gsz -closely related sequences in the dcicabase. 

In preferred embodimiencs the nucleic acid probe 

30 hybridizes "c a kinase carget region encoding at least 6, 

12, 75, 90, 105, 120, 150, 200, 250, 300 or 350 contiguous 
amino acids of che sequence sez forth in SEQ ID NO : r , SEQ ID 
NO: 6, SEQ ID NO : 7 , SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, 

HI 
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3EQ xD MO: 13, £EQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ 
ID NO: 29, SEQ ID NO : 3 1 , SEQ ID NO: 97, SEQ ID NO: 95, SEQ ID 
NO:101, SEQ ID NO:103, SEQ ID NO:lC5, or SEQ ID NO:107, or 
the corresponding full-length amino acid sequence, cr a 
5 functional derivative thereof. Hybridization conditions 

should be such thac hybridizacion occurs only v/ich the 
kinase genes in the presence of other nucleic acid 
rrolecules. under stringent hybridization conditions only 
highly corripl errtent ary nucleic acid sequences hvbridize. 
10 Preferably, such conditions prevent hybridization of nuclei- 

acids having more than 1 or 2 mismatches ouc of 2 0 
contiguous nucleotides. Such conditions are defined supra. 

The diseases for which decection of kinase genes in a 
sample could be diagnostic include diseases in which kinase 
15 nucleic acid (DNA and/or RNA) is amplified m comparison co 

normal cells. By "amplification" is meant increased numbers 
of kinase DNA or RNA in a cell compared with norm.al cells. 
In normal cells, kinases are typically found as single coov 
genes. In selected diseases, che chromosomtal locacion of 
the. kinase genes may be amplified, resulting m multiple 
copies of the gene, or am.plif icauicn . Gene amplification 
can lead to amplification of kinase RNA, or kinase RNA can 
be amplified m the absence of kinase DNA ampl i f icacion . 

" Am.plif ication" as ic refers to RNA can be the 
detectable ■ presence of kinase RNA m cells, since m some 
normal cells chere is no basal expression of kinase RNA. in 
ether normal cells, a basal level of expression of kinase 
exists, therefore in these cases amplification is the 
detecrion of at leas- 1-2-fold, and preferably more, kinase 
RNA, compared to che basal level. 

The diseases char could be -diagnosed bydecection of 
Kinase nucleic acid in a sample preferably include cancers. 
The test samples suitable for nucleic acid probing methods 
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of che present invention include, for example, cells or 

nucleic acid extracts of cells^ or biological fluids. The 
samples used in the above -described methods v;ill vary based 
on the assay format, the detection method and the nature of 
the tissues, cells or extracts to be assayed. Methods for 
preparing nucleic acid extracts of cells are v/ell knov/n in 
the art and can be readily adapted in order to obtain a 
sample that is compatible with the method utilized. 

In a final aspect, the invention features a method for 
detection- of a kinase polypeptide in a sample as a 
diagnostic tool for a disease or disorder, wherein the 
method comprises: (a) comiOaring a nucleic acid target" 
region encoding the kinase polypeptide in a sam.ple, where 
the kinase polypeptide is selected from the group consistinc 
of STLK2 , STLK3 , STLK4 , STLK5 , STLK6 , STLK7 , ZCl, ZC2 , ZC3, 
ZC4, KHS2, SULUl, SULU3 , GEK2 , ?AK4 , and PAK5 , or one or 
m.ore fragmer.ts thereof, with a control nucleic acid target 
region encoding the kinase polypeptide, or one or more 
fragments therec^f; and (b) detecting differences in sequence 
or amount between the target region and the control target 
region, as an indication of the disease or disorder. 
Preferably, the disease or disorder is selected from: the 
group consisting of imimiune -related diseases and disorders, 
organ transplant at ion myocardial infarct ion , cardiovascular 
disease, stroke, renal failure, oxidative stress -related 
neurodegenerative disorders, and cancer. Imimune - related 
diseases and disorders include, but are not limited to, 
those discussed previously. 

The term^ "comparing" as used herein refers to 
identifying discrepancies between the nucleic acid target 
region isolated from a sam.ple, and the control nucleic acid 
target region. The discrepancies can be in the nucleotide 
sequences, e.g. insertions, deletions, or point mutations, 
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or in -he amcunc cf a given nucleocide sequence. Methcds; 
decerrTiine these discrepancies in sequences are v;eil-knov;n cc 
cne of ordinary skill in che art. The "contrcl=' nucleic 
acid target region refers co the sequence or arrount of che 
:^ sequence found in normal cells, e.g. cells chac are noc 

diseased as discussed previously. 

The invention has been described broadly and 
generically herein. Each of the narrov/er species and 
subgeneric groupings falling within che generic disclosure 
10 also form part of the invention. This includes che generic 

description of che mvencion v/ith a proviso or negacive 
limicacicn rem.oving any subjecc matcer from, che genus, 
regardless of v;hether or not the excised material is 
specifically reciced herein. For example, in some mscances 
the nucleotide sequence of the ZC4 kinase polypepcide mav 
noc be part of a preferred embodiment. 

The summ.ary of the invention described abc-ve is not 
limicixng and other features and advantages of che invencion 
v/ill be apparent from the following detailed description of 
che invention, and from the claims. 
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r^RiEF DESCRIPTION OF THE FIGURE? 

Figure 1 shows a multiple sequence alignm.enc of the 
am.ino acid sequences of che STE20-STE20 fam.ilv kinases. 

Figure 2 shows a mrultiple sequence alignmenc of the 
ammo acid sequences of the STE20-STLK5 family kinases. 

Figures 3A and 33 show a m.ulciple .sequence alignmient of 
che ammo acid sequences of 3TE20-ZC family kinases. 

Figure 4 shows a pairwise sequence alignmenc cf STE20- 
KHS family kinases . 

Figure 5 shows a multiple sequence alignm.enc of the 
ammo acid sequences of STE20-SULU family kinases. 

MM 
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Figure 6 shows a pairwise sequence aiignrr.er.c of STE2 0- 
GEK family kinases 

Figure 7 shows a multiple sequence alignment of zhe 
amino acid sequences of STE20-PAK family kinases. 
5 Figures 3A, 8B, 8C, 8D, and 3E show the am.ino acid 

sequences of human 3TLK2 , human STLK3 , human STLK4 , human 
STLK5 , human ZCl, human ZC2 , human ZC3 , humian ZC4 , human 
KHS2, hum.an SULUl , human 3ULU3 , murine SULU3 , human GEK2 , 
human FAK4 , and human PAK5 . 
10 Figures 9A, 93, 9C, 9D, 9E, 9F, 9G. 9H. 91, 9J, 9?:, 9L, 

9M, 9M, 90, and 9P show the nucleic acid sequences of human 
STLK2 , humian 5TLK3 , human StLK4 , humtan STLPC5 , human ZCl, 
human ZC2 , human ZC3 , human ZC4 , human KHS2 , human SULUi ^ 
hum.an 5ULU3 , miurine SULU3 , human GEK2 , humian PAK4 , and human 

1 5 ?ap:5 . 

Figures lOA and lOB show the full-length am.ino acid 
sequences of human STLK5 {SEQ ID NO: 97), human PAK5 (SEQ ID 
N0:103), and hum:an ZC4 (SEQ ID NO:105), as well as the 
partial amiino acid sequences of human ful 1 - length • STLK6 (SEQ 
20 ID NO: 99) and human STLK7 (SEQ ID NO: 101? . 

Figures llA, IIB, IIC, and IID show^ the full-length 
nucleic acid sequences of hum.an STLK5 (SEQ ID NO: 96), hum.an 
?AF;5 (SEQ ID NO:102), and human ZC4 (SEQ ID N0:104), as well 
as the parcial nucleic acid sequences of human 5TLK6 (SEQ. ID 
25 NO: 9S; and human 3TLK7 (SEQ ID NO: 100? . 

Figure 12 shows" a multiple sequence alignmiehc amiong 
hum.an SPAK, hum.an STLKo , human STLK7 and full-length human 
STLK5 . 

Figure 13 show's a miultiple sequence alignment am.ong 
30 human PAKl , human PAPC4 and human PAK5 . 

Figures 14A and 14B show a pair-wise sequence alignmienc 
between humian ZCl and hum.an ZC4 . 
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. Figure 15 shows a pair- wise sequence aiiar.menc bezv/eer: 
LOKl and ' f ui 1 - length GEK2 . 

DETA ILED DESCRIPTION OF ^Nk TNVBMTTOM 

The present invention relates in part to kinase 
polypeptides, nucleic acids encoding such polypeptides, 
cells containing such nucleic acids, antibodies to such 
polypeptides, assays utilizing such .polypepc ides , and 
methods relating to all of the foregoing. The present 
invention is based upon the isolation and characterisation 
of new kinase polypeptides. The polypeptides and nucleic 
acids may be produced using well-knovm and standard 
synthesis -.echniques when given the sequences presented 
herein. 

The recent elucidation of the DNA sequence of 
Saccharomyces cerevesiae has provided the first complete 
example of the genetic information contained in a sim.ple 
eukaryotic organism. Analysis of this yeast genome revealed 
that it contains at least 113 protein kinases. These 
kinases were further subdivided into several structurally 
related groups. One of these newly defined groups v;as 
term.ed the STE20- family co represent its founding memh.er 
STE2 0, which is a protein kinase involved in the yeast 
pheromone response pathway that iniciates a protein kinase 
2? cascade in response to a G-prctein mediated signal. S. 

cereveslae has two additional memJoers of this family, CLA4 , 
and YOL113'/7 (HR.2i655; . 

Several mamm.alian homoiogues have recenclv been 
identified that belong to the STE20 - f ami ly , including SOK-1 
30 (human STE20), GC- kinase, KHS , HPKl , NIK, SLK, GEK, PAKl , 

PAK55, MSTl, and CDC7 . Furthermore, the Droscchlla and the 
C. elegans genome efforts have identified additional protein 
kinases which belong to the STE2C - f ami ly , yet have 



20 
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scruccurally unique excracataiynic domains, including 
ZC504.4 and SULu kinases from C. elecrans, and NINAC of 
Drosophila . 

3TE20 - reiaced pronein kinases have been implicated as 
5 regulating a variety of cellular responses, including 

response to growch factors or cytokines, oxidative-, LW- , or 
irradiacion-related scress pathways, inf lam.macory signals 
(i.e., TNFa) , apopcotic stimuli (i.e.. Fas), T and 3 cell 
coscim^ulacion, che concrol of cytoskeletal architecture, and 

10 cellular transformation. Typically, the STE2 0 - rel ated 

kinases serve as upscream regulators of iyiA^PK cascades. 
Examples include: HPP:i, a procein - serine/ threonine kinase 
(STK) that possesses a STE20-like kinase domain that: 
accivates a protein kinase pathv/ay leading to the scress - 

15 accivaced protein kinase SAPK/JNK; PAKl , an STK with an 

upstream CDC4 2 - binding dom.ain thac interacts with Rac and 
plays a role' in cellular transformation through nhe Ras-MAFK 
pathway; and miurine NIK, which interaccs wich upstream, 
recepcor tyrosine kinases and connects with downstream 

20 STEll- family kinases. 

The 3TE20-kinases possess a variety of non- catalytic 
domiains chac are believed to interact with upstream, 
regulators. Examples include proline-rich dcm.ains for 
interaction with 3H3 - cent aining proteins, or specific 

25 domiains for interaction with Rac, Rho , and Rab smiall-G- 

proceins. These interactions may provide a mechanismi for 
cross -talk betv;een distinct biochem.ical pathv/ays in response 
CO external stimuli such as the activation of a variety of 
cell surface receptors, including tyrosine kinases, cytokine 

3U receptors, TNF receptor, Fas, T cell receptors, CD2 8, or 

CD4C . 
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The Nucleic Acids of the Inv^n^-io^ 

Included v/ithm the scope of this inventicn are the 
functional equivalents of the herein-described isolated 
nucleic acid molecules. The degeneracy of the genetic code 
permits substitution of certain codons by other ccdons that 
specify the same amino acid and hence would give rise to the 
samie protein. The nucleic acid sequence can varv 
substantially since, with the exception of methionine and 
tryptophan, the known amino acids can be coded for bv m.ore 
than one ccdcn . Thus, portions or all of the kmase genes 
of the invention could be synthesized to give a nucleic acid 
sequence significantly different from that shown m 5EQ ID 
NO:l, SEQ ID NO: 2, SEQ ID NO : 3 , SEQ ID NO : 9 , SHQ ID NO: 10, 
5EQ ID NO: 11, SEQ ID NO : 1 7 , SEQ ID NO : 1 5 , SEQ ID NO : 2 0 , SEQ 
ID NO: 21, SEQ ID NO : 2 7 , SEQ ID NO: 96, SEQ ID NO: 93, SEQ ID 
NO:100, SEQ ID NO:i02, SEQ ID NO:104, and SEQ ID NO:106. 
The encoded amino acid sequence thereof would, however, be 
preserved . 

In addition, the nucleic acid sequence may comprise a 
nucleotide sequence which results from the addition, 
deletion or substitution of at least one nucleotide to the 
?'-end and/or tne 3 ■ -end of the nucleic acid form.ula shown 
-n SEQ ID N0:1, SSQ ID NO : 2 , SEQ ID NO : 3 , SEQ ID MO:?, SSQ 
ID NO: 10, SEQ ID NO : 1 1 , SEQ ID NO:17, SEQ ID NO : 1 9 , SEQ ID 
NO: 20, SEQ ID NO : 2 1 , SEQ ID NO: 27, SEQ ID NO: 96, SEQ , ID 
NO:9S, SEQ ID NO: 100, SEQ ID NO: 102, SEQ ID NO: 104, or SEQ 
ID NO: 106, or a derivative thereof. Any nucleotide or 
polynucleotide may be used in this regard, provided that its 
addition, deletion or substitution does not alter the amino 
acid sequence of SEQ ID NO : 5 , SEQ ID NO : 5 ., SEQ ID NO : 7 , SEQ 
:C NO: 13, SEQ ID NO: 14, SEQ ID NO: 15. SEQ ID NO : 1 5 , SEQ ID 
NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 
NO: 97, SSQ ID NO: 59, SEQ ID 1X^0:101, SEQ ID NO: 103, SEQ ID 
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NO: 105, or SEQ ID NO: 107, which is encoded by che nucleoside 
sequence. For exarriple, che presenc invention is incenced lo 
include any nucleic acid sequence resulting from che 
addition of ATG as an initiation codon at the 5 '-end of the 
5 inventive nucleic acid sequence or ics derivacive, or from 

the addition of TTA, TAG- or TGA. as a termination codon at 
the 3 ' -end of the inventive nucleotide sequence or its 
derivative. Moreover, the nucleic acid molecule of the 
present inventiori may, as necessary, have restriction 
10 endonuclease recognition sites added to its 5 '-end and/or 

3 ' -end . 

Such functional avlterations of a given nucleic acid 
sequence afford an opportunity no promote secretion and/or 
processing of heterologous proteins encoded by foreign 

15 nucleic acid sequences fused thereco. All variations of the 

nucleotide sequence of the kinase genes of che invention and 
fragments thereof permitted by the genetic code are, 
therefore, included in this invencion. 

Further, it is possible to delete codons or to 

20 substitute one or more codons with codons other than 

degenerate codons to produce a structurally modified 
polypeptide, but one v/hich has subscantialiy che same 
utility or activicy as the polypeptide produced by the 
unmiodified nucleic acid molecule. A.s recognized in the arc, 

25 the two polypeptides are functionally equivalent, as are the 

cwo nucleic acid m.olecules thac give rise to their 
production, even though the differences between the nucleic 
acid molecules are not related co che degeneracy of the 
genetic code. 



30 



Mammalian STLK2 

The full-lengch hum:an STLK2 cDNA (SSQ ID NO : 1 ) is 326S 
bp long and consists of a 1248 bp open reading fram^e (OR.F) 
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ilanked t-:.- a iSl bp 5' unnranslatsd region ;'UTR; i-13li ard 

a 1734 bp 3' TjTR (1432-3216) that is followed by a 52 
nucleotide poiyadenylated region. A polyadenvlat ion signal 
f.AATAAA) is found at positions (3193-3198). The seauerc^ 
flanking the first ATG conforms to the Kozak consensus 
(Kozak, M. , Nucleic Acids Res. 15, S125-S143 (19S7; } for a!-i 
mitiacing meohionine, and is believed to be the 
translarional start site for STLK2 . Furthermore, human 
STLK2 and che related SOK-1 and 1VIST3 proteins conserve the 
amino acid sequence im.mediately following this presumed 
initiating m.ethionine. 

Several EST fragm.encs span the complete STLK2 seauence 
wicn AA191215 at the 5' end and W16504 at the 3' end. 

Mammalian STLK3 

The partial human STLK3 cDNA (SEQ ID NO: 2) is 3030 bp 
long and consists of a 1548 bp ORF flanked by a 1476 bp 3' 
UTR (1550-3025) and a 5 nucleotide pclyadenvl a-ed recjion. t:. 
. pc-sntial poiyadenylaticn signal (AAT.^aA) begins at position 
3 004. Since the coding region is open throughcur the 5' 
extent of this sequence, this is apparently a partial cDMA 
clone lacking the N-terminal start methionine. 

Multiple EST fragm.ents span che complete STLK3 sequence 
v/ith A-A278907 ar the 5' end and A-A628477 and others at the 
3 ' end . 

■Mamm.alian STLK4 

The partial human STLK4 cDNA (SEQ ID NO: 3) is 3 3 57 bp 
long and consists of a 1242 bp ORF flanked by a 2596 bp 3' 

ci-., c;i. iti nucieocicle poiyadenylaced region. 
A potential pclyadenyiac ion signal (P-J^TAJ^Jk) is found at 
positions 2131-3322. Since the coding region is open 
througnout the 5' extent of this sequence, nhis is 

be 
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apparently a partial cDNA clone lacking the M-cerrrdnal star- 
methionine. A near full-length murine STLK4 cDNA is 
represented in the 1773 bp EST AA117438. It extends an 
additional 21 nucleotides 5' of the human STLK4 consensus, 
but since its coding region is open throughout the 5' extent 
of the sequence, this is also apparently a partial cDNA 
clone lacking the N-terminal start methionine. 

Several EST fragments span the complete STLK3 sequence 
with AA297759 at the 5' end and AA100434 and others ac the 
3' end. 



Mammalian STLK5 

The full-length human STLK5 cDNA (SEQ ID NO: 96) is 2110 
bp long and consists of a 1119 bp OR? flanked by a 229 bp 5' 

15 UTR and a 762 bp 3' UTR . The sequence flanking the first ATG 

conforms to the Kozak consensus (supra) for an initiating 
methionine, and is believed to be the t ransla t ional start 
site for STLK5 . Several EST fragments span che complete 
STLK5 sequence v;ith AA297059 and F07734 at the 5' end, and 

20- R46686 and F03423 and others at the 3' end. 

Mammalian 5TLK6 

The .full-lengTih human STLK6 cDNA (SEQ ID NO:93) is - 
2,0.01 bp long and consists of a 1,254 bp ORE flanked by a 75 
25 bp 5- UTR and a 673 bp 3' UTR. The sequence flanking the 

first ATG conforms to the Kozak consensus (supra) for an 
initiating methionine, and is believed to" be the 
translat ional start sice for STLK6 . 

30 Mammalian 3TL,K7 

The partial human STLK7 cDNA (SEQ ID NO: 100) is 311 bp 
long and consists of a 3 09 bp ORF . Since che coding region 
is open throughout both the 5' and 3' excenc of chis 

6i 
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sequence, chis is apparenrly a partial cDMA rlone lackina 
the M-cerTdnal scare methionine and C- terminal 



stcc coaon. 



Mammalian 2C1 

The full-length human 2C1 cDNA (SEQ ID NC:9) is 3738 bp 
long and consis-s of a 3717 bp ORF (7-3723) flanked by a 6 
bp 5' UTR and a 75 bp (3724-379S) 3' UTR . Mo 

?olyadenyla-ion signal (AATAAA) or polyadenylated region are 
presenc in the 3 ' UTR . The sequence flanking che first ATG 
conforms zo the Kozak consensus for an initiating 
methionine, and is believed to be the transla- lonal start 
site for human ZCl . 

Mulciple EST fragments (W31656) match the 3' end of the 
human ZCl gene, but at the tim.e of filing, the inventors 
believe that none exist' in GenBank or the EST database that 
m.atch its 5' end. 



20 
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Mammalian 7.C9 

The partial human ZC2 cDNA (SEQ ID NO: 10) is 4055 bp 
long and consists of a 38S1 bp ORF (1-3S91) and a 164 bp 
(3892-4055) 3' UTR. Since the coding region is open 
throughout the 5' extent of this sequence, this is . 
apparently a partial cDNA clone lacking the N- terminal start 
methionine. No polyadenylation signal (AJ^TAAA) or 
polyadenylated region are present' in the 3 ' UTR . 

Multiple EST fragments (R5 124 5 ) . match the 3' end of the 
human 2C2 gene, but at the time of filing, the inventors 
believe that none exist m GenBank or the EST database that 
match its 5' end. 

Mammalian zr3 

The partial human ZC3 cDNA (SEQ ID NO: 11) is 4133 bp 
long and consists of a 3973 bp ORF (1-397S; and a 152 bp 

31 
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(3979-4123) 3 ' UTR region. Since the coding region is ooen 
throughout the 5' extent of this sequence, this is 
apparently a partial cDNA clone lacking the N-terminal start 
methionine. No polyadenylat ion signal (AATAAA) or 
polyadenylated region are present in the 3 ' UTR . 

Multiple EST fragments {R54563) match the 3 ' end of the 
human ZC3 gene, but at the time of filing, the inventors 
believe that none exist in GenBank or the EST database that 
match its 5' end. 



Mam.m.alian 2C4 

The full-length human ZC4 cDNA (SEQ ID NO: 104) is 3,684 
bp long and was originally assembled from X chromcsom.e 
genomic DNA sequence. 

15 Multiple EST fragments (R9S571) match the 3 ' end of the 

human ZC4 gene, but at the time of filing, the inventors 
believe that none exist in GenBank or the EST database that 
miatch its 5' end. ZC4 gene is' also contained within the 
human genomic clone ZS3S50 . 

20 . . 

Mammalian PCHS2 

The full-length human KHS2 cDNA (SEQ ID NO: 17) is 4023 
bp long and consists of a 2632 bp ORE (6-2687) flanked by a 
5 bp (1-5) 5 'UTR and a 13 3 6 bp (2688-4023) 3' UTR. A 

25 potential polyadenylat ion signal (AATAAA) is found' at 

positions 4008-4013. No polyadenylated region is present in 
the 3'UTR. The sequence flanking the. first ATG conform.s to 
the Kozak consensus for an initiating methionine, and is 
believed to be the cranslat ional start site for human KHS2 . 

^0 Multiple EST fragments match the 5 ' end (AA446022) as 

well as the 3' end fR3 7625) of the human KHS2 aene . 
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Mammalian STTT-rn 

The full-lengch human SULUl cDNA (SEQ ID NO: 19) is 4177 
bp long and consists cf a 2594 bp ORF (415-3103) flanked by 
a 414 bp (1-414) 5 ' UTR and a 1069 bp (3109-4177) 5' "JTR 
5 followed by a 19 nucleotide polydenylated region. A 

potential polyadenylation signal (AATAAA) is found at 
positions 4164-4169. The sequence flanking che first ATG 
ccnrorms to the Kozak consensus for an initiating 
methionine, and is believed to be the translational starr 
10 site for human SULUl . 

Multiple EST fragments match the 5 ' end (N27153) as well 
as the 3' end (R9C903) of the human SULUl gene. 
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Mammalian f Murine) SULU3 

The partial murine SULU3 cDNA (SSQ ID NO: 21) is 224 9 bp 
long and consists of a 2244 bp ORF (6-2249) flanked by a 5 
bp (1-5) 5 'UTR. The sequence flanking the first ATG 
conforms to the Kozak consensus for an initiating 
methionine, and is believed to be the translational start 
site for murine SULU3 . The 3' end of the m.urine SULU3 cDNA 
shares 90% DNA sequence identity over 1620 nucleotides wirh 
human SULU3 , suggesting that these two genes are functional 
orthologues . 

One EST fragment (AA446022) matches the 3' end of the 
partial m.urine SULU3 gene, but at the tim.e of filing, the 
inventors believe that none exist in GenBank or the EST 
database that match its 5' end. 

Mammalian (Hum.an' STTT,T7-^ 

The partial human 3ULU3 cDMA (SEQ ID MO: 20) is 3 32 4 bp 
long and consists of a 23 5 3 bp ORF (2-2359) flanked bv a 
1465 bp (2360-3824) 3 ' UTR followed by a 19 nucleotide 
polydenylated region. A potential polyadenylation signal 

6^ 
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(AATAAA) is found ac positions 2602-2507. Since che coding 
region is open nhroughout che 5' extenn of this sequence., 
this is apparently a partial cDNA clone lacking che N- 
terminal start methionine. The 5' end of the human SULU3 
5 cDNA shares 90% DNA sequence identity over 1620 nucleotides 

with murine SULU2 , suggesting that these two genes are 
functional orchclogues . 

Multiple EST fragments (RC22S3) match the 3 ' end of che 
human SULU3 gene, but ac the time of filing, che inventors 
10 believe thac none exisc in GenBank or che EST database chac 

miatch its 5' end. 

Mammal ian GSK2 

The fuli-lengch human GEK2 cDNA (SEQ ID NO: 106) is 2962 
15 bp long and consists of a 2737 bp ORF (59-2795) flanked by a 

58 bp (1-5S) S'UTR. The sequence flanking the- first ATG 
conforms to the Kozak consensus for an initiating 
m.ethionine, and is believed to be the t ranslat ional start 
. site for human GHK2 . 
20 Multiple EST fragments (AJV465671) match the 5'end, but 

at the time of filing, the inventors believe that only one 
(AA3S04 92) miacches the 3' end of che human GEr:2 gene. 

Mammalian PAK4 

25 The full-length human PAK4 cDNA . (SEQ ID NO: 27) is 3604 

bp long and consists • of a 2043 bp ORF (143-2135) flanked by 
a 142 bp (1-142; 5 ' UTR and a 1419 3' UTR follov/ed- by a 22 
nucleotide polydeny laced region. A potential 

polyadenylation signal (AATTAAA) is found at positions 3532- 
30 35SS. The sequence flanking che firsc ATG conform.s to che 

Kozak consensus for an initiating methionine, and is 
believed to be che t ranslac lonal start sice for human PAK4 . 

63 
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Muiciple EST fragments (^.^1535791) match che 



the human ?AK4 gene, buc at the time of filing, the 
inventors believe that none exist in GenBank or the EST 
database that match its 5' end. 

■Ma mma ] i a r:. p^K S 

The full-length human PAK5 cDNA (SEQ ID NO: 102) is 2806 
bp long and consists of a 1773 bp ORF flanked by a 2C1 bp 5- 
UTR and a S33 bp 3' UTR . The sequence flanking the first 
ATG conforms co the Kozak consensus (supra) for an 
initiating methionine, and is believed to be the 
^ransiat lonal start site for PAK5 . 

Multiple EST fragments (AA4423S7) match the 3 ' end of 
the human PAK5 gene, but at the time of filing, the^" 
inventors believe that none exist: in GenBank or che EST 
database that match its 5' end. 

^^"Isi^ Ar^d Prober, MethoH^ .r..-, Kir^ f.^.r- ngr ecM 
of STKPn - Related KinaaoR 

A nucleic acid probe of che present invention may be 
r_o probe an appropriate chromosomal or cDNA librarv by 
usual hybridization methods to obtain other nucleic ac'd 
molecules of the present invention. A chromosomal DNA or 
CDNA library m>ay be prepared from appropriate cells 
according to recognized m.ethods in the art (cf . "Molecular 
Cloning: A Laboratory .Manual-, second edition. Cold Soring 
Haroor Laboratory, Sambrook, Fritsch, & .Maniatis, ^d= 
1S39) . 

In the alternative, chemical synthesis can be carried 
out m order to obtain nucleic acid probes having nucleotide 
sequences which correspond to N-termmal and C- termanai 
portions of the amino acid sequence cf ch- o-i v--^r ■ 

o nu.^lcric ocid prooes -a^y be used 



used 
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as primers in a pclyn^.erase chain reaction (PCR) carried ouc 
■ in accordance v/ith recognized PCR cechniques, essentially 
according zo PCR Protocols, "A Guide to Methods and 
Applications", Academic Press, Michael, et al . , eds . , 1990, 
5 utilizing the appropriate chromosomal or cDNA library to 

obtain che fragrr^ent of che present invention. 

One skilled in the art can readily design such probes 
based on the sequence disclosed herein using methods of 
compucer alignment and sequence analysis knovm in the art 

10 ("Molecular Cloning: A Laboratory Manual", 1989, supra), 

The hybridization probes of the present invention can be 
labeled by standard labeling techniques such as winh a 
radiolabel, enzyme label, fluorescent label, biocin-avidin 
label, chemiluminescence , and the like. After 

lo hybridization, the probes may be visualized using known 

methods 

The nucleic acid probes of the present invention 
include RNA, as well as DMA probes, such probes being 
generated using techniques known in the art,' The nucleic 

20 acid probe may be immobilized on a solid support. Examples 

of such solid supports include, but are not. limited to, 
plastics such as polycarbonate, complex carbohydrates such 
as agarose and sepharcse, and acrylic resins, such as 
polyacrylamide and latex beads. Techniques for coupling 

25 nucleic acid probes to such solid supports are well knov/n in 

the art . 

The test samples suitable for nucleic acid probing 
methods of the present invention include, for example, cells 
or nucleic acid extracts of cells, or biological fluids. 
30 The samples used in the above-described methods will vary 

based on the assay format, the detection method and the 
nature of the tissues, cells or extracts to be assayed. 
Methods for preparing nucleic acid extracts of cells are 

&1 
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well kncv/n in the art: and can be readily adapted in order to 
obtain a sample v/hich is coTioacible v/ith the method 
ut il i zed . 

One me-ihod of detecting the presence of nucleic acids 
5 of the invention in a sample comprises (a) contacting said 

sample v/ith the above -described nucleic acid probe under 
conditions such chac hybridization occurs, and (b) detecting 
the presence of said probe bound to said nucleic acid 
m.oiecule. One skilled in the art would select the nucleic 
10 acid probe according to techniques known in the art as 

described above. Samples to be tested include but should 
not be limited to RNA samples of human tissue. 

A Kit for detecting the presence of nucleic acids of 
tne invention in a sample comprises at least one container 
15 means having disposed therein the above -described nucleic 

acid probe. The kit may further comprise other containers 
comprising one or more of the following: wash reagents and 
reagents capable of detecting the presence of bound nucleic 
acid probe. Examples of detection reagents include, but are 
20 not limited to radiolabel led probes-, enzymatic labeled 

probes (horseradish peroxidase, alkaline phosphatase) , and 
affinity labeled probes {biotin, avidin, or steptavidin) 

In detail, a comipartmientalized kit includes anv kit in 
which reagents ar.e contained in separate containers. Such 
2^ containers include small glass containers, plastic 

containers or strips of plastic or paper. Such containers 
allow the efficient transfer of reagents from one 
compartment to another compartment such that the samoles and 
reagents are not cross -contaminated and the agents or 
30 solutions of each container can be added in a quantitative 

fashion from one compartm.ent to another. Such containers 
will include a container which vyill accept the test sample, 
a container which contains the probe or prim.ers used in the 
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assay ; containers which contain wash reagents (such as 

phosphate buffex-ed saline, Tris-buf f ers , and zhe like), and 
containers which contain che reagenrs used to deteci: the 
hybridized probe, bound ancibody, amplified produce, or the 
5 like. One skilled in zhe art: will readily recognize chat 

che nucleic acid probes described in the present invention 
can readily be incorporated into one of the established kit 
formats which are well knov;n in the art, 

10 III . DNA Constructs Comprising a STS2 0 -Related Nucleic 

Acid Molecule and Cells Containing These Conscructs. 
The present invencion also relates to a recombinann DNA 
molecule comprising, 5' to 3 ' , a prom.orer effective to 
initiate cranscripcion in a host cell and the above- 

15 described nucleic acid molecules. In addition, the present 

invention relates to .a recomJoinant DNA molecule comprising a 
vector and an above -described nucleic acid molecule. The 
present invention also relates to a nucleic acid molecule 
comprising a transcriptional region functional in a cell, a 

20 sequence complementary to an RNA. sequence encoding an amino 

acid sequence corresponding to the above -described 
polypeptide, and a transcriptional term^ination region 
functional in said cell. The above -described molecules may 
be isolated and/or purified DNA. miol.ecules. 

25 The present invention also relates to a cell- or 

organismi that contains an above -described nucleic acid 
molecule and thereby is capable of expressing a polypeptide. 

The polypeptide may be purified fromi cells v;hich have been 
altered to express the polypeptide. A cell is said to be 

30 "altered to express a desired polypeptide" when the cell, 

through genetic m.anipulat ion, is made to produce a protein 
which it normially does net produce or which the cell 
normally produces at lower levels. One skilled in the art 
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can readily adapt procedures for mcroducing and expressing 
either genomic, cDNA, or synthetic sequences into either 
eukaryocic or prokaryotic cells. 

A nucleic acid molecule, such as DNA, is said zo be 
"capable of expressing" a polypeptide if it contains 
nucleotide sequences which contain transcriptional and 
translat ional regulatory information and such seauences are 
"operably linked" to nucleotide sequences which encode the 
polypeptide. An operable linkage is a linkage in which che 
regulatory DNA sequences and the DNA sequence sought to be 
expressed are connected in such a way as to permit gene 
sequence expression. The precise nature of the regulatory 
regions needed for gene sequence expression may vary from 
organism to organism, but shall in general include a 
promoter region which, in prokaryotes, contains both the 
promoter (which directs the initiation of RNA transcription) 
as vjell as the DNA sequences which, when transcribed into 
RNA, will signal synthesis initiation. Such regions will 
normally include those 5 ' -non-coding sequences involved wich 
initiation of transcription and translation, such as the 
TATA box, capping sequence, CAAT sequence, and the like. 

If desired, the non-coding region 5' to the sequence 
encoding a kinase of the invention may be obtained by the 
above-described methods. This region may be retained for 
its transcriptional termination regulatory sequences, such 
as termination and pclyadenylat ion . Thus, by retaining the 
3 ' -region naturally contiguous to the DNA sequence encoding 
a kinase of the invention, the transcriptional termination 
signals may be provided. Where the transcriptional 
termination signals are not sacisf actorily functional m the 
expression host cell, chen a 3' region functional in the 
host cell m.ay be substituned. 

U>0 
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T'vjo DNA sequences (such as a proiTiOter region sequence 

and a sequence encoding a kinase of the invention) are said 
to be operably linked if- the nature of the linkage between 
the two DNA sequences does not (1) result in the 
introduction of a frame-shift mutation, (2) interfere v;ith 
the ability of the promoter region sequence to direct the 
transcription of a gene sequence encoding a kinase of the 
invention, or (3) interfere with the ability of nhe gene 
sequence of a kinase of the invention to be transcribed by 
the promocer region sequence. Thus, a promoter region would 
be operably linked to a DMA sequence if the promoter were 
capable of effecting cranscript ion of thac DNA sequence. 
Thus, to express a gene encoding a kinase of the invention, 
transcripcional and transiat ional signals recognized by an 
appropriate host are necessary. 

The present invention encompasses the expression of a 
gene encoding a kinase of the invention (or a functional, 
derivative thereof) in eicher prokaryotic or eukaryotic 
cells. Prokaryoric hosts are, generally, very efficient and 
convenient for the production of recombinant proteins and 
are, therefore, one type of preferred expression system for 
kinases of the invention. Prokaryoces most frequently are 
represented by various strains of E. coll. However, other 
microbial strains m.ay also be used, including other 
bacterial strains . 

In prokaryotic systems, plasmid vectors chac contain 
replication sites -and control sequences derived from a 
species compatible v/ich the hose may be used. Examples of 
suitable plasm.id vectors m.ay include pBR322, pUCllS, pUC119 
and uhe like; suitable phage or bacteriophage vectors may 
include ygtlO, ygtll and the like; and suitable virus vectors 
may include pNlAM-neo, pKRC and the like. Preferably, the 
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selected veccor of the present invention has nhe capacitv zo 
replicaue in rhe selecced hose cell. 

Recognized prokaryotiic hosts include bacceria such as 
E. coll, Bacillus, Screpcomyces , Pseudomonas, Salmonella, 
Serratia, and the like. However, under such condicions, che 
polypepride v/ill noc be glycosylated. The prokaryotic host 
must be compatible wich the replicon and control sequences 
in the expression plasmid. 

To express a kinase of the invention (or a functional 
derivative thereof) in a prokaryotic cell, ic is necessary 
to operably link the sequence encoding the kinase of che 
invention to a functional prokaryotic promoter. Such 
promoters may be either constitutive or, more preferablv, 
regulatable (i.e., inducible or derepressible) . Exarrples of 
constitutive promoters include the int ^Drom.cter of 
bacteriophage X, the bia promoter of the p-lactamase gene 
sequence of pBR322, and the cat promoter of the 
chloramphenicol acetyl transferase gene sequence of pPR325, 
and the like. Examples of inducible prokaryotic promoters 
include the major right and left promoters of bacteriophage 
A. (P^ and Pj,) , the trp, recA, AacZ, AacI , and gal promoters 
of E. coli, the a-amylase (Ulmanen et al . , J. Bacteriol . 
162:176-182, 1985) and the c- 2 8 - specif ic promoters of B. 
subtllis (Gilman et al . , Gene Sequence 32:11-20, 1984), the 
promoters of the bacteriophages of Bacillus (Gryczan, In: 
The Molecular Biology of the Bacilli, Academic Press, Inc., 
MY, 1982), and Streptomyces promoters (Ward et al . , Mcl . 
Gen. Genet. 203:468-478, 1986). Prokaryotic promoters are 
reviewed by Glick (Ind. Microbiot . 1:277-282, 1987), 
Cenatiempo (Biochimie 68:505-515, 1986), and Gottesmtan (Ann. 
Rev. Genet. 13:415-442, 1984) 

Proper expression in a prokaryotic cell also requires 
the presence of a ribcsom.e-binding si^e upstream of che qene 
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sequence-encoding sequence. Such ribosome -binding sites are 
disclosed, for example, by Gold et ai . (Ann. Rev. Microbiol. 
35:365-404, 1981). The selection of control sequences, 
expression vectors, transformation methods, and the like, 
5 are dependent on the type of host cell used to express the 

gene. As used herein, "cell", "cell line", and "cell 
culture" may be used incerchangeably and all such 
designations include progeny. Thus, the woi'ds 
" transf ormanrs" or "transformed cells" include the primary 
10 subject cell and cultures derived therefrom, v.^ithout reaard 

to the number of transfers. It is also understood thar ail 
progeny m.ay noc be precisely identical in DNA content, due 
to del iberate . or inadvertent mutations. Howevex-, as 
defined, mucant progeny have the same f uncmcnal i ty as zhac 
15 of the originally transformed cell. 

Host cells which may be used in the expression systems 
of the present invention are not strictly limited, provided 
that chey are suitable for use in the expression of the 
kinase polypeptide of interest. Suitable hosts may often 
20 include eukaryotic cells. Preferred eukaryocic hoses 

include, for example, yeast, fungi, insect cells, mamtmalian 
cells either in vivo, or in tissue culture. Mammalian cells 
which may be useful as hosts include HeLa cells, cells of 
fibroblast origin such as VERO or CHO-Kl, or cells of 
25 lymphoid origin and their derivatives. Preferred mamm.alian 

host cells include SP2/0 and J553L, as well as neuroblastoma 
cell lines such as IMR 332, which may provide better 
capacities for correcc post - translat ional processing. 

In addition, plant cells are also available as hosts, 
and control sequences compatible with olant cells are 
available, such as rhe cauliflower mosaic virus 35S and 19S, 
and nopal ine synthase promoter and polyadenylac ion signal 
sequences. Another preferred host is an insecc cell, for 
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example the Drosophila larvae. Using insect cells as hosts, 

che Drosophila alcohol dehydrogenase prorrioter can be used 
(Rubin, Science 240:1453-1459, 1988). Al cernat ively , 
bacuiovirus vectors can be engineered to express large 
5 amounts of kinases of the invention in insect cells (Jasny, 

Science 238:1553, 1987; Miller et al . , In: Genetic 
Engineering, Vol. 8, Plenum., Setlow et al . , eds . , pp. 277- 
297, 1986) . 

Any of a series of yeast expression systems can be 

10 utilized which incorporate promoter and termination elements 

from the actively expressed sequences coding for glycolytic 
enzymes that are produced in large quantities when yeast are 
grov/n in mediums rich in glucose. Known glycolytic gene 
sequences can also provide very efficient transcriptional 

15 control signals. Yeast provides substantial advancages in 

that it can also carry out post - translat ional m.odi f icat ions . 

A number of recombinant DNA strategies exist utilizing 
strong promoter sequences and high copy number plasmids 
which can be utilized for production of the desired proteins 

20 in yeast. Yeast recognizes leader sequences on cloned 

mammalian genes and secretes peptides bearing leader 
sequences (i.e., pre-peptides) . Several possible vector 
systems are available for the expression of kinases of che 
invention in a mammalian host. 

-5 A wide variety of transcriptional and translational 

regulatory sequences may be employed, depending upon the 
nature of the host. The transcriptional and translational 
regulatory signals may be derived from viral sources, such 
as adenovirus, bovine papilloma virus, cytomegalovirus, 

50 simian virus, or the like, v;here the regulatory signals are 

associated with a particular gene sequence which has a high 
level of expression. Alcernat ively , promoters from, 
mammalian expression products, such as actin, collagen. 
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myosin, and che like, vcvay be erripioysd. Transcriptional 
iniciacion regulatory signals may be selected which allov; 
for repression or activation, so that expression of the gene 
sequences can be modulated. Of interest: are regulatorv 
5 signals v/hich are temperature-sensitive so that by varying 

the temperature, expression can be repressed or initiated, 
or are subject to chem.ical (such as metabolite) regulation. 

Expression of kinases of the invention in eukaryotic 
hosts requires the use of eukaryotic regulatory regions. 

10 Such regions will, in general, include a promoter region 

sufficient to direct the initiation of RNA synthesis. 
Preferred eukaryotic promoters include, for example, the 
promoter of the mouse metal lot hionein 1 gene sequence (Kamer 
et al . , J. Mcl . Appl. Gen. 1:275-238, 1982); the TK prom.oter 

15 of Herpes virus (McKnight, Cell 31:355-365, 19S2); the 3V4C} 

early promoter (Benoist et ai . , Nature (London) 290:304-31, 
1981) ; and the yeast gai4 gene sequence promoter (Johnston 
et ai . , Proc . Natl. Acad. Sci. (USA) 79:6971-6975, 1982; 
Silver et ai . , Proc . Natl. Acad. Sci. (USA) 81:5951-5955, 

20 1984) . 

- Translation of eukaryotic mRNA is initiated at the 
codcn which encodes the first methionine. For this reason, 
it is preferable to ensure that the linkage between a 
eukaryotic promoter and a DNA sequence v/hich encodes a 

25 kinase of the invention (or a functional derivative thereof) 

does not contain any intervening codons which are capable of 
encoding a methionine (i.e., AUG). The presence of such 
codons results either in the formation of a fusion protein 
(if the AUG codon is in the sam^e reading frame as the kinase 

^0 of the invention coding sequence) or a frame-shift mutation 

(if .the AUG codon is not in the sam.e reading fram:e as the 
kinase of the .invention coding sequence) . 

G6 
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A nucleic acid molecule encoding a kinase of the 
inven-ion and an operably linked promocer may be incroduced 
into a recipient prokaryotic or eukaryocic cell eicher as a 
nonreplicating DNA or RNA molecule, vyhich may eicher be a 
5 linear molecule or, more preferably, a closed covalent 

circular molecule. Since such molecules are incapable of 
auccnom.ous replication, the expression of che gene may occur 
througn the transienc expression of the introduced sequence. 
A.lternacively , permanenc expression may occur through che 
10 integracion of the introduced DNA sequence into the host 

chrom.osome . 

A vector may be employed which is capable cf 
integrating the desired gene sequences into the host cell 
chromosome. Cells which have scably integrated the 

15 introduced DNA into their chromosomes can be selecced by 

also introducing one or more markers which allow for 
seleccion of hose cells which concain the expression vector. 

The marker may provide for prototrophy to an auxotrophic 
hose, biocide resistance, e.qr,, antibiotics, or heavy 

20 m.ecals', such as copper, or che like. The selectable marker 

gene sequence can either be directly linked to che DNA gene 
sequences to be expressed, or incroduced into the sam.e cell 
by co-cransf ection. Additional elements may also be needed 
for optimal synthesis of m.RNA . These elements m.ay include 

25 splice signals, as v/ell as t ranscripc ion promocers, 

enhancers, and term.ination signals. cDNA expression veccors 
incorporating such elements include those described by 
Okayama (Mol . Cell. Biol. 3:280-, 1983). 

The introduced nucleic acid molecule can be 

30 incorporated into a plasm-id or viral vector capable of 

autonomous replication in the recipient host. Any of a v;ide 
variety of vectors may be employed for this purpose. 
Factors of im.porcance in selecting a parcicular plasmid or 
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viral veci:or include: the ease with which recipienc cells 
that conrain che vector rr.ay be recognized and selected frorr; 
those recipient cells '.vhich do not conzeiin che vector; che 
number of copies of che vector which are desired in a 
5 particular host; and whether it is desirable to be able co 

"shuttle" the vector between host cells of different 
species . 

Preferred prokaryotic veccors include plasmids such as 
those capable of replication in E. coll (such as, for 

10 exaT:ple, p3R322, ColEl , pSClOl, pACYC 184, rcVX ; "Molecular 

Cloning: A Laboracory Manual", 19S9, supra) . Bacillus 
plasrrdds include pC194, pC221, pT127, and the liice 
(Gryczan, In: The Molecular Biology .of the Bacilli, Academic 
Press,. NY, pp. 307-329, 1982). Suitable Strep tomyces 

15. plasmids include plJlOl (Kendall et al . , J . Bacterid. 

169:4177-4183, 1987), and screptomyces bacceriophages such 
as (t)C31 (Chaner ec ai.. In: Sixth International Symposium on 
Accinom.ycetales Biology, Akademiai Kaido, Budapest, Hungary, 
■pp. 45-54, 19S6) . Pseudomonas plasmdds are reviewed by John 

20 et al. (Rev. Infect. Dis. 3:693-704, 1966), and Izaki {Jpn. 

J. Bacteriol. 33:729-742, 1978). 

Preferred eukaryotic plasmids include, for example, 
3PV, vaccinia, 3V40, 2-m.icron circle, and che like, or their 
derivatives. Such plasmids are well known in the art 

25 (Botstein et al . , Miam.i Wntr. Symp . 19:265-274, 1982; 

Broach, In: "The Molecular Biology of the Yeast 
Saccharomyces : Life Cycle and Inheritance", Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY, p. 445-470, 1981; 
Broach, Cell 28:203-204, 1982; Bollon et ai . , J. Clin. 

30 Hematol . Onccl . 10:39-48, 1980; Maniatis, In: Cell Biology: 

A Comprehensive Treatise, Vol. 3, Gene Sequence Expression, 
Academic Press, NY, pp. 563-6C&, 1980). 
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Once the vector or nucleic acid "olecuie containing the 
construct (s) has been prepared for expression, the DNA 
construct (s) may be introduced into an appropriate host cell 
by any of a variety of suitable means, i.e., t ransf orrr^ac icn , 
5 transf ect ion , conjugation, protoplast fusion, 

electroporation, particle gun technology, calcium phosphate- 
precipitation, direct microinjection, and the like. After 
the introduction of the vector, recipient cells are grown in 
a selective medium, v/hich selects for the grov/th of vector- 
10 containing cells. Expression of the cloned gene(s} results 

in the production of a kinase of the invention, or fraaments 
thereof. This can take place in the transformed cells as 
such, or follov;ing the induction of these cells to 
differentiate (for exam.ple, by administration of 
15 bromodeoxyuracil to neuroblastoma cells or the like) . A 

variety of incubation conditions can be used to form the 
peptide of the present invention. The most preferred 
conditions are those which mtimic physiological conditions. 



20 IV. The Prot 



.nvent 2.on 



A variety of methodologies known in the art can be 
utilized to obtain the polypeptides of the present 
invention. The polypeptides may be purified from tissues or 
cells that naturally produce the polypeptides. 

25 Alternatively, the above-described isolated nucleic acid 

fragments could be used to express the kinases of the 
invention in any organism. The samples of the present 
invention include cells, protein extracts or membrane 
extracts of cells, or biological fluids. The samples will 

30 vary based on the assay format, the detection method, and 

the nature of tne tissues, cells or extracts used as the 
sample . 



8NSDOCID: <WO 9953036A2_IA> 



wo 99/53036 PCT/US99/08I50 

Any eukaryotic organism can be used as a source for zh 
polypepcides of the invention, as long as the source 
organism naturally contains such polypeptides. As used 
herein, "source organism" refers to the original organism 
from, which the amino acid sequence of the subunit is 
derived, regardless of the organism, the subunit is expresses 
in and ultimately isolated from. 

One skilled in the arc can readily follov; known mechodi 
for isolating proteins in order to obtain the polypeptides 
free of natural contaminants. These include, but are not 
limited to: size -exclusion chrom.atography , HPLC, ion- 
exchange chrom^acography , and immtuno - affinicy chroma iiogr a phv 

Mammalian STLK2 

Analysis of the deduced am.ino acid sequence predicts - 
STLK2 to be an innracellular serine/threonine kinase, 
lacking both a signal sequence and transmem.brane dom.ain.- 
STLK2 contains a 21 amino acid N-termanal dom^ain, a 253 
amino acid catalytic domain with all the motifs 
characteristic of a serine/ threonine kinase, followed by a 
142 amino acid C- terminal domiain. 

STLK2 is most closely related to human STE2 0 - subfamily 
kinases, MST3 {GB : AF024 63 6 ) and SOK-1 (G3:X99325) and aC. 
elegans kinase yk34bll.5 {GB:U53153) sharing 72 . 7% , - 68 . 7% , 
and 69.3% amino acid identicy, respectively. 

The 21 amino acid N-term.inal domain of hum^an STLK2 is 
71.4% identical to the N-terminus of MST3 (GB : AF024636) , 
Human STLK2 lacks a glycine residue at position 2, and is 
therefore unlikely to undergo myr istylat ion . A Smdth- 
Waterm.an search of zhe nonredundant pronein database does 
not reveal any significant homologies rhat might suggest a 
potential function for this domiain. 
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The 253 amino acid catalytic domain of human STLK2 is 
mos- related cc human SOK-1 (X99325), MST3 {G5 : AF024 53 6 ) , C 
elegans yk32bll . 5 (G3:U53153), and STLK3 (SEQ id mo : 6 ) 
snaring 83.9%, 87.4%, 78.3%, and 49% amino idencity 
5 respectively, placing it in che STLK- subfamily of STE20- 

related kinases. The STLK2 kinase dom.ain displayed lesser 
hcm.ology to other STE20-related kinases including: 55.9% zc 
human MST2 (G3:U26424), 49.2% to hum.an GCK (GB:U07349), 
49.2% CO human KHSi (GB:U77129), and 44.2% to human HPKl 
10 (GB:U66464). The activation loop of human 3TLK2 cacaiycic 

domain is identical co that of human SOK-1 and MST3 
including the presence of four potential threonine 
phosphorylacxon sites that could serve an autoregulatory 
role on kinase activity. 

Tne 142 amino acid C- terminal domain of human 3TLK2 is 
mosc related zo human SOK-1 (X99325) , MST3 (G3 : AF024 6 3 -3 ) , 
and C. el egans yk32bl 1.5 {GB:U53153), sharing 3 9 . 9% , 39.9%, 
and 33.3% amino acid identity, respectively. This C- 
term.inal domain shares some significant am.ino acid 
similarity to the C-terminal dom.ains of the related human 
STLK3 (SEQ ID NO: 6) and STLK4 (SEQ ID NO: 7) . 

The C-terminus of che related human SOK-i (GB:X99325) 
kinase has been shown to be inhibicory to the cacaiytic 
activity of this kinase (Pombo, CM., Bonventre, J. v., 
Momar, A., Kyriakis, J. and Force, T. EMBO J. 15, 4537-4546 
(1996)). Based on the sequence identity between the C- 
termini of human SOK-1 (G3:X99325) and human 3TLK2 (39.2%), 
rhe C-cerminus of human STLK2 m.ay also function as an 
inhibitory domain for its kinase. 
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Mammalian STT,y3 

The 3030 bp human STLK3 nucleotide sequence of the 
partial cDNA clone encodes a polypeptide of 516 am.ino acids 

^0 
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(SHQ ID MO: 5) v/irh a predicted molecular mass of 56,784 
dalccns . Analysis of che deduced amino acid sequence 
predicts 3TLK3 to be an intracellular serine/ threonine 
kinase, lacking both a signal sequence and transmembrane 
domain, hov;ever the cDNA clone lacks an initiating ATG, so 
the full exuent of it N-termius is not known. STLK3 
contains a 31 amino acid N-terminal domain, a 277 amino acid 
catalytic domain with all the motifs characteristic of a 
serine/ threonine kinase, followed by a ISl amino acid C- 
Lerminal domain containing a 2 5 amino acid insert and a 2 7 
amino acid nail relative to the sequence of human STLK2 . 

STLK3 is m.ost closely related to human STE20 -subfamily 
kinases, STLK4 (SSQ ID. NO:7), MST3 ( GB : AFO 2 4 63 6 ) , • SOK- 1 
(GE:X99325) and STLPC2 (SEQ ID NO:5) sharing 71.1%, 37,6%, 
38.1%, and 33.4% amino acid identity respectively. 

The 31. am.ino acid N-term.inal domain of hum.an STLK3 
lacked any significant amiino acid sequence homologies using 
a Sm.ith- Waterman search of the nonredundant procein 
database, other than sequence similarity co proline-alanine 
repeats . - ■ - • - 

The 2 77 amino acid catalytic domain of human STLK3 is 
mosc related to human STLK4 (SEQ ID N0:7), SOX- 1 
(GB:X99325), MST3 ( GB : AFO 2 4 63 6 ) , and STLk2 (SEQ ID NO : 5 )t 
sharing 88.2%, 49.2%, 49%, -and 49% am.ino acid identity, 
respectively. In also shares strong homiology to other STKs 
from lower organisms including 51.7% to A. thallana (GB: 
AC002343) , 43.1% to .A. thaliana (GB: Z97336) , 42.1% -to A. 
thaliana (GE: U96613), and 43.3% to C. elegans (GB: U53153). 

The activation loop of the hum.an STLFC3 cacalycic dom.ain 
conserves three potential threonine phosphorylation sices 
with other members of che STLK-subfamily ,of STE2 0 - relaced 
kinases (human 3TE2 0, MST3 , STLK2 , STLK4 ) thac could serve 
an autoregularory role on kinase activity. 
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Tne iSi amino acid C-terminal domain of human STLK3 
snares 55.5% amino acid identity to human STLK4 (SEQ ID 
NO: 7), and is 100% identical to a parcial human cDNA DCHT 
(G3:AF017.635) . The C-terminal domain of human STLK3 
contaxns a 26 amino acid insert relative to human STH20. A 
similar (37.5% amino acid identity) 26 amino acid inserc is 
also present in human STLK4 . 

The 2 7 am.ino acid C-terminal tail of human 3TLK3 shares 
77.8% amino acid identity to human STLK4 , but is absent from 
other STLK-family members. . This high degree of homology 
between. the C-tail of two STLK-family members suggests they 
may be involved in an as yet unidentified procein- Dronem 
interaction . 

Tne weak sequence homology ben ween the C-cermini of 
human STLK3 and STE20, suggests in may also function as an 
inhibitory domain for its kinase. 

Mamm.alian 3TLK4 

The 3 857 bp human STLK4 nucleotide sequence of the 
partial cDNA clone encodes a polypeptide of 414 amino acids 

(SEQ ID NO: 7) with a predicted molecular m.ass of 45,451 
daltons. Analysis of the deduced amino acid sequence 
predicts STLK4 co be an intracellular serine/threonine 
kinase, lacking both a signal sequence and transmembrane 
domain, however the cDNA clone lacks an initiating ATG, so 
the full extent of it N- terminus is not known. The partial 
STLK4 protein sequence contains a 178 amino acid catalytic 
domain corresponding co the C-cermmal mocifs VI -XI of a 
serine/threonine kinase, followed by a 23 6 am.ino acid C- 
cerm.inal domain containing cwo inserts of 2 5 and 41 amino 
acids each, relative to che sequence of human STLK2 . 

STLK4 is most closely related to hum.an STE2 0 - subfamily 
kinases, STLK3 (SEQ ID. NO 6), MST3 (GE : AF024636 ) , STLK2 
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(3EQ ID NO:5), and SOK-1 (GB:X99325) sharing 71.0%, 46.5%, 
43.9%, and 37.7% amino acid idencicy, respectively. 

The 176 amino acid cacalytic domain of human STLK4 is 
most related zo human STLK3 (SEQ ID NO. 7} , SOK-1 
5 (GB:X99325) , MST3 (GB : AFO 2 4 6 3 6 ) , STLK2 (SEQ ID NO : 5 ) , and 

MSTl (GB :U13297) , sharing 88.2%, 54.2%, 54,0%, 53.7 and 
45.7% amino acid identity, respectively. It also shares 
strong homology to other STKs from lower organisms including 
56.9% -o A, chaliana (G3: AC002343), 52.5% to C. elegans 

10 (GB: U53153) , 46.2% to A. thaliana (GB : Z97336) and 45.7% to 

A, thai iana {GB: 1196513) . The activation loop of the human 
STLK4 catalytic domain conserves three potential threonine 
phosphorylation sites with other members of the STLK- 
subfam.ily of STE20 -related kinases (human STE2 0, MST3 , STLK2 

15 and STLK3 ) that could serve, an autbregulatory role on kinase 

activity . 

The 236 amino acid C-terminal domain of human STLK4 
shares 58.1% amino acid identity to both human STLK3 (SEQ ID 
NO:6) and to a partial hum.an cDNA, DCHT {GB : AFO 1 7 6 3 5 ) . The 
20 C-terminal domain of human STLK4 contains a 25 amino acid ■ 

insert relative to human SOK-1 and shares S7.5% am.ino acid 
identity to an insert present in hum.an STLK3 . 

The weak sequence homology between the C-termdni of 
• human STL.K4 . and STE20, suggests it m.ay also function as an 
25 inhibitory domain for its kinase. 

Mammialian STLK5 

The full-length 2110 bp humtan STLr:5 cDNA encodes a 
polypeptide of 373 amino acids (SEQ ID NO: 97) v;ith a 
30 predicted molecular mass of 4 1,700 daltons. Analysis of the 

deduced amino acid sequence predicts STLK5 to be an 
intracellular STE20-subf amily kinase, lacking both a signal 
sequence and transmemJorane domain. STLK5 contains a 10 

12) 
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amino acid ^7- c errr^inal . domain, a 311 amino acid catalycic 

domain v/ith all the mccifs characteristic of a 
serine/threonine kinase, and a 52 amino acid C-cerminal 
domain. 

STLK5 is most closely related to the human STE20- 
subfamily kinases STLK6 (SEQ ID No. 99) and SPAK (AF0999S9), 
sharing 51% and 33% amino acid identity, respectively, over 
ICS full extent. It also shares significant homology to 
database encries from Arabidopsis thaliana (GB : AC0G2343 } and 
C.eiegrans (GB : AL023843 , GB:ALC23843) . 

The 10 am:ino acid N- terminal domain of human STLK5 dees 
not reveal any significant: homologies co che orotein 
database . 

The 311 amino acid cacalycic domain of human STLK5 
15 shares 51% and 34 % identity to STLK6 and SPAK, 

respectively. The catalytic domain of 3TLK5 contains a 45 
ammo acid insert between kinase subdomains X and XI 
relative to human STE20. Multiple human EST fragments as 
well as a murine EST (GB : AA575647 ) contain this insert 
20 providing evidence that this region is an integral part of 

STLK5 . 

The 52 amino acid C-termmal tail of human STLK5 shares 
41.3% am.ino acid identity to human SOK-1 (G3 : X99325) . The 
v/eak sequence homology between the C- termini of human STLK5 
and STE20, suggests it m^ay also function as an inhibitory 
domain for its kinase . 



25 



Mam.malian STLKf; 

The 2,0Cl-bp human STLK6 nucleotide sequence of the 
30 com^plete cDNA entodes a polypeptide of 418 am.ino acids ;SEQ 

ID NO: 99) with a predicted molecular mass of 47,025 daltons 
Analysis of the deduced am.ino acid sequence predicts 3TLK6 
to be an intracellular STE20-subf am.ily kinase, lacking both 

If 
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a signal sequence and cransmembrane domain. STLK5 contains 
a 57 aTiino acid N-cerminal domain, a 312 amino acid 
catalytic domain wiuh all the motifs characteristic of a 
serine/threonine kinase, fcllov;ed by a 49 amino acid C- 
5 term.inal domain. 

STLK6 is most: closely related to human STE20 - subfamily 
kinases STLPvB (SEQ ID NO:97), STLK7 (SEQ ID NO : 1 0 1 ) , and 
SPAK (AF0999S9) , sharing 50%, 35%, and 30% am.ino acid 
identity over its full extent. It also shai~es significant 
10 homology no database entries from Arahidopsis thaliana 

(GB : AC0G2343 ; and C.elegans (GB:U53153). 

The 57 am.ino acid M- terminal dom.ain of hum.an STLFC6 does 
not reveal any significant homiologies in che protein 
database . 

15 The 312 am.ino acid catalytic domain of humian STLK5 

shares 51 and 30 % identity to human STLK5 and SPAK, 
respectively. 

The 4 9 amino acid C-term.inal tail of humian STLK5 shares 
low amino acid sequence identity (29%) with STLK5 and SPAK. 

20 • 

Mamm.alian 5TLK7 

The 311 bp hum.an STLK7 nucleotide sequence of the 
partial cDNA encodes a polypeptide of 103 amino acids (SEQ 
ID NO: 101) . Analysis of the deduced am.ino acid sequence 

25 predicts STLK7 to be an internal fragment of an 

intracellular STE20 -family kinase. This sequence lacks the 
N- and C-term^inal portions of STLK7 and contains only the N- 
cerminal 103 amano acids of the predicted caiialyt ic * dom.ain . 

30 Human STLK7 is most closely relaced to hum.an STS20- 

subfamily kinases SPAK (AF099989), STLK5 (SEQ ID NO:97;, and 
STLK5 (SEQ ID A'C : 99) , sharing 86%, 3S%, and 35% am.ino acid 
identity v/ithin this region of the kinase domain. It also 

13 
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snares significanc homology to database entries from 
Arabidopsis thaliana (GB : AC002343 ) and Drosophiia 
melanogascer (GB : A?006640i . 

Mammalian 7.C~ 

The 3798 bp hum.an ZCl nucleotide sequence encodes a 
polypeptide of 1239 am.ino acids (SEQ ID NO:13) wich a 
predicted molecular mass of 142,140 daltons. Analysis of 
the deduced amino acid sequence predicts ZCl to be an 
intracellular serine/threonine kinase, lacking both a signal 
sequence and transmembrane domain. The full -length ZCl 
prorem concains a 22 amino acid N-terminus, a 2^7 ammo 
acid catalytic dom.ain with all the m.otifs characteristic of 
a serine/threonine kinase, a 237 amino acid region prediccid 
tc form a coiled-coil structure, a 114 amino acid proline- 
rich region, a 256 ammo acid spacer region, followed bv a 
343 amino acid C-term,inal domain containing a potential' 
Rab/Rho-binding region. 

ZCl is most closely related to the human STE2 0- 
subfamily kinases ZC2 (SEQ ID NO:14), ZC3 (SEQ ID NO-15) 
and ZC4 (SEQ ID NO:16), sharing 61.7%, 60.9%, and 43.8% ' 
ammo acid identicy, respectively. ZCl also shares 45.5% 
amino acid identity to a C. elegans kinase encoded bv che 
cosmid 2C504.4 (GB:Z5002o). zci exhibits 90.0% ammo acid 
homology to murine NIK (G3:U88984), suggescmg ic rr.ay be the 
numan orthologue of this STK. 

The 22 amino acid N-terminal domain of human ZCl is 
58.8% identical co the C. elegrans kinase encoded bv the 

cosmdd ZC504.4 (GB:Z5002S) nr^d ^ 00'-^ H^-r--^^ t 

^-z , a..G ^uo. Identical zo murine ^JIK 

(GB: UeS984). Human ZCl lacks a olvcin^ ---iHue -- 
2, and is cherefore unlikely to undergo mvrist vlar .on . 
Smith-Wacerman search of che nonredundant protein database 
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does not reveal any significant: homologies cha- miohc 
suggest a potential function for this domain. 

The 2 67 amino acid catalytic domain of hum.an ZCl is 
most related to human STE20 -subfamily kinases, ZC3 (SEQ ID 
5 NO: 15), ZC2 (SSQ ID NO: 14), KHS2 (SEQ ID NO: 18), SOK-1 

(GB:X99325), GCK (GB:U07349), and GEK2 (SEQ ID NO:107), and 
to the C. elegans kinase encoded by the cosmid ZC504.4 
(G3:Z50029) sharing 90.6%, 90.2%, 50.6%, 47.4%, 45.4%, 42.5% 
and 82. 6% amino acid identity, respectively. The ZCl kinase 
10 dcm:ain shares 93.1% identity to murine NIK (GB:USS984). ZCl 

concains the potential "TPY" regulatory phosphorylation site 
in its activation loop. This "TPY" motif is conserved in 
other STE20-related kinases, including ZC2 , ZC3 , ZC4 , GEPC2 , 
KKS2 , SULUl, SULU3 , PAK4 and ?AK5 . 

Immediately C-terminal to the kinase domain of -human • 
ZCl is a 237 am.mo acid region predicted to form: a coiled- 
coil structure based on the Lupas algorithm: (Lupas, A. Meth. 
Enzymol . 266, 513-525 (1996)). This region of ZCl is m.osc 
related to human STE20 -subf am.ily kinases, ZC3 (SEQ ID 
N0:15), ZC2 (SEQ ID NO : 14 ) , and GSK2 (SSQ ID NC:'107)', as 
well as to human PITSLRE (GB :U04824) sharing 65.5%, 65.4%, 
25.3%, and 29.0% am.mo acid identity., respectively. The ZCl 
coiled-.coil dom^am also -shares 90.6% ammo acid homology to 
murine NIK. The C. elegans homologue ZC504.4 shares 32.2% 
sequence identity over this region. . . - 

Within the predicted coiled-coil domain of human ZCl, 
and the related ZCj , is a region predicted to form a leucine 
zipper (Leu-X6-Leu-X6-Leu-X5-Leu-X20-Leu-X6-Leu) . The fact 
tnat this leucine repeat exists v/ichin a predicted coiled- 
coil structure suggests that the leucine zipper may have a ' 
high probability of serving as a . dimeri zat ion interface 
LHirst, J.D. et al Protein Engineering 9 657-662 fl996)) 

11 
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mea.ac.ng a potential inter- or intra-moiecular dirr.erxzation 

or nurp.an ZCl . 

The 114 amino acid proline-rich region of hu~an ZCi is 
most related to human STE2 0- subfamily kinases, ZC2 (SEQ in 
NO:i4) and ZC3 (SEQ IDN0:15), sharing 35.8%, and 24. 9^," 
respectively. The ZCl proline-rich domain shares 36.4% amino 
acid homology to murine NIK (GB:U8S984). Three potential 
"Pxx?" 3H3 domain-binding motifs (I, II and III) are found 
vv-ichm the proline-rich region of hum.an ZCl. Motif I is 
conserved in human ZCi and C. elegans ZC504.4 (QB:Z50029). 
Motif II is conserved in ZCl, ZC2 , ZC3 , ZC4 and C. elegans 
ZC504.4. Motif III is conserved in ZCl, ZC2 , ZC3 and ZC47 
Motifs II and III of murine NIK have been shown to bind the 
SH3 motif of the adaptor molecule Nck (Su, Y-C. et al EMBO 
J. 16, 127S-1290 (1997)). From this evidence, hum.an ZCl mav 
nave tne potential to bind to Nek or other SH3 or WW domain- 
containing proteins and participate in growth factor- induced 
s ignal ing pathways . 

The 256 amino acid spacer region of hum.an ZCl is most 
related to human STE20-subf amily kinases, ZC2 (SEQ' IE NO : 14 ) 

and ZC3 (SEO 'D NO•^R^ 

-u JMU.15/, C.S well as to numan PITSLRE 

(GB:U04S24), shariro 5Q 9% - ,c - . 

-^-^ 29. o^, ana 26.4% amino 

acid Identity, respectively. ic also shares 39.9% ammo 
acid homology to murine NIK. The C. elegans homologue 
ZC504.4 has only limdted sequence sim.ilarity in this spacer 
region . 

The 343 ammo acid C-terminai of human ZCl is most 
related to human STE20 -subf am.ily kinases, ZC3 (SEQ in 

^"^'^ ^"^ = ---^ ZC4 (SEQ ID N0:16:., sharing 

-50 89.2%, eRQ% -r^-s -1-7 -50 . ^ 

c.na ,2.3^, ammo acid identity, respect ivelv . 

The ZCl C-terminal domain . also shares 93-. s% ammo acid 
identity to m.urine NIK. The C. elegans homoloaue ZC^O. . 
also shares 63.7% amino acid identity with tne C-tail of 

is 
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human ZCi . A lower, yet significance homolocy is also 

evident: to human KH32 (3EQ ID NC:13), GCK (GE:U07349), and 
murine citron {G3:U07349) with 26.6%, 23.1% and 36.2% amino 
acid identity, respectively. GCK is a STE20- family kinase 
5 whose C- terminal domain has been shown to bind the small G- 

protem RabS (Ren, M. et al., Proc . Natl. Acad. Sci. 93, 
5151-5155 (1996)), Citron is a non-kinase Rho-binding 
protein (Madaule, P. et al . , FE3S Lett. 377, 243-238 
(1995) ) . 

10 The sequence similarity of che C- terminal region of ZC 

ro proceins that have potential Rab- or Rhc- binding domains 
suggests thac ZCI may signal through a small G-protein- 
dependant pathway . 

15 . Mamm.alian ZC2 

The 4 05 5 bp human ZC2 nucleotide sequence of the 
partial cDNA encodes a polypeptide of 12 97 amino acids (SEQ 
ID NO: 14) 'with a predicted molecular mass of 147,735 
daltons . Analysis of the deduced amino acid sequence 

20 predicts ZC2 to be an intracellular serine/threonine kinase 

lacking both a signal sequence and transmerribrane domain, 
however the cDNA clone lacks an initiating ATG , so the full 
extenc of it N-cerminus is not known. The N-cerminally 
truncaced ZC2 protein concains a 255 amtino acid catalytic 

25 domain with all che motifs characteristic of a 

serine/ threonine kinase, a 137 amino acid region predicted 
to form a coiled-coil structure, a 184 amino acid proline- 
rich region, a 328 amino acid spacer region, followed by a 
343 amino acid C-term.inal domain concaining a potential 

30 Rab/Rhc - binding region. 

ZC2 is mosc closely related to the human STE20- 
subfamily kinases ZC3 (SEQ ID NC:15}, ZCI (SEQ ID NC:13), 
and ZC4 . (SEQ ID N0:16), sharing 38.3%, 61.7%, and 41.9% 

1^7 
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amino acid identity, respectively, and shares 4i.7% arr.ino 
acid identity to a C. elegans kinase encoded by che cosn-ad 
ZC504..4 (G3:Z50023}. 

The 255 ammo acid catalytic domain of human ZC2 is 
most related to human STE20-subf amily kinases, 2C1 (seq id 
NO:13), ZC3 (SEQ IDNO:15), SOK-1 {G3:X99325), KKS2 (SEQ ID 
NO:lS), MSTl (G3:U1S297), and GCK {GB:U07349), and to the C. 
elegrans kinase encoded by the cosmid 2C504.4 (GB:Z5002 9} 
sharing 90.2%, 39.3%, 49.0%, 48.5%, 47.9%, 45.0 and 75.7% 
amino acid identity, respectively. ZC2 contains the 
poten-ial "TPY" regulatory phosphorylation site in its 
activation loop. This "TPY" motif is conserved in other 
^E20-related kinases, including ZCl , ZC3 , ZC4 , GSK2 , KHS2, 
SULUl , SULU3 , PAK4 and PAK5 . 

Imm.ediately C- terminal to the kinase domain of human 
ZC2 is a 187 amino acid region predicted to form a coiled- 
coil structure based on the Lupas algorithm (supra) . This 
region of 2C2 is most related to human STE2 0 - subfamily 
kinases, ZCl (SEQ ID N0:13), ZC3 (SEQ ID MC:15), and GEK2 
(SEQ ID MO:107), as well as to hum.an PITSLRE (GB:UG4824), 
sharing 55.8%, 51. 5%, 29.7% and 29.5% am.inc acid identitv, 
respectively. The C. elegrans homologue ZC504.4 shares 30. S% 
sequence identity over this region. Hum.an ZC2 lacks the 
potential leucine zipper found in ZCl as a consequence of a 
29 amano acid deletion relative to ZCl and ZC3 . 

The 184 amino acid proline-rich region of human ZC2 is 
most related to human STE2 0 - subf amd ly kinases, ZC3 (SEQ ID 
KO:15) and ZCl (SEQ ID N0:13), sharing 35.9% and 28.5%,am,inc 
acid iden-ity, respectively. Significant hom.ology is also 
evident to the m.urine WW domain- binding protein WBP7 
(G3:U92455), and to the hum.an SH3 domain-binding protein 
33P-1 (GE:X67571), with 27.7% and 25.3% ammo acid identity, 
res peer ivelv" . 



BNSDOCID: <WO 9953036A2_IA> 



wo 99/53036 PCT/TJS99/08150 

ZC2 contains zvjo of the potential "PxxP" 3H3 domain- 
binding motifs {II and III) found within the proline-rich 
region of human ZCl. Motif II is conserved in ZCl, ZC3 , ZC4 
and C. eiegans ZC504.4, and Motif III is conserved in ZCl, 
5 ZC3 and ZC4 . Motifs II and III of murine NIK have been 

shown to bind the SK3 motif of che adaptor molecule Nek. 
From this evidence, human ZCl may have the potential to bind 
CO Nek or othei' 3H3 or WW domain-concaining proceins, and to 
participate in growth faccor- induced signaling pathways. 

10 The 328 amino acid spacer region of human ZC2 is most 

relaced to human STE2 0 - subfamily kinases ZCl (SEQ ID N0:13) 
and ZC3 (SEQ ID NO:15), and zo murine NIK (GB:US3934), 
sharing 31.6%, 26.9% and 25.9% amino acid identity, 
respecc ively . The C. elegans homologue ZC504.4 has only 

15 limited sequence similarity in this spacer region. 

The 34 3 amino acid C-term.inal of human ZC2 is most 
related to humian 3TE20 - subf am.ily kinases ZCl (SEQ ID NO: 13), 
ZC3 (SEQ ID NO:15) and ZC4 (SEQ ID NO:16), and to murine NIK 
(G3:USS984), sharing 88.9%, 88.3%, 41.9%, and 88.0%, am.ino 

20 acid identity, respectively. The C. elegans homologue, 

ZC504.4, also shares 67.2% amino acid identity with the C- 
tail of humian ZC2 . A lower, yet significant, hom.olcgy is 
also evident zo hum.an GCK {GB:U07349), miurine citron 
(GB:U07349) , and the S. cere^'isiae ROM2 pr^otein (GB:U19103), 

25 a Rhol GDP/GT? exchange facuor, with 22.3%, 22-. 2% and 21.9% 

amino acid identity, respectively. 

The sequence similarity of the C- term.mal ^ region of ZC2 
to proteins that have potential Rab- or Rho-binding dom.ains 
suggests chat ZC2 , like ZCl, may also signal through a small 

30 G-protein-dependant panhv/ay . 
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Mamma i -i ZC2 

The 4133 bp human ZC3 nucleotide sequence of che 

partial cDMA encod'=*s a oolvn^^^nt i Ht=^ 1 • 

^^-'-Ly^t:uu _:.a.tz: i_ lo^o ammo ac ici^ (S^Q 

ID NO: 15) wich a predicted molecular ruass of 149,906 
daltons. Analysis of the deduced amino acid sequence 
predicts ZC3 to be an intracellular serine/ threonine kinase, 
lacking both a signal sequence and transrr.embrane domain, 
however the cDMA clone lacks an initiating ATG, so the full 
extent of it N-termius is not known. The N-terminally 
truncated 2.C3 protein contains a 255 amino acid catalytic 
domain with all the motifs characteristic of a 
serine/threonine kinase: a 221 amino acid region predicted 
to for- a coiled-coil structure, a 204 amino acid prolme- 
rich region, and a 3 03 amino acid spacer region followed b 
a 34 3 amino acid C-term.inal domain containing a potential 
Rab/Rho-binding region. 

ZC3 is most closely related to the human STE2 0- 
subfamily kinases ZCl (SSQ ID NO:13), ZC2 (SEQ ID M0:14}, 
and ZC4 (SEQ ID NO:16), sharing 62.0%, 61.0%, and 42.5%" 
amino acid identity, respectively and shares 46.7% amino 
acid identity to a C. elegans kinase encoded by the cosmid 
ZC504 .4 (GB:Z50029) . 

The 2 55 am.ino acid catalytic domain of human ZC3 is 
most related to human STE20-subf amily kinases, ZCl' (SEQ ID 
N0:13), ZC2 (SEQ IDNO:14), SOK-1 (GB:X99325), KHS2 (SEQ td 
N0:18), GCK (GB : U07349) , SULUl (SEQ IDNO:22), andGEK2 (SEQ 
ID NO:107), and to che C. eiegrans kinase encoded bv che 
cosmid ZC504.4 (GB:Z50029) sharing 90.6%, 89.3%, 49.0%, 
48.3%, 45.0%, 43.1%, 42.3% and 76.7% am.ino acid identity, 
respectively. ZCl concams the potential "TPY" regulatorv 
phosphorylation site in its accivacion loop. This "t?Y" 
motif is conserved in other STE20 - related kinases, 
including ZCl, ZC2 , GEK2 , FCHS2 , SULUl , SUL.U3 , P.^K4 ' and PAK5 . 
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Imtr.ediaceiy C- terminal to the kinase domain of human 
2C3 IS a 221 amine aoid region predicted co form a coiled- 
coil struccure based on nhe Lupas algorithm (supra) . This 
region of ZC3 is m.osr homologous to human 3TE2 0 - subf amil v 
kinases, 2C1 (SSQ ID N0:13), ZC2 (SEQ ID NC:14;, and GEK2 
(SEQ ID NO:107), sharing 66.9%, 61.5%, and 27.5% identify, 
as well as to rat PLC-beta (GB:A45493) and human PITSLRE 
(GB:H54024) sharing 29.6% and 25.9% amino acid identify, 
respectively. The C. elegans hom.ologue ZC504.4 shares 26.3% 
sequence identify over this region. 

Within the predicted coiled-coil dom^ain of human ZC3 , 
and the related ZCl , is a region predicted to formt a leucine 
zrpper {Leu-X6 -Leu-X5 -Leu-X6 -Leu-X20 -Leu-X6 -Leu) . The facc 
tnat this leucine repeat exists v/ithin a predicted coiled- 
coil structure suggests that the leucine zipper may have a 
hign probability of serving as a dimerization interface 
iKrrst, J.D. et al Protein Engineering 9 657-662 (1996)) 
mediating a potential inter- or intra - molecular dimerization 
of humian ZC3 . 

The 204 am.ino acid proline-rich region of human ZC3 is 
mosc relaced to human STE20 -subf am.ily kinases, ZCl (SEQ ID 
NO:13) and ZC2 (SEQ ID N0:14), sharing 66.9% and 61.5% amino 
acid identity, respectively. 

ZC3 contains tv/o of the potential "Pxx?" SH3 domiain- 
binding m.onifs (II and III) found within the proline -rich 
region of human ZCl . Motif II is conserved m ZCl,- ZC2 , ZC4 
and C. elegans ZC504.4; Motif III is conserved in ZCl, ZC2 
and ZC4 . Motifs. II and III of murine NIK have been shown 
bind the SH3 m.otif of the adap-or molecule Nek. From this 
evidence, hum.an ZC3 m^ay have che potential to bind zo Nc! 
other SH3 or WW domain- concainmg proteins and oarticiioate 
m growth faccor- induced signaling pathv/avs . 



o 
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Tne 3 03 amino acid acid spacer region zf human ZC3 is 
most relaced zc human STE20 -subfamily kinases, ZCl (SEQ IC 
NO: 13) and ZC2 (SEQ ID NG;14) sharing 30.1%, and 27.1% amino 
acid identity, respectively. The C. elegans homologue 
5 ZC504.4 lacks nearly the entire spacer region of ZC3 . 

The 34 3 amino acid C- terminal of human ZC3 is most 
related co human STE2 0 - subfamily kinases, ZCl (SEQ ID 
N0:13), ZC2 (SEQ ID N0:14) and ZC4 {SEQ ID NO:16), sharing 
39.2%, SS.9%, and 42.5%, amino acid identity, respectively. 

10 The C. elegans hom.ologue ZC504.4 also shares 67.2% amino 

acid identity v.^ich the C-tail of hum.an ZC3 . A lower, yez 
significant, hom.ology is also evident to human QCK 
(G3:U07349), as well as to the non-kinases murine citron 
(GB:UG7349) and the S, cerevlsiae R0M2 protein (GE:U19103), 

15 a Rhol GDP/GT? exchange factor, with 21.6%, 32.4% and 22.9% 

amino acid idencity, respectively. 

The sequence similarity cf the C-terminal region of ZC3 
to proteins that have potential Rab- or Rho-binding domains 
suggests that ZC3 , like ZCl and ZC2 , m.ay signal through a 

20 small G-protein-dependant pathway. 

Mammalian ZC4 

The 3,684 bp hum.an ZC4 nucleotide sequence of the 
complete cDNA encodes a polypeptide of 1,22 7 amino acids 

25 (SEQ ID NO: 105) v/ith a predicted molecular mass of 138,205 

Daltons. Analysis of the deduced amino acid sequence 
predicts ZC4 to be an intracellular STE20 -subfamily kinase, 
lacking both a signal sequence and a t ransmeiribrane domain. 
The full-length ZC4 protein contains a 25 amino acid N- 

30 terminus, a 265 amino acid catalytic domain with all the 

motifs characteristic of a serine/ threonine kinase, a 108 
amino acid region predicted to form a coiled-coil structure, 
a 231 amino acid proline-rich region, a 40 amino acid region 
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predicted co forni a coiled-coil structure spacer region, a 

204 amino acid spacer region (domain B) , foiiov/ed by a 355 
amino acid C-cerminal domain containing a pocenciai Rab/Rhc- 
binding region (domain C) . 
5 ZC4 is most closely related zo human ZCl (SEQ ID NC:13, 

also knov/n as human HGK, human KIAA06S7, murine NIK; human 
AC005035, human NIK, and C. elegans KIG-15), ZC2 (SEQ ID 
N0:14, similar to partial sequence human KIAA0551), and ZC3 
(SEvQ ID NO: 15) . An assem.bled genomic fragm.enc in the 

10 database (Z3385C) is identical to ZC4 , except for 

inappropriate identification of the exon boundaries. (Abo 
et al. (199S) EMBO J. 17: 6527-6540.) 

The 2 5 amino acid N-termiinai domain of -hum.an ZC4 shares 
weak homology no human ZCl in its C-term.inal extent, but 

15 ocherv/ise does not reveal any significant homologies to the 

protein database. 

The 265 amino acid catalytic domain of human ZC4 is 
most related to human ZCl (SEQ ID NO:13), ZC3 (SEQ ID 
NO:15), and ZC2 (SEQ ID M0:14), sharing 63%, 64% and 62% 

20 am.ino acid identity, respectively. 

Immediately C- terminal to the kinase domain of hum:an 
ZC4 is a 108 am.ino acid region predicted to form, a coiled- 
coil structure based on the Lupas . algorithm (supra). This 
region is most related to human ZCl (SEQ ID NO:13), ZC3 (SEQ 

25 ID NO:15), and ZC2 (SEQ ID N0:14), sharing 29%, 25% and 20% 

amino acid identity, respectively. 

The 231 am:ino acid proline -rich region of human ZC4 
does not reveal any significant homologies to the protein 
database. This region of ZC4 contains tv;o ''FxxP" mocifs 

30 that could potentially bind to proteins containing 3H3 or VJW 

domains and allow ZC4 to participate in growth factor 
activated signaling pathways. In addition, within the pro- 
rich domain of human ZC4 is a region predicted to form a 
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leucine zipper (Leu-Xb - Leu-X6 -Leu-X6 -Leu-X20 -Leu-Xb -Leu) 
v/hich may serve as a dimerizaricn interface. The ZC STE20 
subfamily kinases (ZCl, ZC2 and ZC3) have similarly located 
"PxxP' motifs and potential Leu zippers. 

Immediately C-terminal to the proline-rich region of 

human ZC4 is a 40 am.ino acid region also predicted to form a 
coiled-coil structure based on the Lupas algorithm. This 

region of human ZC4 does not reveal any significant 

homologies to the procein database. 

The 204 am.ino acid acidic- and serine-rich domain "B" 

of ZC4 does not reveal any significant: homologies to the 

procein database . 

The 355 am.ino acid C-cerminal of hum^an ZC4 is, m.ost 

related to hum.an ZCl (SEQ ID N0:13), ZC3 (SEQ ID NO:15). and 

ZC2 (SEQ ID N0:.14), sharing 43%, 42% and 42% am.ino acid 

identity, respectively . 

The sequence similarity of the C-term.inal region of ZC4 

to proteins that have potential Rab- or Rho-binding domains 

suggests that ZC4 , like other ZC-subf amdly STE20 kinases, 

m:ay signal through a sm.all G-protein-dependant pathway. 

Mammialian KH52 

The 4023 bp human KK52 nucleotide sequence encodes a 
polypeptide of 894 amino acids (SEQ ID N0:1S) with a 
predicted m.clecular mass of 101,327 daltons. Analysis of 
the deduced amino acid sequence predicts rCH32 to be an 
intracellular serine/ threonine kinase, lacking both a signal 
sequence and transmembrane domain. The full-length KHS2 
protein contains a 13 amino acid N- terminus, a 260 am^ino 
acid catalytic domain v/ith all the motifs characteristic of 
a serine/threonine kinase, a 73 amiino acid spacer region, a 
138 proline-rich region, follov/ed by a 360 amiino acid C- 
terminal domain containing a potencial Rab/Rhc- binding site. 
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KHS2 is T.csc closely relaced cc the human STE20- 
subfamily kinases KHSl {GB : 1117712 9 ) , GCK (GB : IJ0734 9 ) , and 
HPKl (GB:U07349) , sharing 65.5%, 51.9%; and 44.9% ammo acid 
identity, respectively and shares 38.5% amino acid identity 
to a C. elegans STK (GB:U55363). 



not reveal any significant homologies that might suggest a 
potential function for this dom.ain when examined by a Smith- 
Wacerman alignment to the nonredundant protein dacabase. 
Hum.an KHS2 lacks a glycine residue at position 2, and is 
therefore unlikely to undergo myristylat ion . 

The 260 amino acid catalytic domain of human KKS2 is 
most relaced to human STE2 0 - subf ami ly kinases KHSl 
(G3:U177129) , GCK (GB:U07349), HPKl (GB:U66464), 30K-1 
(G3:X99325) , MSTl (GB:U1S297), ZCl ( SEQ ID NO: 13), and to 
the C. elegans kinase (GB:U55363}, sharing 35.4%, 75.1%, 
67.7%, 51.4%, 43.1%, 49.8% and 72.0% amino acid identity, 
respectively. KHS2 contains the potential "TPY" regulatory 
phosphorylation site in its activation loop. This ^^tpY" 
motif is conserved in other STE2 0 - related kinases, 
includd-vg ZCl, ZC2 , ZC3 , ZC4 , GEK2 , 3ULU1 , SUL,U3 , PAK4 and 
PAK5 . 

The 73 amino acid acid spacer region of human KHS2 is 
most related to hum.an STE2 0 - subf am.ily kinases, KHSl 
(GB:U177129) , HPKl (G3:U66464) and GCK (GB:U07349), sharing 
60.3%, 43.5% and 44.0%, amino acid identity, respectively. 

The 188 amiino acid proline-rich region of human KHS2 is 
most related to hum^an STE20 -subf am.iiy kinases, HPKl 
(GB:U66464) , GCK (G3:u07349) and KHSl (GB : U17712 9 ) , sharing 
33.3%, 31.9% and 31.4%, amino acid identity, respectively. 

Two potential ''?xxP" 3H3 dom.ain-binding motifs fi and 
II) are found v;ithin che proline-rich region of human KHS2 . 
Motif I is conserved v/ich humian KHSl and HPKl; mocif II is 



13 amino acid N- terminal domain of human KHS2 does 
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conserved wich GCK and KHS2 . A 192 amino acid region of 
human HPKl concaining rrocif ii has been shown co bind co zhe 
C- terminal SK3 motif of the adaptor molecule Grb2 (Anafi, M 
et al, J. Biol. Chem. J. 272, 27804-27811 (1997)). Human 
KHS2 may bind SH3 or WW domain-containing proteins through 
this proiine-rich r 



-s most 



eg ion . 

The 360 amino acid C- terminal of hum.an KHS2 i: 
related to KHSl (GB :U177129) , GCK (GB:U07349) and HPKl 
(G3:U66464), and to the C. elegans kinase (GE:U55363), 
sharing 74.9%, 54.8%, 42.9%, and 31.0%, amino acid xdencity, 
respectively. GCK is a STE20-famdly kinase whose C-cerm.inal 
domain has been shown to bind the small G-protean RabS (Ren, 
M. etal., Proc. Natl. Acad. Sci . 93, 5151-5155 (1996)). 

^5 Mammalian SITT.Tti 

The 4196 b? human SULUi nucleotide sequence encodes a 
polypeptide of 898 amino acids (SEQ ID NO: 22) with a 
predicted molecular mass of 105,402 dalcons. Analysis of 
the deduced amino acid sequence predicts SULUi to be an 
intracellular serine/threonine kinase, lacking both a signal 
sequence and transmembrane domain. The full-length SULUI 
protein contains a 21 amino acid N- terminus, a 256 amino 
acid catalytic dom.ain with all the motifs characteris-ic of 
a serine/threonine kinase, a 150 amino acid spacer region, a 
210 amino acid region predicted to form a coiled-coil 
structure, a 114 amino acid spacer region and a 147 am.ino 
acid C-terminal domain predicced to form a coiled-coil 
structure . 

SULUI is most closely related to the STE2 0 - subfamily 
kinases murine SULU3 (SEQ Id NO: 24), human SULU3 (SEQ Id" 
NO: 23), and zc the C. eiegans kinase SULU (GB:U11230), 
sharing 53.9%, 72.2% and 3S.2% ammo acid identity, 
respectively. 

Hi 
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The 21 amino acid N- terminal domain of human SULUl is 
most relaced to murine 3ULU3 (SEQ ID NO:24; and to che C. 
elegans kinase 3ULU (GB:U11230), sharing 85.3% and 62.3% 
amino acid identicy. Human SULUl lacks a glycine residue az 
position 2, and is therefore unlikely co undergo 
mtyristoylat ion . A Smi ch- Wat erman search of the nonredundanc 
protein database does not reveal any significant hom.ologies 
that miight suggest a potential function for this dom.ain. 

The 256 amino acid catalytic domain of humian SULUl is 
most related to murine SULU3 (SEQ ID NO:24), and to hum.an 
SOK-1 (G3:X99325) , STLK2 {SEQ ID MO : 5 ) , MSTl (GB:U18237}, 
?AK1 {QE;U24152), ZC2 (SEQ ID NO:14), and KHS2 (SEQ ID 
MO: IS) sharing S6.3%, 48.1%, 46. 9%, 45.2%, 43.3%, 43.1% and 
42.0% amtino acid idencity, respectively. The C. elegans 
SULU STK (GB:U11280) shares 62.3% sequence identity over 
this region. SULUl contains the pcnential "TPY" regulatory 
phosphorylation site in its activation loop. This ''T?Y" 
m.otif is conserved in other STE2 0 - related kinases, 
including ZCl , 2C2 , ZC3 , ZC4 , GEK2 , KHS2 , 3ULU3 , PAK4 and 
PAK5 . 

The 150 amino acid spacer region of hum.an SULUl is mcsc 
related to hum.an SULU3 (SEQ ID MO: 23) and to che C. elegans 
kinase (GE :U11280) , sharing 53.5% and 10.4% amino acid - 
identity, respectively. 

Imm.ediately C-terminal to the spacer region of human 
SULUl is a 210 amino acid region predicted to formi a coiied- 
coil structure based on the Lupas algorithmi. This region of 
SULUl is most related to SULU3 (SEQ ID MO:23) , the C. 
eiegrans SULU kinase (GB : U1128Q) , GEK 2 (SEQ ID MO:107) and 
ZCl (SEQ ID MO:13/, sharing 63 . 6% , 25 . 3% , 23 . 2% , and 22.8% 
amino acid identity, respectively. 

The 114 amino acid spacer region human SULUl is most 
related to human SULU3 (SEQ ID MO: 24) with 73.7% amino acid 
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sequence identicy. A lower, yec significant:, homcloav is 
aisc evident tc murine PITSLRE (G3:u04324) and DL?: 
{GB:A5531£), human ZCi (SEQ ID NO:i3) and GEK 2 (SEQ ID 
NC:107), as well as to the C. elegrans SULU STK (G3:U112SC), 
5 sharing 39.7%, 35.4%, 29.5%, 23.6% and 37.6% au.ino acid 

identicy, respectively . 

Imrriediately C- terminal to the second spacer region of 
human S'JLUl is a 14 7 amino acid region predicted zo form a 
coiled-coil srruccure based on the Lupas algorithm. This 
10 region of 5ULLJ1 is mcsc related co human SULU3 (SEQ ID 

NO: 24), ZCI (SEQ ID NO: 13) and GEK 2 (SEQ ID NO: 107), as 
v/ell as to the C. elegans SULU STK (GB:U11230) , sharing 
73.3%, 23.4%, 26.1% and 39.5%, amtino acid identity, 
respecc i vel v . 

15 

Mam^mialian (human) 3ULU3 

The 3S24 bp partial cDNA hum^an SULU3 nucleotide 
sequence encodes a polypeptide of 786 am.ino acids. (SEQ ID 
NO: 23) v;ich a predicted molecular mass of 92,03 7 daltons. 

20 Analysis of the deduced amtino acid sequence predicts 3ULU3 

to be an intracellular serine/ threonine kinase lacking a 
trans mem;brane dom^ain. The N- cerm.inall v truncated human 
SULU3 prone in contains a 66 amiino acid oartial catalvtic 
domain fcllovyed by a 149 amino acid spacex' region, a 210 

25 amino acid region predicted to form a coiled-coil structure, 

a second spacer region of 114 amino acids, a 247 amino acid 
C-terminal region predicted to form a second coiled-coil 
structure and a 100 amino acid C-cermdnal tail. 

Human SULU3 is most closely related murine 3ULL'3 (SEQ 

30 ID NO: 24), human SULUl (SEQ ID MO: 22), and zo the C. elegans 

SULU kinase (GB:U112 30), sharing 66.3%, 68.9% and 32.9% 
am.ino acid identity, respectively. The hiah seauence 
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horr.ology becween murine and human SULU3 suggests chaii zhese 
two procsins are orthclogs of each ocher. 

The 66 amino acid partial catalytic domain of human 
SULU3 is most related to murine SULU3 (SEQ ID NO:24), and to 
5 the human STS20 subfamily kinases ZCl (SEQ ID NO:13), STE20 

(GB:X99325), KJ^IS 1 (GB : Ul 7 7 12 9 ) and GEK 2 *(£EQ ID NO : 1 0 7 ) , as 
well as CO the C. el&gans SULU kinase (GB:ail2S0), sharing 
83.3%, 47.0%, 45.5%, 43. 5%, 41. 3% and 55.6% amino acid 
identify, respectively . 

10 The 14 9 amino acid spacer region of human SULU3 is most 

related to m.urine SULU3 (SEQ ID NO: 24) , human STE2 0 
(G3:X99325) , MSTl (GB:U13297), and to che C.eiegrans SULU 
kinase ( GB:U112S0) sharing 9S.7%, 21.9% and 21.3% amino 
acid identity, respectively. 

15 Immediately C-terminal to the first spacer region of 

human SULU3 is a 210 amino acid region predic::ed to form a 
cciled-coil struccure based on the Lupas algorithm. This 
region of SULU3 is mosc related to murine 3ULU3 (SEQ ID 
MO:24), and to human SULUl (SEQ ID NO:22); ZCl (SEQ ID 

20 NO:13) and GEK 2 (SEQ ID NO:107), as well as to the C. 

alegans SULU kinase (GB:U11280), sharing 99. 5%, 68.6%, 
27.4% and 22.5% am.ino acid identity, respect ively . 

The 114 amino acid second spacer region of human SULU3 
is most related to murine SULU3 (SEQ- ID NO:24), and co hum^an 

25 SULUl (SEQ ID NO:22) GEK 2 (SEQ ID NO:107)-, and ZCl (SEQ ID 

N0:13), as well as to the C. elegans SULU kinase- 
(GB:U11280) , sharing 99,1%, 73.7%, 24. 6%, 24.1% and 41.2% 
amino acid identicy, respecc ively , 

Imimediacely C- terminal to the second spacer region of 

30 human SULU3 is a 24 7 amino acid region predicted eo form a 

■ coiled-ccil structure based on the Lupas algorithm (supra) . 
This" region of SULU3 is m^ost related to human SULUl (SEQ ID 
NO:22) and ZCl (SEQ ID NO:13) as well as to rat PKN- 
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(-B.D2O180.) rr.urine pl60 ROCKl (G3:U5S512), and ^ho - 

eiegans SUL-U kinase (GB:U112S0), sharing 73.7%, 26.7%,' 2. 0^- 
ana 21.0% amino acid idenci-y, respectively. 

The 100 amino acid C-tail of human SULU3 is most 
related to a human prion protein (C-B : L3S 993 ) , with 45.0% 
amiiic acid identity. 

Mammalian fm urine) SUL'lT3 

Tfte 2249 bp murine, partial cDNA SULU3 nucleotide 
sequence encodes a polypeptide of 748 ammo acids (SEQ ID 
MO:24; with a predicted molecular mass of 37,520 da^^ons 
Analysis of the deduced amxno ac.d sequence oredxcts SULU^ 
to be an intracellular serxne/threonrne kinase, lacking both 
a signal sequence and transmembrane domain. The partial 

murine SULU3 oroceir conf-ginc, =, n c; = • 

. o.exx. contains a 2b am.xno acxd N- terminus, a 

248 amino acid catalytic domain with all the motifs 
characteristic of a serine/rhreonine kinase, a 149 amino 
acxd spacer region, a 210 amino acid region oredicted to 
form a coiled-coil structure, and a ii6 ammo acid spacer 
20 region. 

Murine SULU3 is m.ost closely related to human SUUJ3 
(SEQ ID NO:23) and 3ULU1 (SEQ ID NO:22.), as well as co the 
C. eieaans SULU kinase (GB:rjii2 80), sharxng 97.0%, 72.3% 
and 38.4% amdno acid xdentity, respectively. The hiah 
sequence homology between murxne and human SULUS suggests 
that these two proteins are orthologs. 

The 25 amino acid N-terminal domain of murine SULU3 is 
rr.ost related to human SULUl (SEQ ID KT0:22) and to the C. 
elegans SULU kxnase (G3:U11280), sharing 70.0% and 44.4% 
amino acid identity, respectively. 

Murine SULU3 lacks a glycine residue a. oosition 2, and 
therefore unlikely to undergo rryrxstoylation . A Smich- 
Wacerman search of the nonredundanc protein database does 
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not reveal any significant horriologiss that might suggest a 
potenmal function for this domain. 

The 24S amine acid cacaiycic domain of m.urine SULU2 is 
most related to human SULUl (SEQ ID NO:22), STE2C 
5 (GB:X99325), ZCl (SEQ ID NO;13), and KHSl (GB:U77129), as 

well as to the C. elegans SULU kinase (GB:U112S0), sharing 
86.7%, 46.6%, 43.3%, 59.4% amino acid identity, 
respeccively . Murine SULU3 contains the poiientiial "TPY" 
regulatory phosphorylation site in its activation loop. This 
10 i-TPY" motif is conserved in other STE2C- related kinases, 

including ZC2 , ZC3 , ZC4 , GEK2 , KKS2 , SULUl, SULU3 , PAK4 and 
PAK5 . 

The 14 9 amino acid spacer of murine SULU3 is most 
related co human SUL.U3 (SEQ ID NO:23), SULUl (SEQ ID NC:22), 
15 and 3TE2 0 (GB:X9 932 5) , as well as to the C . elegans SULU 

(GB:U11230; and the 5. cerei^isiae STE2C (G3:L04655) kinases, 
• sharing 98.7%, 53.4%, 21.9%, 59.4% and 21.9% am.ino acid 
identity , respectively . 

Immediately C-cerminal to rhe spacer region of murine 
20 SULU3 is a 210 amino acid region predicted to form a coilisd- 

coil srruccure based on the Lupas algorithm. This region of 
murine SULU3 is most related to human SULU3 (SEQ ID NO:,2 3) , 
ZCl (SEQ ID NO:13), and GEK 2 (SEQ ID NO:107), as vvell ^as to 
the C. elegans SULU kinase (GE:U1128C), sharing' 99.5%, 
25 . 27.4%-, 22.5% and 29.2% amino acid identity, respectively. 

The 116 amdno acid C-cerminal spacer region of murine 
SULU3 is most related to humvan SULU3 (SEQ ID NO:23), GEK 2 
(SEQ ID MC:107), and ZCl (SEQ ID NO:13), well as to the C. 
elegans SULU kinase (G3:U112S0', sharing 93.3%, 24.6%, 24.1% 
30 -and 40.5% amino acid identirv, respec t ivel v . 
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?'^Iam.rr:alian ^ir- . rine /human) SULU3 

The 224 9 bp murine SULU3 and the 3824 bp human SULU3 
cDNAs contain a 1620 nucleotide overlap (541 amino acids) 
with 90% and 98% DNA and amino acid sequence identitv, 
respectively. Owing to the high degree of sequence identity 
m this extended overlap, we propose that these are 
functional orthologues of a single gene. The comb)ined 
murine/human 4 4 92 bp SuLU3 sequence encodes a polypeptide of 
1001 amino acids (SSQ ID NO: 31) with a predicted m.olecular 
m.ass of 115,069 daltons. Analysis of the deduced amino acid 
sequence prediccs SULU3 to be an intracellular 
serine ,/threonine kinase, lacking both a signal sequence and 
transmembrane domain. SULU3 contains a 25 amino acid N- 
terrrdnus, a 24 8 amino acid catalytic dom.ain with all the 
!:> motifs characteristic of a serine/threonine kinase, a 149 

. amino acid spacer region, a 210 amino acid region predicted 
CO form a cciled-coil structure and a second spacer region 
°- ■^■'■"^ ammo acids, a 247 amino acid C-cerminal reaion 
predicted to form a second coiled-coil structure and a 100 
amino acid C-terminal tail. The murine SULU3 clone lacks 
the region from the second C-terminal coiled-coil to the C- 
terminus, whereas the human clone lacks che N- terminal 
domain, and all bur 66 amino acids of the 24 3 amino acid 
kinase domain. 

SULU3 is most closely related to SULUl (SEQ ID NO: 22) 
and the C. elegraiis SULU kinase (GB:U11280) sharing 72.3% and 
38.4% am.ino acid identity, respectively. 

Tne 2 5 amdno acid N- terminal domain of SULU3 is most 
related to human SULUl (SEQ ID NO:22) and to the C. . el egans 
SULU kinase {GB:U112 8C) . sharing 70.0% and 4 4.4% amino acid 
identity, respectively. SULU3 lacks a give: ::e residue at 
position 2, and is therefore unlikely to undergo 
myristylation. ■ A Smith- Wat erm.an search of the nonredundant 
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protein dacabase does not reveal any significanc homologies 
chat might . suggest a potential function for this domain. 

The 248 amino acid catalytic domain of SULU3 is most 
related co human SULUl (SEQ ID NO:22), SOK-1 (G3;X99325), 
5 ZCl (SEQ ID NO:13), KHSl (GB:U77129) and the C. elegans SULU 

kinase ( GB : Ul 12 S 0 ) , sharing 86.7%, 46.6%, 43.3%, 42.0% and 
59.4% amino acid idencity respect ively . SULU3 contains the 
potential "TPY" regulatory phosphorylation site in its 
activation loop. This "tpY" motif is conserved in ocher 

10 STE20-related kinases, including ZC2 , ZC3 , ZC4 , GEK2 , KHS2 , 

SULUi , PAK4 and ?AK5 . 

The 149 amino acid spacer of SULU3 is mosc relaced co 
SULUl (SEQ ID NO: 22) and 30K- 1 (GB:X9932 5} , and co che C. 
elegans SULU {GB:U112S0), and S, cerexrisiae STE20 

15 (GB:L04655) kinases, sharing 53.4%, 21.9%, 59.4% and 21.9% 

amino acid idencity, respectively.. 

Immediately C- terminal to the spacer region of 3ULU3 is 
a 210 ammo acid region predicted to form a coiled-coil 
structure based on the Lupas algorithm. This region is most 

20 related co ZCl (SEQ ID NO:13), GSK 2 (SEQ ID NO:-107), and ■ 

the C. elegans SULU kinase (GB:U112S0), sharing 27.4% 22.5% 
and 29.2% amino acid identity, respectively. 

The 114 amino acid spacer region of SULU3 is mosc 
related to human SULUl (SEQ ID NO:22) , GEK 2 (SEQ -ID 

25 . NO:107), ZCl (SEQ ID NO:13), and to the C, elegans SULU 

kinase (GB:U11280), sharing 73.7%, 24.6%, 24.1% and 41.2% 
amino acid identity, respecc i vely . 

Immediately C-term.inal to the second . spacer region of 
SULU3 is a 247 amiino acid region predicted co form a coiled- 

30 coil structure based on the Lupas . algorithmi . This region of 

SULU3 is mosc relaced to human SULUl (SEQ ID NO: 22) and ZCl 
(SEQ ID NO:13), as well as co rat PKN (GB:D26180), murine 
pl60 ROCKl (G3:U53512) and che C. elegans SULU kinase 

q5 
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(C-3:U11280) , sharing 73.7%, 25.7%, 24.0%, 21.0% and 37.5% 
amino acid idencicy, respectively. 

The 100 amino acid C-tail of SULU3 is mosc relaced to i 
human prion protein (GB:L38993). with 45.0% amino acid 
identity . 

Mamrp.alian GEK2 

The 2 92 6 bp human GEK2 nucleotide sequence cf the 
complete cDNA encodes a polypepcide of 963 amino acids (SEQ 
ID NC:107) with a predicted molecular mass of 112,120 
daltons. Analysis of the deduced amino acid sequence 
predicts GE?:2. to be an intracellular serine/threonine 
kinase, lacking both a signal sequence and transmembrane 
domain. The complete GEK2 protein contains a 33 amino acid 
15 M-term.inus, a 251 amino acid cacalycic dom.ain with all che 

motifs characteristic of a serine/ threonine kinase, a 43 
amine acid spacer region, a 135 amino acid proline-rich 
region, a 252 amino acid region predicted to form a coiled- 
coil structure followed by a 244 amino acid region also 
predicted to form a coiled-coil structure. 

GEK2 is most closely related to rat ATl-46 (GB:U33472) 
(a partial cDNA that expends from the middle of the first 
potencial coiled-coil domain of GEK2 to che C-term.inus) , 
murine LOK (GB:D89728), Xenopus laevis polo-like kinase 1 
(GB :AF100165} , and human SLK (GB :AB002804 ) , sharing 91.3%, 
38.5%, 65.0%, and 44.7% amino acid identity, respectively. 
The high sequence hom.ology between human GSK2 , m.urine LOK 
and rat ATl-4 6 suggests thac human GSK2 is a highly related 
protein cc the rodenc f orm.s , or alcernatively , its 
orthologue. Recencly, a full-length version of GEK2 was 
reported (STKIO or human LOK A30i5718) . The 968 ammo acid 
sequence is 99% identical to GEK2 (SEQ ID NO:107). 
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The 3 3 amino acid M-cerminai dorrtain of human GEK2 is 
mosc related to murine LOK (GB:DS972S) and to human SLK 
(GB:AB002304) , sharing 100% and 54.5% am.ino acid identity, 
respect ively . 

Human GEK2 lacks a glycine residue at position 2, and 
is therefore unlikely to undergo myristyla t ion . A Smith- 
Waterman search of the nonredundant protein database does 
non reveal any significant homologies than m.ight suggest a 
potential function for this domain. 

The 261 amdno acid catalytic domain of human GEK2 is 
most related to murine LOK (GB:DS972S), rat ATl-46 
(GB:D89728) and human SLK (GB : AB002804 ) as well as to a C. 
elegans kinase {G3:ZS14bO), sharing 97.7%, 90.8%, 54.5% and 
55.9% amino acid identity, respectively. GEK2 contains the 
potential "TPY" regulacory phosphorylation . site in its 
activacion loop. This "TPY". motif is conserved in other 
STE20-related kinases, including ZC2 , ZC3 , ZC4 , GEK2 , KHS2 , 
SULUl , SULU3 , PAK4 and PAK5 . 

The 4 3 amino acid spacer region of human GSK2 is most 
related to murine LOK (GB:D89723) and to human SLK, sharing 
83.7% and 77.6% amino acid identity, respectively. 

The 135 amino acid prcline-rich region of human GEK2 is 
m.ost related to murine LOK (GE:DS972S) with 66.2% amino .-acid 
identity, respectively. Within the proline-rich region of 
human GEK2 is a potential ^'PxxF" SH3-binding domain 
conserved with miurine LOK. 

Immediately C-term.inai to the proline-rich region of 
human GEK2 is a 2 52 amino acid region predicted to form a 
coiled-coil structure based on the Lupas algorithm. This 
region of human GEK2 is m;OSt: related to rat ATl-46 
(GB:D39728), murine LOK (GB:D8972£) and human SLK 
(GB : AB002804 ; , and ZC2 'SEQ ID MO: 14), sharing 90.8%, 86.9%, 
42.2%, and 2 9.7% ammo acid identity, respectively. 
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immediately C-terminal to the predicted coiied-coil 
szruccure of human GEK2 is a second pocential coiled-coil 
structure of 244 amino acids predicted based on che Lupas 
algorithm. This region of human GEK2 is most related to rat 
ATl-46 (GB:DS972S) and murine LOK (GE:BS972S) as well as to 
human SLK {GE : AB002304 ) and ZCl (SSQ ID NO:13), sharing 
91.8%, 92.6%, 70.4% and 26.7% amino acid identity, 
respectively. The C. elegans kinase (G3:Z814oO) shares 
amino acid sequence identity over this region. 



oj. . ar 



Mamma 1 i an PA K4 

The 3 604 bp human PAK4 nucleotide sequence encodes a 
polypeptide of 681 amino acids (SEQ ID MO: 29) with a 
predicted molecular mass of 74,875 daltons . Analysis of the 
deduced am.ino acid sequence predicts PAK4 to be an 
incracellular serine/ threonine kinase, lacking both a signal 
sequence and transmembrane domain. The full-length PAK4 
protein contains a 51 amino acid N-terminus predicted to 
contain a rac-binding motif, a 173 amiino acid insert 
20 relative to the known mammalian PAK proteins, a 16 9 amino 

acid spacer region, a 265 amino acid catalytic domain with 
ai^ the mocifs characcerist ic of a serine,/ threonine kinase 
and a 23 amino acid G-term.inal tail. 

PAK4 is most closely related to xhuman PAK5 (SEQ ID 
NO:30), FAKl (QB : U24152), and PAK65 (GB:U25975), as well as 
to a C. elegans kinase (GB: Z74029), sharing 76.8%, 49.5%, 
49.8%, and 34.6% amdno acid identity, respect ivelv . 

The 51 amino acid N-termiinal domain of human PAK4 is 
. m.ost related to human PAKl (GB:U24152), and PAK65 ■ 
(GB : U2 5975 ) , as v;ell as to a C, elegans kinase (GB: Z74029), 
sharing 50.0%, 50.0% and 49.0% amino acid identity, 
respectively.. 
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The 11 amine acid region at positions 13-2 3 of human 
PAK4 fits Che consensus for a Cdc4 2 /Rac -binding rrocif 
(SXPX4-6HXXH) (Burbelc, P.D., Dreschel, D. and Hall, A. J. 
Bio, Chem. 270, 29071-29074 (1995)). 

The 173 amino acid insert of human PAK4 , relative zo 
the known mammalian PAK proteins, is most relaced to a C. 
elegrans kinase (GB: Z74029) with 39.0% amino acid identity. 

A Smd th- Watermian search of the nonredundant procein 
database does not reveal any significant homologies:, zhaz 
might suggest: a pocential function for this region. 

The 169 amino acid spacer of human PAK4 does noc reveal 
any significant hcm.ologies that might suggest a pocential 
function for chis region. 

The equivalenc spacer region in PAKl binds to che 
guanine nucleocide exchange faccor PIX (Manser, E. et al 
(1996) Molecular Cell, 1, 133-192). Since PAK4 differs 
substantially from PAKl over this region, the spacer domain 
of PAK4 may differ in its guanine nucleotide exchange faccor 
binding specificity, relative to PAKl. 

The 2 65 amino acid catalytic domain of human PAK4 is 
mosc related co human ?AK5 (SEQ ID NO:30), PAKl (G3:U24152), 
GCK (GB:U07349), SOK-1 (GB:X99325), and SLK {GB : ABO 02 8 04 ) , 
as well as to the C. elegans {GB : 274029), and S. cere^^isiae 
STE20-related kinases (G3:L04655), sharing 95.9%, 51.7%, 
41.3%, 39,8%, 37.4%, 50.2% and 47.9% amino acid -identity, ' 
respectively. PAK4 contains the potential ^'^tpY" regulatory 
phosphorylation site in its activacion loop. This "TPY" 
motif is conserved in other STE2 0 - related kinases, including 
ZCi, ZG2, ZC3, ZC4 , GEK2 , KHS2 , SULL'l , SULU3 and PAK5 . 

The 23 amino acid C-tail of hum^an PAK4 contains a 
sequence that is homologous co a G-procein beca subunit 
binding site (Leeuw, T. et al , Nature, 391, 191-195 (1998)). 
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PAK4 has, therefore, -he potencial to be activated cv both 
Cac42- as well as G-prctein-dependant pathv/ays . 

M a mma 1 i a n F A K R 

The 2,806 bp human FAK5 nucleotide sequence of the 
complete cDNA encodes a polypeptide of 591 amino acids (SEQ 
ID NO: 103) v/ith a predicted molecular mass of 64,071 
Daltons. Analysis of the deduced amino acid sequence 
predicts PAKE to be an intracellular STE2C -subfamily kinase, 
lacking both a signal sequence and transmembrane domain. 
The full-length PAK5 protein contains a 52 amino acid N- 
terminus predicted to contain a p21 (small G-protein) 
binding domain (PDB or CRIB), a 121 amino acid insert 
relative to the known mammalian PAK proteins, a 13 4 amino 
spacer region, a 265 amino acid catalytic domain- wirh all 
the motifs characteristic of a serine/ thi-eonine kinase and a 
19 amino acid C-terminal tail. 

?AK5 is most closely related to Hum.an PAK4 (SEQ . ID 
NO: 29), Drosaphila melanogaster PAK (also known as "mushroom 
bodies tiny") (AJ01i578}, C45Bll.lb from C. elegans 
(Z74029) , and human PAK3 (Q13177) sharing 48% (327/574 aa) , 
50% (330/651 aa) , 43% (234/435 aa excluding gap) , and 47% 
(190/405 aa excluding gap) amino acid identitv, 
respectively. Recently, a full length version of PAK5 was 
reported (PAK4 AF005046) whose 591 amino acid sequence is 
identical to PAK5 (SEQ ID NO:103). (Abo, ec ai . (1998) EMBO 
u . 17 : 6527-6540) . 

The 52 amin.o acid N- terminal domain of human PAK5 is 
mosz related to human ?A?:4 (SEQ ID NO: 29), Drosophila 
melanogaster PAK (AJ01157d), C45Bil.ib from C. elegans 
(Z74029) , and human PAK3 (Q13177), sharing 65%, 57%, 54%, 
ana 53% am.ino acid identity, respectively. 
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The 11 amino acid region ac positions 12-22 cf human 
PAK5 (FIG. IS) fits zhe consensus for a small G-protein 
binding domain ( PDB or CRIB) ( SXPX4 - 6HXXH) (Burbelo, P.D., 
Dreschel, D. and Hall, A. J. Bio. Chem. 270, 29071-29074 

(1995) , hereby incorporated by reference herein in its 
entirety including any figures, tables, or drawings.) . 

The 121 am.inc acid insert of human PAK5 shares 43% 
am.ino acid identity with a sim.ilar domain from FAK4 (SEQ ID 
NO: 29) , buc chac is absent from other known PAKs . 

The equivalent spacer region in PAKl binds to zhe 
guanine nucleotide exchange factor PIX (Manser, E et al 

(1993) Molecular Cell, 1, 183-192 hereby incorporated by 
reference herein in its entirety including any drawings, 
figures, or tables.). Since PAK5 differs substantially from 
PAKl over this region, the spacer domain of PAK5 may- differ 
in its guanine nucleotide exchange factor binding 
specificity, relative to PAKl. 

The 134 am^ino acid col lagen- 1 ike region of human PAK5 
shares 34% amino acid identiny to pro-a I cype collagen from, 
several species and is not present in other known PAKs. 

The 265 amino acid canalytic domain of hum^an PAK5 is 
mosc related to hum.an PA.K4 (3EQ ID NO: 29) , Drosophila 
melanogascer PAK (AJ011578), C45Bll.lb from C. elegans 
(Z74029) , and human PAK3 (Q13177), sharing 78%, 80%, 61%, 
and 55% amino acid identity, respectively. PAK5 also 
contains the potential "TPY" regulatory phosphorylation site 
in its activation loop. This "TPY" motif is conserved in 
other STE20-related kinases , including ZCl, ZC2 , ZC3 , ZC4 , 
GEK2 , KHS2 , SULUl , SULU3 and PAK4 . 

The 19 amino acid C-tail shares 80% amino acid identity 
to a PAK-like homologue idencified from gencm.ic DiSFA 
{AL031652)'. Furthermore, this C-terminal region of hum.an 
PAKS contains a sequence that is homiologous to a G-protein 
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beca subunit: binding site (Leeuw, T. ec al . Nature, 391, 
191-195 (199S) hereby incorpox~atad by reference herein in 
ICS entirely including any figures, tables, or drawings) 
PAK5 has, therefore, the pocencial to be activated bv both, 
Cdc42 as well as G-protein-dependant pathways. 

^* Antibodies, Hvbridomas , Methods of Use and Kics for 
Detection of STE2 0 -Relaced Kinases 

The present: invention relates to an antibody havinc 
binding affinity to a kinase of the invenrion. The 
polypeptide may have the amino acid sequence set forth in 
SEQ ID MO: 5, SEQ ID NO : 6 , SEQ ID NO : 7 , 3EQ ID NO: 13, SEQ ID 
N0:14, SEQ ID NO:15, SEQ ID NO:18, SEQ ID NO : 2 2 , SEQ ID 
NO: 23, SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID NO : 9 7 , SEQ ID 
NO: 99, SEQ ID NO: 101, SEQ ID NO: 103, SEQ ID NO: 105, or SEQ 
ID NO: 107, or a functional derivative thereof, or at least 9 
contiguous amino acids thereof (preferably, at least 20, 30, 
35, or 40 or more contiguous amino acids thereof). 

The present invention also relates to an antibody 
having specific binding affinity to a kinase of che 
invention. Such an antibody may be isolated by comparing 
its binding affinity to a kinase of the invention with irs 
binding affinity to other polypeptides. Those which bind 
selectively to a kinase of the invention would be chosen for 
use m methods requiring a distinction between a kinase of 
the invention and other polypeptides. Such methods could 
include, but should not be lim.ited to, the analvsis of 
alcered kinase expression in tissue containing other 
polypeptides . 

Tne STE20 -Related kinases of che present invention can 
be used in a variety of procedures and methods, such as for 
the generation of antibodies, for use in identifving 

102. 
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pharmaceu- ical composi t: icns ; and for studying DNA/protein 
inceract ion . 

The kinases of the present invention can be used to 
produce antibodies or hybi-idomas . One skilled in the arr 
^ will recognize that if an antibody is desired, such a 

peptide could be generated as described herein and used as 
an imrnunogen . The antibodies of the presenc invention 
include rr.oncclonal and polyclonal antibodies, as well 
fragments of these antibodies, and humanized forms. 

10 Humanized form.s of the antibodies of the presenc invention 

may be generated using one of the procedures known in the 
art such as chim.eri zat ion or CDR grafting. 

The present invention also relaces to a hybridoma which 
produces the above -described monoclonal antibody, or binding 

15 fragm^enc thereof. A hybridcmia is an imm.orcal i zed cell line 

which is capable of secreting a specific monoclonal 
antibody. 

In general, techniques for preparing monoclonal 
antibodies and hybridomas are well knov;n in the art 

20 (Campbell, "Monoclonal Antibody Technology: Laboratory 

Techniques m Biochemistry and Molecular Biology, " Elsevier 
Science Publishers, Amsterdam, The Netherlands, 1934; St. 
Groth ec al . , J, Immunol. Methods 35:1-21, 19S0). Any 
animal (mouse, rabbit, and the like) which is known to 

25 produce antibodies can be immunized v/ith the selected 

polypeptide. Methods for imimunizat ion are well known m the 
art. Such methods include subcutaneous or int raoeri toneal 
injection of the polypeptide. One. skilled m the art v/ill 
recognize that the amount of polypeptide used for 

30 immunization v;ill vary based on the animial v/hich is 

immunized, the antigenicity of the polypeptide and the site 
of injection. 

10:^ 
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The polypeptide may be. modified or administered in an 
adjuvant in order to increase the pepcide ancigenicicy . 
Methods of increasing the antigenici-y of a polypepride are 
well known in the art. Such procedures include coupling che 
antigen with a heterologous protein (such as globulin or fi- 
galaccosidase} or through the inclusion of an adiuvanc 
during immunization. 

For monoclonal antibodies, spleen cells from the 
im.munized animals are removed, fused with myeloma cells, 
such as SF2/C-Agl4 myeloma cells, and allowed to become 
monoclonal antibody producing hybridoma cells. Any one of a 
number of m.echods well known in the art can be used to 
identify the hybridom.a cell which produces an antibody with 
tne desired characteristics. These include screening the 
hybridom.as with an BLISA assay, vvestern blor analysis, or 
radioimmunoassay (Lutz et ai . , Exp. Cell Res. 175:109-124, 
1988) . Kybridomas secreting the desired antibodies are 
c_oned and the class and subclass are decermined using 
procedures known in the art (Campbell, "Monoclonal Ancibodv 
Technology: Laboratory Techniques in Biochemistry and 
Molecular Biology", supra, 1984). 

For polyclonal antibodies, antibody- containing antisera 
is isolated from the immunized animal and is screened for 
rhe presence of antibodies wich the desired specificity 
using one of the above -described procedures. The above- 
described antibodies may be decectably labeled. Antibodies 
can be detectably labeled through the use of radioisotopes, 
affinity labels (such as biotin, avidin, and the like) , 
enzymatic labels (such as horse radish peroxidase, alkaline 
phosphatase, and che like) fluorescent labels (such as FITC 
or rnodamine, and the like), paramagnetic atom.s, and the 
like. Procedures for accomplishing such labeling are weli- 
Known in the art, for example, see Stembjerger et ai., J. 

(04 
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Hiscocheni. Cytochem. 18:315,-1970; Bayer et al . , Mezh. 
Enzym. 62:308-, 1979; Engval et al . , Immunol. 109:129-, 
1972 ; Coding, J. Immunol . _iMeth . 13:215-, 1976. The labeled 
antibodies of the present invention can be used for in 
5 \^itro, in vivo, and in sicu assays to identify ceils or 

tissues which express a specific peptide. 

The above-described antibodies may also be immobilized 
on a solid support. Examples of such solid supports includ 
plastics such as polycarbonate, complex carbohydrates such 

10 as agarose and sepharose, acrylic resins and such as 

polyacrylamide and latex beads. Techniques for coupling 
antibodies co such solid supports are well known in the art 
(Weir et al . , "Handbook of Experimental Immunology" 4th Ed. 
Blackwell Scientific Publications, Oxford, England, Chapter 

15 10, 1986; Jacoby et ai . , Meth. Enzym.. 34, Academic Press, 

N.Y., 1974). The immobilized antibodies of the present 
invention can be used for in vitro, in vivo, and in situ 
assays as well as in imtmunochromotography . 

Furcherm.ore , one skilled in the art can readily adapt 

20 currently available procedures, as well as the • techniques , 

m^ethods and kitis disclosed herein with regard to antibodies 
to generate peptides capable of binding co a specific 
peptide sequence in order to generate rationally designed 
antipeptide peptides (Hurby et al,. , "Application of 

25 Synthetic Peptides: Antisense Peptides In Synthetic 

Peptides, A User's Guide, W.H. Freemian, NY, pp. '289-307, 
1992; Kaspczak ec al . , Biochemiistry 28:9230-9238, 1989). 

Anti-peptide peptides can be generated by replacing the 
basic amino acid residues found in the peptide secuences of 

30 .the kinases of the invention with acidic residues, while 

maintaining hydrophobic and uncharged polar groups. For 
example, lysine, arginine, and/or histidine residues are 

lOS 
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replaced with aspartic acid or glutamic acid and glutamic 
acid residues are replaced by lysine, arginine or hisridme. 

Tne present invention also encompasses a method of 
detecting a 3TE20-related kinase polypeptide in a sample, 
comprising: (a) contacting the sample with an above- 
described antibody, under conditions such that 
immunocomplexes form, and (b) detecting the presence of said 
antibody bound to the polypeptide. In detail, the methods 
comprise incubating a cest sample with one or more of the 
antibodies of the present invention and assaying whether the 
antibody binds to the test sample. Altered levels of a 
kinase of the, invention in a sample as ccm.pared to normal 
levels m.ay indicate disease. 

Conditions for incubating an antibody with a test 
sample vary. Incubation conditions depend on the format 
employed in the assay, the detection m.ethods employed, and 
the type and nature of the antibody used in the assav. One 
skilled in the art will recognize that any one of the 
commonly available immunological assay formats (such as 
radioimmunoassays, enzyme -linked im.munosorbent assays, 
diffusion based Ouchterlony, or rocket immunof luorescent 
assays) can readily be adapted to employ the antibodies of 
the present invention. Examples of such assays can be found 
in Chard ("An Introduction to Radioimm.unoassay and Related 
Techniques" Elsevier Science Publishers, .^Imsterdam, The 
Netherlands, 1986), Bullock et al . ("Techniques in 
Immunocytochemistry, " Academic Press, Orlando, FL Vol. i, 
1932; Vol. 2, 1985; Vol. 3, 1965), Tijssen ("Practice and 
Theory of Enzyme Immunoassays: Laboracory Techniaues in 
Bicchem.istry and Molecular Biology," Elsevier Science 
Publishers, Am^sterdam, The Netherlands, 1985) . 

The im.m.unological assay test samples of the present 
invention include cells, protein or memJorane extracts of 

lOli) 
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cells, or biological fluids such as blood, serum, plasma, or 
urine. The -est samples used in che above-described method 
will vary based on the assay format, nature of the detection 
m.echod and the tissues, cells or excraccs used as the samole 
5 to be assayed. Methods for preparing protein extracts or 

m.em.brane extracts of cells are well known in che arc and can 
be readily be adapted in order to obtain a sample which is 
testable v/ith che system, utilized. 

A kit contains all che necessary reagencs to carrv cut 
10 the previously described m.ethods of detection. The kic m.av 

comprise: (i) a firsc container means containing an above- 
described antibody, and (ii) second concainer means 
containing a conjugace comprising a binding partner of the 
antibody and a label. In another preferred emibodiment , che 
15 kic further comprises one or m.ore other containers 

comprising one or m.ore of the following: wash reagents and 
reagencs capable of detecting che presence of bound 
ant ibodies , 

Examples of dececcion reagents include, but are noc 
20 limited to, labeled secondary antibodies, or in the • 

alternative, if the prim.ary ancibody is labeled, the 
chromophoric , enzymatic, or antibody binding reagencs which 
are capable of reaccing with the labeled antibody. The 
comparcmencali zed krc m.ay be as described above for nucleic 
25 acid probe kits. One skilled in che art will readily 

recognize chat the antibodies described in the present 
invention can readily be incorporated. into one of the 
established kic formacs v/hich are well known in the -arc. 
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-^sola-ion of rn moounda Which Tnterarr wjch 5T52n- 
Relac ed Kina?si=^.=: 

The present: invention v-st=)--c- _ . , . 

aececting a compound capable of binding to a STE2 0-relaced 
kinase of the invention comprising incubating the compound 
with a kinase of the invention and detecting the presence ci 
the compound bound to the kinase. The compound may be 
present within a complex mixture, for example, serum, bcdv 
fluid, or cell extracts. 

The present invention also relates to a m^ethod of 
aetecting an agonist or antagonist of kinase activity or 
kinase binding partner activity comprising incubatino cells 
that produce a kinase of the invention in the presence of a 
compound and detecting changes in the level of kinase 
activity or kinase binding partner activity. The compounds 
tnus identified would produce a change in activity 
indicative of the presence of the compound. The compound 
n-.ay be present within a complex mixture, for examole, serum, 
body fluid, or cell extracts. Once the compound i 
identified it can be isolated using techniques well know 
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The present invencion also encompasses a method of 
agonizing (stimulating) or antagonizing kinase associated 
ac-ivity in a m:ammal comprising administering to said mammal 
an agonist or antagonist to a kinase of the invention m an 
amount sufficient to effect said agonism or antagonism. a 
rr.ethod of treating diseases in a m.amm.al with an agonist or 
antagonist of STE20 -related kinase activity comprising 
administering the agonist or antagonist to a mamm.al in an 

amount surf i'^-; <=^rr f-,^, 

1.^ -ci^on.ze or ant agonize STE2 0- related 

kinase associated functions is also encompassed m.the 

present application . 
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In an efforc to discover novel treacments for diseases, 

biofP-edical researchers and chemises have designed, 
synthesized, and tested molecules that inhibit che function 
of protein kinases. Some sm.all organic molecules form a 
5 class of comtpounds thac miodulace che function of protein 

kinases. Examples of m.olecules that have been reported to 
inhibit the function of protein kinases include, but are not 
limited to, bis monocyclic, bicyclic or heterocyclic aryl 
comipounds {?CT WO 92/20642, published November 26, 19 92 by 

10 Maguire ec al . ) , vinylene-azaindole derivatives ( PCT WC 

94/14S0S, published July 1, 1994 by Ballinari et 3.1.), 1- 
cyclopropyl -4 -pyridyl -quinolones (U.S. Patent No . 
5,330,992), styryl compounds (U.S. Patent No. 5,217,999), 
styrv*l - substituted pyridyl compounds (U.S. Patent No. 

15 5,302,506), certain quinazoline derivatives (EP Application 

Nc . 0 565 266 Al), seleoindoles and selenides (PCT WO 
94/03427, published February 17, 1994 by Denny et al . ) , 
tricyclic pclyhydroxyl ic comipounds (PCT WO 92/21660, 
published December 10, 1992 by Dov.O , and benzylphosphonic . 

20 acid compounds (PCT WO 91/15495, published October 17, 1991 

by Dcv; et' al) . 

Compounds that can traverse cell membranes and are, 
resistant to acid hydrolysis are potentially advantageous as 
therapeutics as they can becom.e highly bioavailable after 

25 being adm.inis tered orally to patients. However, many of 

these protein kinase :.nhibitor5 only weakly inhibit the 
function of protein kinases. In addition, many inhibit a 
variety of orotein kinases and v;ill cause multiple side- 
effects as therapeutics for diseases, 

30 Some ■ mdol inone compounds, hov/ever, form^ classes of 

acid resistant and memJt^rane perm^eable organic molecules. VJO 
96/22976 (published August 1, 1996 by Ballinari ec ai , ) 
describes hvdrosoluble indol inone comipounds that harbor 
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cecraiin, naphchaien.e, guinoline, and indole subs-ituencs 
fused CO Che oxindole ring. These bicyclic substicuencs are 
m turn substituted with polar moieties including 
hydroxylated alkyl, phosphate, and ether moieties. U.S. 
Patent Application Serial Nos . 08/702,232, filed August 23, 
1996, entitled "Indolinone Combinatorial Libraries and 
Related Products and Methods for the Treatment of Disease" 
by Tang et al . (Lyon & Lyon Docket No. 221/137) and 
08/435,323, filed June 7, 1995, entitled " Benzyl idene - Z - 
Indoline Compounds for the Treatment of Disease" by Tang et 
al. {Lyon & Lyon Docket No. 223/298) and International 
Patent Publication WO 96/22 976, published August 1, 1996 by 
Ballinari et al . , all of which are incorporated herein by 
reference in their entirety, including any drawings, 
describe indolinone chemical libraries of indolinone 
cori-pounds harboring other bicyclic moieties as well as 
monocyclic moieties fused to the oxindole . rina . 
Applications 08/702,252, filed August 23, 19S6, entitled 
"Indolinone Combinatorial Libraries and Related Products and 
Methods for the Treatment of Disease" by Tang at al . (Lyon & 
Lyon Docket No. 221/187), 08/435,323, filed June 7, 1995, 
entitled "Benzyl idene- Z- Indoline Compounds for the Treatment 
of Disease" by Tang et al . (Lyon u Lyon Docket No. 223/298), 
and V70 96/22976, published August 1, 1996 by Ballinari et 
al . teach methods of indolinone synthesis, methods of 
testing the biological activity of indolinone com.pounds in 
cells, and inhibition patterns of indolinone derivatives. 

Other examples of substances capable of modulatincf 
kinase activity include, but are not limited to, 
■0 tyrphcstins, qumazclines, qumoxol ines , and quinolmes. 

The quinazolines, tyrphcstins, qui no lines, and qumo.xol i.nes 
referred to above include well known compounds such as those 
described in the literature. For example, representative 

Ho 
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publications describing quinazolines include Barker ec al . , 

EPO Publication Mo. 0 520 722 Ai ; Jones et ai . , U.S. Pacenc 
No . 4 , 44 7 , 60S ; Kabbe et ai . ; U.S. Patent No. 4,757,072; Kaui 
and Vcugioukas, U.S. Patent No. 5, 316,553; Kreighbaum and 
5 Comer, U.S. Pacent No. 4,343,940; Pegg and Wardleworth, EPO 

Publication No. 0 562 734 AI ; Barker en al . , Proc . of Am. 
Assoc. for Cancer Research 32:327 (1991); Berrino, J.R., 
Cancer Research 3:293-304 (1979); Bercino, J.R., Cancer 
Research 9(2 part l}:293-304 (1979); Curtin et al . , Br . J . 

10 Cancer 53:361-368 (19S6); Fernandes et al . , Cancer Research 

43:1117-1123 (1933); Ferris ec al . J. Pro. Chem. 44(2) :173- 
17S; Fry et al . , Science 265:1093-1095 (1994); Jackman et 
al., Cancer Research 51:5579-5536 (1981); Jones ec al . J . 
Med. Cheni. 2 9 ( 6 ) : 1 1 14 - 1 1 IS ; Lee and Skibo, BiocheiT;isr rv ■ 

15 26(23) :735 5 -7362 (1987); Lemus et al . , J. Pro. Cherv. , 

54 : 3511-'351S (1989); Ley and Seng, Synthesis 1975:415-522 
(1975); Maxv/ell et al . , Magnetic Resonance in Medicine 
17:139-196 (1991); Mini ec al . , Cancer Research 45:325-330 
(1985) ; Phillips and Castle, J. Heterocyclic Chem. 

20 17 (19) : 1489-1596 (1980); Reece et al . , Cancer Research 

47 (11) : 2996-2999 (1977); Sculier et al . , Cancer I mrr.u no 1 . and 
Immunother . 23:A65 (1986); Sikora et al . , Cancer Leccers 
23:285-295 (1984); Si.kora et al . , Analyt ical Biochem. 
172:344-355 (1988); all of which are incorporated herein by 

25 reference in their entirety, including any drav/mgs . 

Quinoxaline is described in Kaul and Vougioukas, U.S. 
Patent No. 5,316,553, inccrporaced herein by reference in 
iZ3 entirety, including any drav/ings . 

Quinolines are described in Dolle ez al . , J. Med. Chem. 

30 37:2627-2629 (1994;; MaGuire, J. Med. Cherr.. 37:2129-2131 

(1994); Burke et al . , J. Med. Chem. 36:425-432 (1993); and 
Burke et al . . BioOroanic Med. Chem. Lecters 2:1771-1774 

Ml 
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(1992), all of v.'hich are incorporated by reference in cheir 
encirecy, including any drawings. 

Tyrphostins are described in Allen et al . , Clin. Sxd . 
Immunol . 91:141-155 (1993); Anafi et al . , Blood 82:12:3524- 
3529 (1993); Baker et al . , J. Cell Sci . 102:543-555 (1992); 
Bilder et al . , Amer. Phvsiol. Soc. pp. 63 63 - S143 : C72 1 -C730 
(1991); Brunton et al . , Proceedings of Amer. Assoc. r^.nr^y 
Rsch^ 33:558 (1992); Ervckaert et al . , Bxoerimental Cell 
Research 199:255-261 (1992); Dong et al . , J. Leukocyte 
Biology 53:53-60 (1993); Dong et al . , J. Immunol . 
151(5) :2717-2724 (1993); Gazit et al . , J. Med. ChRm 
32:2344-2352 (1989); C-azit et al . , " J. Med. Chem . 36:3556- 
3564 ;i993} ; Kaur et al . , Ant i -Cancer Drugs 5:213-222 
(1994); Kaur et al . , King et al . , Biochem. J. 275:413-418 
(1991); Kuo et al.. Cancer Letter.^ 74:197-202 (1993); 
Levitzki, A., The F.a.SEB J. 6:3275-3282 (1992); Lyall et al . , 
J. Biol. Chem. 264:14 503-14 509 (1989); Peterson et al . , The 
Prostate 22:335-345 (1993); Pillemer et al . , Int. J. Canc^-- 
50:80-35 (1992); Posner et al . , .Molecular Pharmacol noy 
20 45:673-633 (1993); Rendu et al . , Biol. Pharm.acol ogy 

44 (5) :381-8S8 (1992); Sauro and Thomas, Life Scienr-pc, 
53:371-376 (1993); Sauro and Thomas, J . Pharm . and 
Experimental Therapeutics 2 67 ( ^ ) • i i Q - n 9 ^ (1993); Woibring 
al-' J. Biol. Chf^m. 269 (36) :22470-22472 (1994); and 
25 Yoneda et al . , Cancer R esearch 51:4430-4435 (1991); .all of 

which are incorporated herein by reference in their 
entirety, including any drav;ings. 

Other compounds that could be used as modulators 
include oxindol inones such as those described in U.S. patent 
application Serial No. 08/702,232 filed August 23, 1995, 
incorporated herein by reference in its entirety, including 
any drawings . 
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VII . Biological Significance . AdcI ican ions and Clinical 

Relevance of Novel STE20"Relav:ed Kinases 
Human STLK2 , STLK3 , STLK4 , STLK5 , STLK6 , and STL.K7 

STLK2, STLK4, STLK5 , 3TLK6 and STLK7 belong to an 
5 expanding family of intracellular STKs that have varying* 

degrees of sequence homology to SOK-1, a kinase implicated 
in oxidative 'stress agencs ( Pom±>o , CM et al, ZMBO J. (17) 
4537-4546, 1996). Our daca shoves chat STLK2 is expressed 
highly in hem.atopoiet ic cells. Therefore, STLP:2 may 

10 participate in the oxidative response pathway during • 

inf iammacicn . In addition, STLK2 could also be a possible 
component in tihe signaling pathways leading to T cell 
activacion. High levels of STLK2 in several tum.or cell 
lines could also imply that STLK2 might be involved in 

15 cumor igenes i s - 

STLK2 is m.ost closely related co two human STE20- 
. subfamily kinases: MST3 and SOK-1. MST3 is a 52,000 

daltrons cytoplasmic kinase that is ubiquitously expressed 
v;i-h its highest levels of expression found in heart, 

20 skeletal muscle and pancreas. The serine/ threonine kinase 

activicy of MST3 is activated by phosphorylacion . unlike 
SOK-1, MST3 prefers Mn'^ over Mg^ ^ and can use both GTP and 
ATP as phosphate donors. MST3 may undergo dimeri zat ion . No 
agonists have yet been identified- that activate MST.3 . The 

2d downstream, signaling mechanism c^ this kinase is unknown 

(Schinkmann, K and Blenis, J. (1997) J.. Biol. Chem;- 272, 
23695- 2 S703). 

, SOK-1 is a 50,000 dalcons cytoplasm.ic kinase expressed 
predominantly in testis, large mcestine, brain and stomach 
30 and co a lesser extent in hearc and lung. 30K-1 is also 

expressed in the germ.inal center B-cell line (RAMOS) and m 
a miacure B cell line (HS Sultan) , The serine,/ threonine 
kinase accivicy of 30K- 1 is activated by ohosohoryl at ion . 
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The C- terminus of SOK-1 has been shown co be inhibitory cc 
the catalytic activity of this kinase. The only agonisrs 
knov.-n to aciiivate SOK-1 are oxidant agents, like and 
menadione, a quinone that is a potent intracellular 
5 generator of reactive oxygen species (Pombo, CM, et ai . 

EM3C J. 15, 4537-4546) . SOK-1 is also activated by chemical 
anoxia through the generation of reactive oxygen species and 
release of calciurr: inco the cytoplasm from incracellular 
stores. SOK-1, therefore, may play an imporcanc role in 

10 ischemia, the cause of myocardial infarction, stroke and 

acuce renal failure (PoirJoo, C..M. et ai . J. Biol. Chem. 272, 
29372-29379 (1997)). The activity of SOK-1 in the response 
to oxidant stress is inversely correlated with the activicv 
of the stress -act ivated protein kinases (SAPKs) : elevated 

15 SOK-1 activity correlates with absent SAPK activity and 

vice-versa. SOK-1 does not activate any of the four MAP 
kinase pathways, SAPKs, p3 8, ERK-1 or MEK-5/ERK-5 (Pombo, 
CM- et al. EMBC J. 15, 4537-4546). The downstream 
signaling mechanism of this kinase remains unknown. 

20 STLK2 is expressed in a wide variety of immune cell 

types and tissues including thym.us, dendrocytes, mast cells, 
monocytes, E cells (primary, Jurkat , RPMI , SR) , T cells 
(CD8/CD4-, THl, TH2 , CEM , MOLT4 ) and megakaryocytes (K562), 
whereas STLK3 is restricted to thymius and STLK4 is 

25 predominately expressed in thym.us, T cells (CD4/CD8+, THl, 

GEM) and 3 cells (Jurkat, RPMI). Consequently, chese STKs 
might participate in the oxidative response pathway during 
inflammation, reperfusion injury (stroke, surgery, shock), 
TNFa- mediated signaling , insulin desensi t i zat ion , 

30 atherogenesis , vascular injury, T or B cell cost imulation, 

or alternatively, participate in other MAPK-related signal 
transduction processes . 
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STLK5 is more distantly related to this STE20 -subfamily 
including SOK-1 and STLK2 , STLK3 and STLK4 . STLK5 , may 
therefore mediate a signaling pathway that is distinc- from 
the oxidative stress response pathway. 

The high degree of sequence homology in the C-term.ini 
of SOK-1; STLK2, STLK3 , STLK4 , STLK5 , and STLK6 raises che 
possibility that these novel STKs, like SOK-1, may be 
subject to autoinhibicion through a conserved C- terminal 
motif . 



Human ZCl , ZC2 , ZC3 and ZC4 

2C1 is a good candidate for any disease in which 
tyrosine kinase, cytokine, or hecerocrimer ic G-procein 
coupled receptors have been implicaced. The mouse homologue 

15 binds to NCK, and is recruited to activated PDGF (Su et 

al.,EM30 16: 1279-1290, 1997). The Drosophila homolog has 
been shov;n tc bind to TRAF2 , implicating it in TNF-a 
signaling (Liu et al . , (1999) Curr . Biol. 9:101-104, 1999)). 
While ZCl does not contain the exact NCK- and TRAF2 -binding 

20 domains,- it is likely to bind to related proteins. 

Of uhe ZC subfamily of 3TE20 -related protein kinases, 
ZCl has very broad over-expression in many tumor types, 
suggesting that it may be involved in cellular growth, 
transformation, or tumor progression. A truncated, form of 

25 ZCl containing only the C-termiinal putative MEKKl -binding ' 

domain was found zo reduce the numiber of foci generated by 
H-Ras-V12 in Rat Intestinal Epithelial cells (RIE-1) . These 
data ' indicace that ZCl may play a role in the ability for 
these cells to overcom.e contact inhibition and anchorage- 

30 dependent growth. 

The- ZCl homolog. Misshapen imsn) in Drosophila 
melanogas ter v;as cloned as a resulc of compiemient ing a 
mutacion in a develooment al oachwav reouired for dorsal 
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Closure, a process involving changes in cell shape and 
position in the embryo (Treisman et al. Gene 186 119-125, 
1997) . A D, melanogaster homolog of the uNKl/vJNK2 kinases 
from mammals was shown zo function downstream of rnsn in the 
5 dorsal - closure signaling pathway (Su et al . Genes Dev. 

12 : 2371 -2380, 1998) . 

While ZCl could be involved in multiple aspecns of 
tum.origenesis , by analogy with Drasophila. , the role of 
m.isshapen in dorsal closure suggests a critical role in the 

10 regulation of che cytoskelecon for che processes of cell 

attachm.ent, cell movement and perhaps migration. 

The association of the ZCl family members msn and NIK 
wich TRAF2 may indicate a role for this kinase in cell 
survival and/or in apoptosis. The ZCl family contains a 

15 highly conserved domain that in the mouse homolog, NIK, has 

been shown to bind to MEKKl (Mi togen-act ivated/ 
Extracellular-regulated Kinase Kinase 1) (Su et al . , (1997) 
EM30 16(6): 1279-90). MEKKl is involved in cell survival 
and/or apoptosis in several systems (Schlesinger et al . , 

20 Front. Biosci . 3 :D1181-6 , 1998). Depending on the context, 

MEKKl appears to be upstream of either the ERKl/MAPK or rhe 
JNK/3APK pathway [Schlesinger et al . , (1998 Front. Biosci. 
3:DllSl-6) . Three homiolcgues of ZCl: m^urine NIK (NCK- 
interacting kinase) (Su et al.EMBO 16:1279-90, 1997), 

25 Drosophila msn (Liu et al.Curr. Biol. 9:101-104, 1999) and 

human HGK (HPK/GCK- 1 ike kinase) (Yao et al . , J. Biol. Chem. 
274:2118-25, 1999) have all been shown to activate the JNK 
pathway when over-expressed in 293T cells. 

ZCl shares a high degree of homology with these onher 

30 family members in both the kinase domain and the "MEKK"- 

binding dom.ains, yet it differs in the intervening region, 
which contains several putative binding domains for upseream 
signaling adapter molecules (e.g. NCK, TRAF2 ) . Unlike the 

1 10 
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ccher family members, ZCi does not appear to accivace che 
JNK pathway in 293T cells as seen by its ability to induce 
expression of either a JUN or ATF2 -driven lucif erase gene. 
Upon CO- transf ect ion into these cells with HA-tagged JNK, 
modest activation of JNK was detected. ZCI also modestly 
activated co- transf ecced ERKl . Both the ERK and the JNK 
activation were very slight compared wich the positive 
controls in the assay (activated forms of MEKl and MEKKl , 
respectively) . In both cases, acnivanion required the full- 
length kinase. While the kinase domain alone is up to 5x 
more active in autophosphorylat ion and in phosphorylation of 
M3F, it does not lead to activation of these potential 
dovynstream kinases. . Based on the strong sequence homology 
of ZCI with che other family members, it is very likely that 
ZCI will be important for either JNK or ERK activation once 
the proper context is found. 

ZCI profoundly inhibits ERKl kinase expression in co- 
transf eccion assays. This effect is dependent on ZCI kinase 
activity, occurring v;ith the wild-type and the kinase domain 
alone, but not v/ith the kinase-dead mutant even though all 
three forms of ZCI are expressed az similar levels. This 
may suggest a role for this kinase in transcripc lonal or 
pose -transcriptional regulation. 

ZCI may be an important component in the signaling 
pathways mediated by the co- stimulatory receptor CD28 in T 
cells and/or by the pro- inf lamm^atory cytokine TNFa , since 
CO- transf ect ion of the wild-type ZCI activated the RE/AP- 
lucif erase and NFkB- luci f erase reporter genes. While our 
data showed that ZCI scrongly activates NFkB in T-cells, no 
activation of N?k3 driven luciferase was detectable in NIK 
3T3 cells. A recent paper (J. Biol. Chem. 274:2118-25; 
1999.) has shov;n thac a human ZCI splicing isoform, HGK, is 
involved in the TNFa-signalmg pathways. 
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Given che importance of T cell activation in 
aucoimmunity and nranspl ancac ion , as v.^eli as che key role 
chat TNFa plays in inf lanrimacory diseases, ic is Dossibie 
that ZCl could be a therapeutic target for imrriunological 
5 diseases which include but are not limited to; rheumatoid 

arthritus, chronic inflammatory bcivel diseases (ie Crohn's 
disease), chronic inf lammiatory pelvic disease, multiple 
sclerosis, asthma, osceoarthrit is , psoriasis, 

athercsciercsis , rhinitis, and autoimmunity as well as organ 
10 cranspiantarion and cardiovascular diseases. 

ZCl appears to be che human orchologue of murine NIK 
and possibly an orchologue of a C, eiegrans S.TE2 0 - subf amilv 
kinase encoded by the ZC504.4 cosmid. 

Murine NIK is a 140,000 daltons kinase that is most 
15 highly expressed in brain and heart. NIK inceraccs wich the 

SH3 domains of the adapcor molecule Nek through its prcline- 
rich regions found in the C-cerm.inal extra-cacalycic region. 

The specific regions thac mediate this interaction are cv;o 
?xxP motifs chat are nearly uniformly conserved between NIK, 
20 ZCl , 2 , 3 and the C. elegans STE2 0 ZC504.4 kinase. In 

addition, NIK binds MEKKl through its 719 amino acid C- 
terminal (Su, Y-C. et al . (1997) EM30 J. 16, 1279-1290). 
MEKKl is a menribrane- associated kinase responsible for 
activating MKK4 (also knov/n as SEKl) , which in turn 
25 activaces 3A?K (Yan, M ec al. (1994) Nature, 372, 798-800)-. 

NIK may funccion as a kinase thac links growth factor 
activated pathways and the stress - response pathway mediated 
by SAPKs . According co this hypothesis, activation of 
grov;ch factor receptors leads co receptor tyrosine 
30 phosphorylation. Nek binding co the phosphoryl aced cyrosmes 

via its SH2 domain, NIK redistribution co a mem.brane 
ccmpartm.ent via binding to the SH3 domain of Nek, and 
j uxcaposit ion to the membirane-associaced MEKKl. The NIK- 

lis 
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et al . (1997) EMBO J. 16. 1279-1290) . • 

Given the high homology between ZCl, ZC2 , ZC3 , and ZC4 
STKs and NIK, it is conceivable that chese kinases may each 
function to connect growth factor- and scress-act ivated 
signaling pathv/ays . The hecerogenei t y chat che ZC kinases 
exhibit within their pucative 3H3 -binding domain could 
provide signaling specificiny in terms of the nacure of the 
adapcor molecule that they bind. The high level of sequence 
conser-vat ion in the C-termdni of the ZCl , ZC2 and ZC3 
strongly suggests thac these human kinases, like murine NIK, 
also miay bind co MEKKl and activate SAFKs , The ZC kinases 
also display scrcng homology at their C- termini co procein 
domains that bind small GTFase proteins such as Rab, Rho and 
Rac. For example, the C-cerm^ini of ZCl is 36.2% idencical 
to citron, a m^urine Rho-binding protein, and 23.1% identical 
to the rab-binding region of GC kinase. This suggests that, 
in addition to adaptor molecules, small GTFase proteins m-ay 
also mediace membrane association and activation of the ZC 
kinases. The presence of a potencial coiled-coil region 
located immediately C-term;inal to the catalytic region 
strongly suggests thac the ZC kinases may also be subject co 
regulation via homo or hecerodimerizat ion events. 

The C. eieqrans STE20 ZC504.4 kinase is. the produce of 
the mig-15 gene. The product of this gene has been 
implicated in several development al processes such as 
epidermial developmenc, Q neuroblast m.igracions and m.uscle 
arm targeting in che developing worm (Zhu, X. and Hedgecock 
E. (1997) Worm Breeder's Gazette 14, 75) . The high level of 
sequence conservation becween che ZC kinases and the ZC504.4 
C. elegans kinase v;ill make C. '^1 eaans a valuable model 
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organism co study, chrough epistatic analysis, the signaling 
properties of the human ZC kinases. 

Human KHS2 

5 KHSl (kinase homologous to SPS1/5TE20) is a 100,000 

dalcon cytoplasmic STK that is expressed ubiquitously. KHSl 
has been implicated in the mechanism of SA?K activation in 
response to inflammatory cytokines such as TNFa as well as 
to ul t raviol ighc light, which also uses the TNF signaling 

10 pathv/ay. TNFa binding to its receptors (TMFRl and TNFR2) 

results in the sequential association with the receptor C- 
tail of multiple signaling molecules including TNFRl - 
associated, death domain protein (TRADD) , Fas-associated 
death dcm.ain protein (FADD or MCRTl) , TNFR-associated factor 

15 2 (TFJIF2), and the STK RIP (receptor interacting protein). 

The TF-ADD-TRA.F2 interaction is mediated by a conserved 
region present at the C- terminus of TRAF2 , the TPJIF domain. 

Activation of the NFkB and SAPK pathways is miediated by the 
ring finger motif present at the N-term.inus of TRAF2 (Curr. 

20 Opinion in Cell. Biol. (1997) 9:247-251). KHSl is activated 

by TISiFa stimulation in a TRAF2 -dependant rrianner and 
inhibition of KHSl blocks TMFa- induced SAPK activation but 
not NFkB activation. The m.echanism by v.'hich TRAF2 activates 
KHSl is not known. Cot rans f ec t ion of TRAF2- and KHSl- 

25 expressing constructs in 293T cells failed to reveal a 

direct association between these two mtolecules. KHSl 
activates the SAPK pathway by a direct association with the 
const itut ively active kinase MEKKl . MEKKl subsequently 
activates 5EK1 , which in turn activates SAPK. Neither the 

30 MAPK nor the p3S kinase pathways are activated by KHSl (Shi, 

C - S a d Ke ^'^''L J H (19 9 7/ -J Bio"' C e m 272 '^'2^0'^'- 
32107) . In addition to its catalytic domain, downstream 



BNSOOCID; <WO 9953036 A2JA> 



wo 99/53036 PCT/US99/08150 

signaling of KHSl requires its conserved C-cerminus (Diener, 
K. St a I (1997; Proc . Natl. Acad. Sci. 94, 9637-96 92/. 

GCK (germinal center kinase) is a consti tuc ively acciv-^ 
97,000 dalton STK that is broadly expressed. GCK nn.ay 
5 participate in B-cell differentiation since its expression 

is localized co the germinal center within lymphoid 
follicles. GCK activates the SAPK pathway in response to 
TNFcx via activation of SEKl . The upsnream activators of GCK 
in response co cytokines as well as the immediate downstream 

10 target of this kinase are unknown. The C- terminus of GCK is 

sufficient to activate SEKl (Pombo, CM. en al (1995) 
Nature, 377, 750-754). 

The miurine orthclogue of GCK, rabSip ( rabS - interaccing 
protein), is a 97,000 dalton protein that: f race ionaces with 

15 both the soluble cytoplasmic fraction as well as with a 

salt-sensitive fraction associated with the basolateral 
membrane of the crans-Golgi region in polarized MDCK 
epithelial cells. The C- terminus of rab8ip binds to rabS , a 
small GTP-binding protein required for vesicular transport 

20 from the Golgi apparatus (Ren, M. et al . (1996) Proc. Natl. 

Acad. Sci. 93, 5151-5155) . In addition to inducing the 
transcriptional activacion of cytokines like IL2 via SA.PK, 
GCK may also promote the rab-dependent: release of secretory 
proteins in response to TNFa (Buccione, R. et al (1995) Mcl . 

25 Bio. Cell 6, 291) . 

HPKl (hernia Tiopoietic protein kinase) is a 
constitut ively active 90,000 dalton STK resuricted to 
hemiatcpoie t ic cells. HPKl activates the 3A-PK pachway by 
directly binding to and accivacing MEKKl (Hu, M. ec al 

30 (1996) Genes and Dev . 10:2251-2264) as v/ell as the 

ubiquitously expressed m.ixed- 1 ineage kinase MLK-3 (Kiefer, 
F. ec ai (1996) EMBO J. 15:7013-7025). This function of 
HPKl recuires, in contrast co GCK, both its kinase domiain as 
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The uostrsam activators of H?K1 



well as i 



ts C-terminus. 



20 



25 



remain unknown. HPKl also plays a key role as a mediatcr of 
transforming growth factor-P (TGFp) signaling. HPKl 



stimulates the SAPK pathway by phosphorylat ing SEKl (Wang W. 
et ai (1997) J. Biol. Chem. 272:22771-22775). 

KHS2 is expressed in thymus, dendrocyces and monccyces. 
KKS2 could have a complementary function to that of KH31 as 
a mediator of SAPK activation in the cellular response to 
inflammatory cytokines. KKS2 could have the potential to 
interact directly with TRAF2 since a STK -with the predicted 
molecular v/eight of KHS2 (approximately 101,000 dalcons) is 
found in. .the TNFR-TPJ!^F2 complex upon TNFa stimulation 
(VanArsdale, T. and Ware, C.F. (19S4) J. Immunol, 153, 3043- 
3050) . The presence of a putative binding domain for Rab or 
a Rab-like molecule ac the C-cerminus of KHS2 indicates chat 
KHS2, in addition to having a potential role in the TRAF2 - 
dependant TNFa cytokine response, could also mediate 
signaling events thac utilize small GTPase proceins. 
Alternatively, the binding of a small GTPase protein cc che 
C-cerminus of KHS2 may be required for its potential TRAF2 - 
dependant signaling to a downstreami kinase such as MEKKl . 

Human GEK2 , SULUl and SULU3 

A recent report (Y-W Qian et al . , Science 282:1701- 
1704,1993) described xPlkkl as the activator of Plxl (the 
Xenopus Polo kinase) . In Xencpus oocytes, the STK Plkkl can 
phosphcrylate and accivate Plxl STK (the mam.m.aiian Polo 
kinase or PLK) . A dominanc -negat ive ( kinase - dead) form, of 
xPlkkl prevents Plxl accivacion and delays germinal vesicle 
breakdov/n. Yet another unidentified kinase is probably 
responsible for xPlkkl accivation during mdtosis. 



activates the TGFb-act ivated kinase (TAK) , which in turn 
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The homology chrough the entire lengch of the xPlkkl 
protein with GEK2 suggescs that GEK2 might represent the 
human honnologue for xPikkl . Based on this, GEK2 might be 
upstream of ?LK in mammalian cells. In addition, based on 
the phage display screen results using the 3ULU1 coiled- 
coil2 domain as bait, SULUl might also interact in vivo wich 
GEK2 and therefore regulate GEK2 (and/or SLK through the 
coiled-coil domain) activation leading to PLK activation and 
mitosis . 

If such a cascade of events is required for mitosis in 
mammalian cells, interruption of this signaling cascade at 
any point might block mitosis and could be beneficial for 
cancer treatmient . 

A recently cloned STE20-subf amily kinase, rat TACl , is 
most likely the rodent orthologue of human SULU3 - • 

(Hutchinson, M. at al . J. Biol. Chem. 273:28625-28632, 1998). 
TAOl activates MEK3 , 4 and 6 in vitro, while in transfected 
cells It associates and activates only MEK3 , resulting in 
phosphorylation and activation of p3S. These results 
implicate TAOl (SULU3) in the regulation of the p38 
containing stress-responsive MAP kinase pathway. 

Hum.an SULUl is weakly expressed in hematopoietic 
sources whereas SULU3 is found in B-cells and THl -restricted 
T cells. These m.am;m.alian SULU STKs display strong homology 
to the C. elegans SULU kinase. The role that this kinase- 
plays in- nematode development is unknown. The strong 
sequence homology between the catalytic domain of mammalian 
SULU kinases and other STE2C subfamily kinases such as SOK-1 
(humian STS2 0) and KHS2 suggests that the mammialian kinases 
may participate in the stress -response pathv/ay. The 
potential coiled-coil domains found at the C-terminus of the 
SULU kinases may play a role in the regulation of this 



kinase 
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Murine LOK { lymphocyte-orienced kinase) is a 
consticut ively activated STK of approximately 130,000 
daltons that is predominantly expressed in spleen, thymus 
and bone marrov; (Kuramochi, S. et al (1997) J. Biol. Chem. 
5 272: 22679-22684) as well as in meiotic testicular and 

primordial germ cells. The LOKl gene is Iccared in 
chromosome 11 of the mouse near the wr locus, a region that 
is associated v/ith reproductive and neurological defeccs 
( Yanagisav/a , M. et al (1996) Mol . Reprod . and Dev. 45:411- 

10 420) . LOK does not activate any of the known MAPK pathways 

(ERK, JNK and d3 8) nor tne NFkB pathway. The upstream 
signaling elements of LOrC as well as the extracellular 
stimuli uhac ucilize this kinase to elicit a biological 
response are also unknown (Kuram.ochi, S. et al (1997) J. 

15 Biol. Chem:. 272: 22679-22684). 

Human GEK2 is highly related to murine LOK, but based 
on sequence divergence in the non-catalytic domain, it 
appears to be a distinct member of this STE2 0 - subfamily . 
GEK2 may signal through a pathway that remiains to be 
20 defined. The presence of potential coiled-coil regions at 

Che C- terminus of GEK2 could play a key role in regulating 
the functions of this kinase. 

Human PAK4 and PAK5 
25 The p21 activated protein kinases (PAK) are a closely 

related subgroup of the STE20 family of serine/ threonine 
kinases. Extensive genetic and biochemical analysis of the 
budding yeast STE20 has shov/n the critical role this 
serine/ threonine kinase plays ac the juncture of several 
importanc intracellular pathways required to appropriately 
respond to extracellular signals. STE20 links the 
transcriptional response by mediating the activation of the 
appropriate downstream MAPK pathway as well as coupling 



30 
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changes in cellular mcrpholcgy via its control of the accin 
cytoskeleton . 

A halltTiark of the ?AK subgroup is their small G 



dependent activation upon this group of kinases. Via the 
PBD, ?AKs bind to activated small G proteins resulting in 
the derepression of the PAK's intrinsic kinase activity. 

Until recently, there were three known PAK kinases: 
PAKl , a 68 kD protein whose expression is restricted 
expression zo brain, muscle, and spleen; PAK2 (PAKl, PAK65), 
a 62 kD protein whose expression is ubiquitous; and PAK3 , a 
oBkD protein whose expression is restricted to the brain. 
Similar co STE20, the miammalian PAKs (1,2, and 3) have been 
shov/n CO respond to excracel lular signals (growth factors, 
mitogens, cytokines and a variety of cellular stresses) 
(Bagrodia.; et a J . (1995). J. Biol. Chem. 270: 22731-22737; 
Zhang, S., et al . (1995). J. Biol. Chem. 270: 23934-23936, 
Frost, J. en al . (1998) J. Biol. Chem. 273: 28191-2S198; 
Galisteo, M. et al . (1996) J. Biol. Chem. 271: 20997-21000), 
and are linked to TCR activation (Yablonski, D., et al . 
(199S) EMBO J. 17: 5 64 7 - 5657 ) , • and heterot r imieric G protein- 
coupled receptors (Knaus, U. et al . (1995) Science 269: 221- 
223) . 

The PAKs were originally identified as effectors for 
members of the Rho family of small G proteins (such as Rac • 
and Cdc42), hence their name, p2 1 -act ivated kinases (PAK) 
(Manser et al Nature 367:40-46). The recruitmenu of the 
PAKs to the appropriate intracellular location is cri::ical 
no their function. Attempts to elucidate the role played by 
PAKs in intracellular signaling and morphological changes is 
complicated due to the com.plex interactions by which they 
can be recruited by such factors as activated small G 



protein-binding domain (PBD) that confers G protein- 
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The adaptor molecule, Nek, is const icucively bound via 
its SH3 domain zo zhe proline-rich mocif in the N-cerminal 
portion of PAKl . Binding of the Nck-PAK complex to 
activated arov/th factor receptors in response co growth 
factor scimuiacion provides a nnechanism co link growth 
factor-stimulated and stress - response parhv/ays (Galisceo, M. 
ez al. (1996) J. Biol. Chem. 271:20997-21000). 

The FBD found at the N- terminus of PAKl is responsible 
for ics high-affinity inceraction with the GTP-bound forms 
of Cdc42 and Rac (Burbelo, ?. ec al , (1995) J. Biol. Chem. 
270:29071-29074).. The exact mechanism, through which the 
small GTPases activate ?AKs may involve, in pare, 
association of the kinase with activated growth faccor 
receptors through guanine nucleotide exchange factors 
(GEFs) . GEFs activate small GTPases by catalyzing the 
formation of their GTP-bound state, thereby promoting their 
association with, and activation of, PA^Ks . The known 
miam.mialian PAK kinases, as v;ell as Droscphlla and C. eiegans 
PAKs , all conserve an N-terminal ext racatalyt ic motif 
responsible for a high-affinity interaction with the GEF, 
PIX. The ?AK-Cdc42 interaction and subsequent PAKs occurs 
as a PIX/PAK ccm.plex (Manser, E. ec ai. (1998) Molecular 
Cell , 1 , 133-192) . 

PAK signaling stimulated by heterotrimeric G proteins 
is mediated through the interaction between a short 
conserved am^ino acid region located ac the C- terminus of 
PAKl v;ith the G-protein P-subunit (Leeuv/, T. et al.(199c) 
Nature, 391: 191-195). 

A variety of studies have indicated that the human PAKs 
are involved in mediating the activation of s tress - act ivated 
protein kinase pathways (JNK and to lesser extent p3S) . 
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?AKs are also ponencial n^.ediators in the crosstalk betv-eer: 
the pat:hv;ays regulated by the Rho family of small G r^rotems 



leading to the activation of the ERK pathway (Bagrodia, et 
al. (1995). J, Eiol. Chem, 270- 22731-22737; Zhang, 3., et 
al. (1995). J. Biol. Chem. 270: 23934-23936; Brov.^n, J., ec 
al . (1996) Curr Biol. 6:598-60596; Frost, J., et al . (1996). 
Mol . Cell. Biol. 16: 3707-3713). 

PAKl has been implicated in phosphorylan ing a 
regulatory sice in MEKl that is necessary for MEKl ' s ability 
to interact v/ith Rafl (Frost, et ai . (1997) EM30 J. 16:6426- 
643 S J . PAK3 has been shov.^n to phosphorylat e Rafl on a sice 
thac is imiportanc for Rafl activity (King, A., et ai . 
(1993). Nature 396: 1S0-1S3). 

PArCs play an im.portant role in controlling ■'■ 
morphological changes in cell shape mediated by the actxn 
cycoskeleton . Such miorphological changes are required for 
cellular functions ranging from cell division and 
proliferation to cell miotilicy and vesicle transport. ?AK 
activity has been implicaced in the localized assembly 
(leading edge) and disassembly {retracting edge) of focal 
adhesions . necessary for cell miotility (Frost J. et al (199S) 
J. Biol, Chem.. 2 73:23191-28198). 

PA.Pn:2 miay have a role in the morphological changes 
induced during apoptosis (MeTibrane and morphological changes 
in apopcocic cells regulated by caspase -miediaced activation 
of ?AK2 . (Rudel, T. (1997) Science. 276:1571-4)), and PAKl 
may be im^portant in prevencing apoptosis (Faure S, et al , 
(1997) EM30 J. (1997) 16:5550-61). In addition to 
overcoming mitogen- and anchorage - independent grov/ch, tumor 
cells need to escape che programmied cell death chac 
accompanies deregulaced cell growth. Thus, inhibition of 
PAKs may be effeccive in triggering apoptosis in tumors. 



and the signaling pachv/ays directly dovynscream of Ras 
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A direct requirenient for PAKs in che transf ormacion of 
uTtammalian cells has been shown for FAKl and PAK2 . Kinase- 
dead alleles of PAKl block ras transformation of RATI and 
Schwann ceils (Tang, Y., et al. (1997) Mol . Cell. Biol. 17, 
4454-4464) . Dominanc - negat: ive alleles of PAK:2 have been 
3hov;n co interfere v;ith ras -mediated transf ormtacion of 
m.ammalian cells (Osada, S., (1997) FEB3 Lett 404:227-233) 

Mutations in PAK3 have been implicated in nonsyndromic 
X- linked mental retardation suggesting a role for PA.K3 in 
cognitive function (Allen, K. et al . (1998) Nat. Genec . 20: 
25-30) . PA.Kl has been implicated in neurite outgrowth in 
PC12 cells (Daniels, R. et al . (199S) EMBO J. 17: 754-764; 
Nikolxc, M. et al . {1993) Nature 3 9 5:194-193). 

Finally, PAK-like STKs miay also play a role in AIDS 
pathogenesis since the myristoylared 27kD membrane- 
associated HIV Nef gene produce directly interacts with and 
activates these kinases via cdc42 and Rac . The Nef -mediated 
activation of PAK-like STKs correlates with the induction of 
high viral titers and the development of AIDS in infected 
hosts (Cullen, S, R. (1996) Curr . Biol. 6:1557-1559). 

Our results show that ?AK4 is expressed in thymus, 
dendrocytes, miast cells, m.onocytes, as well as in T cells 
(TH2 -restricted cells and MCLT4 ) and che B cell line RPMI . 
PAK5 is found in mast cells and in the T cell line M0LT4 . 
These daca suggest potential roles for PAK4 and PAK5 in the 
immune system. 

PAK4 and PA.K5 share v/ith the known PAKs a potential 
cdc42 -binding mocif at their N- termini. Both PAK4 and PAK5 
display sequence homolcgy in their C-termini to a m.otif 
responsible for an interaction becv.^een PAKl and the (3- 
subunit of he terot rim.ic Q-proteins (amino acid residues 665- 
676 in ?AK 4, and amino acid residues 336-3 93 in PAK5) . 
Consequently, PA.K4 . and possibly PAK5 , could mediate 
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signaling even'cs originating from growch factors as v/sil as 
frcrri iigands thac stimulate G-protein- 1 inked receptors. 

?AK4 conserves a leucine (leu 44) , that v/hen rr:uraced ~c 
a phenylalanine renders the kinase activicy of human PAKl 
5 consticut ively active, bypassing its cdc4 2 -binding 

requiremeni: for activacion (Brown J. et ai (1996) Current 
Biol . 6:598-605). ?AK5 contains an isoleucine ac the 
equivaienc position. Therefore, .the mechanism by which 
cdc4 2 potentially activates human PAKl, PAK4 , and possibly 

10 PAK5 , mtay be very sim.ilar. 

PAK4 and PA.K5 hov/ever, lack the PIX-binding m.ot if , and 
consequently cdc42 -act ivating GEFs other rhan ?IX (for 
example Dbl and Cool) m^ust be responsible for the activation 
of chese kinases. Al cema" ively , PAK4 and PAF:5 may be 

15 accivaced by another GT-Pase, such as Racl v/hich uses the ^ 

Tiar.'l GEF for its activation to the GTP-bound scace. 

PAK4 and PAK5 also lack che PxxP motif responsible for 
the Nck-PAKl association. Betv;een the PBD or cdc42-binding 
N-cerminal m.otifs and che putative GEF-binding regions, PAK4 

20 and PAK5 have long insertions (135 and 123 amano acids for 

PAK4 and PAK5 , respectively) relative co PAKl. This region 
probably confers different binding characteristics, -ro 
adaptor molecules and/or GEFs from those exhibited by knov;n 
mam.malian PAKs . 

25 PAKs have been shov/n to be upstream in pathways leading 

to accivation of both the JNK (Bagrodia, S., et al . (1995) 
J. Biol. Chem. 27C: 22731-22737) and ERK kinase pathways 
(Brown, J., ec ^l. (1996) . Curr Biol. 6:598-605; ..PAKl was 
shown to synergize v/ith ras in activation of the ERK pathv.*ay 

30 through phosphorylation of MErCl (Frosc, J. ec al . (1997).* 

EM30 J. 16:6426-6433) . Our da"a shows that MEKl serves as 
an in \rltrc subscrace for PA-K4 , suggesting a potential role 
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for ?AK4 in the accivacion of the ERIK pathv/ay and 
micogenesis . 

?AK5 may also have a mitogenic role, and be linked co 
cancer, based on izs expression profile (elevated RNA and 
5 procein levels in a wide variecy of tumor cell lines), its 

inneracuion with cdc42 via ics ?BD, and the ability of a 
kinase-dead allele (Lys350, 351 Ala) to block ras 
transformation of NIH3T3 cells. Thus, a screen for small 
molecule inhibieors of PAKE- kinase activity may yield 

10 compounds with therapeutic potential for intervention in 

cancer derived fromt a wide variety of tissue types. 

?AK4 and PAK5 may also play a role in HiV pachogenesis 
as pocential mediators of Nef signaling, since none of the 
kilov/n ?A:Ks correspond to the PAK-like kinase shown to 

15 inceracc wich, and be activated by, the HIV nef protein (Lu, 

X. et al. (1996) Current: Biology 6:1677-1684) 

The 3' untranslated region of PAK4 contains a CA repeat 
chac is prone to undergo expansion. CA dinucleotide repear 
instability has been associated with disease (Toren, M.Z. ec 

20 al (199S) Am. J. Hematol . 57: 148-152), and expansion cf 

such repeat in the 3' untranslated region of PAK4 could 
implicate this kinase in as yec unknown pathologies. 

CI inical aool icacions 

25 Hum.an STLK2 , STLK3 , 3TLK4 , STLK5 , STLK6 , and STLK7 

STLK3, STLK5, STLK6 and STLK7 , as v/ell as other 
hcmologues of the 3TLK subfamily of STE2 0 protein kinases 
such as STLK4 ,■ may play an imporcant role as mediators of 
the immiune response. Thus, they are targets for the 

30 development of specific small molecule inhibitors to treac 

immunological diseases, including, but noc lim.iced to, 
rheumaccid arthriuis, chronic inf lammaiiory bov/el diseases 
(e.g. Crohn's disease), chronic inf lamm.atory pelvic disease, 

/50 
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multiple sclerosis, aschma, osceoarnhric is , psoriasis, 

atherosclerosis, rhinicis and autoimmunity, as v/ell as in 
organ transplantacion . Ocher diseases include 
cardiovascular diseases. 
5 The human STLKs may also play an important role in cell 

grov;th regulation. Thus, they are targecs for developing 
small molecule kinase inhibitors for the treacmenc of cancer 
and m.etastases. STLKB maps to a chromosomal region 
frequently amplified in a variety of "umors including those 

10 from non- small cell lung cancer, breast cancer and 

peripheral nerve tumors. This suggests that STLK5 could 
play a role in che developm.enc , m.aincenance , or progression 
of hum.an tumors . 

The potential role of human STLKs 2,3, and 4 in 

15 . m.ediating oxidative stress scrongly suggests chat drugs 

targeting these kinases could prove useful in the creacmenc 
of myocardial infarction, arrhythmda and other 
cardiomyopathies, stroke, renal failure, oxidative stress- 
related, neurodegenerative disorders such amyotrophic laceral 

20 sclerosis, Parkinson's disease and Leigh syndrome, a 

necrotizing m.i toch-...idrial encephalopathy, as well. 

Human ZCl , ZC2 , ZC3 , and ZC4 

ZCl may be a comiponent of the CD2 8 - signaling pathway 

25 and therefore important in T cell accivacion. As such, ZCl 

as well as other ZC subfamily kinases, are targets for the 
development of specific smiall m.olecule inhibitors to treat 
im^m.unological diseases, including, buc not limited to, 
rheumiacoid arthritis, chronic inflammatory bov/el diseases 

30 (e.g. Crohn's disease), chronic inf lamtmacory pelvic disease, 

multiple sclerosis, aschmia, osceoarchr i 1 1 s , psoriasis, 
atherosclerosis, rhinitis and aut oimLmiuni t y , as well as organ 
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cransplancacion . Other diseases include cardiovascular 
diseases . 

ZCl and ZC2 are also implicated in cell grov/th 
regulation. Thus, ZC subfamily kinases are cargecs for 
5 developing small molecule inhibicors for che creatment of 

cancer and menastases. ZC2 maps to a chror.-csom.al region 
frequently amplified in a variety of tumtors including those 
from non- small cell lung cancer, small cell lung cancer, and 
cervical cancer. This suggests that ZC2 could play a role 
10 in the development, maintenance, or progression of human 

tumors . 

The role of human ZCl, ZC2 , ZC3 , and ZC4 in che 
inf lamima tory and scress - response pathv/ays, srrongly suggests 
that drugs cargecing chese kinases could have strong 

15 immunosuppressive actions. These drugs can prove valuable 

for che treatment of rheumatoid arthritis, artherosclerosis , 
autoim.mune disorders and organ transplantation among others. 

At least one very important class of im.m.unosuppresants , 
corticosteroids, functions by blocking SAPK activacion a- an 

20 as yet undefined sice on this pathway (Swantek, J . L . et al 

(1997) Mol . Cell. Biol. (1997) 6274-6282). Other 
im.munosuppresive drugs like the pyridinyl im^idazoles 
specifically target the p3S kinases (Kumar, S. et ai (1997) 
Biochemi. Biophys. Res. Commun . 235: 533-52S). Drug 

25 targeting of che MAPIK and p38 pathways could lead co the 

developm.ent of novel imm.unosuppresant s . 

Human SUI,-U and GSK 

The potential role of these novel STE2 0 - related protein 
30 kinases m the concrol of mitosis scrongiy suggests chac 

agents chat specifically inhibic these kinases could be 
useful for cancer and metastases creatmenc. 
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The close homology of human STLK5, GEK2 , 3ULU1 and 
SULU3 CO STE2G-subf amily kinases involved in the scress and 
oxidative response pathv;ay scrongly suggests that drugs 
targeting these kinases may also be useful as 
5 immunosuppressants as well as to treat ischemic disorders. 

Human KHS2 

The role of human KHS2 in the inflammatory and stress - 
response pathways, strongly suggests that drugs targeting 

10 this and related kinases could have strong immtunosuppressive 

actions. These drugs can prove valuable for the treatment 
of rheumatoid arthritis, artherosclerosis , autoimm.une 
disorders and organ transplantation among others. At least 
one very important class of im.munosuppresant s , 

15 corticosteroids, functions by blocking SA?K activation at an 

as yet undefined site on this pathway (Swantek, J.L. ec al 
(1997) Mol . Cell. Biol. (1997) 6274-6282). Other 
immunosuppresive drugs like the pyridinyl imidazoles 
specifically target the p3S kinases (Kumar, S. ec al (1997) 

20 Biochem, Biophys . Res. Commun . 235: 533-52 8). Drug 

targeting of the MAPK and p3 8 pathways could lead to the 
development of novel imm.unosuppressant s . 

Human ?AK family 

25 PAK5 has a role in cancer based on its expression 

profile (elevated RNA and protein levels in wide variety of 
tumor lines) , its interaction with Cdc42 via its PBD, and 
the ability of the kinase-dead allele of PAK5 (Lys350, 
351Ala) to block ras transformation of MIH3T3 cells. Thus, 

30 a screen for small molecule inhibitors of PAK5 kinase 

activity mtay yield compounds with therapeutic potential for 
intervention in cancers and m.etastases derived from a wide 
range of tissue types. 

I 33 
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?AK5 maps to a chromosomal region frequencly amoiifiec 
m a varie-y of tumors including those from non-small cell 
lung cancer, and smiall cell lung cancer. These findings 
suggesc that PAK5 could play a role in the development, 
5 maintenance, or progression of human tumors and/or 

metastases . 

The role of humian ?AK4 , and PArCB in the inflammatory 
and stress -response pathv/ays also strongly suggescs that 
drugs cargecing these kinases could have strong 

10 imimunosuppressi ve actions. These drugs can prove valuable 

for tne treatment of rheumatoid arthritis, artherosclercsis , 
autoimm.une disorders and organ transplantation among others. 

At least one very important class of immunosuppresant s , 
corticosteroids, functions by blocking SAPK activation at an 

1^ yet undefined site on this pathway (Swantek, J.L. et al 

(1997) Mol. Cell. Biol. (1997) 6274-6282).. Other 
im.m.unosuppresive drugs like the pyridinyl imidazoles 
specifically target the p38 kinases (Kumar, S. et ai (1997) 
Biochem. Biophys . Res. Commun . 235: 533-528). Drug 

20 targeting of the MAPK and p3S pat'-v-ays could lead to the 

development of novel immunosuppresant s , In addition, drugs 
targeting PAK4 or ?AK5 could prove useful as 
im.munosuppresant s as well as m AIDS treatmenc . 

25 VI 1 1 . Transoenic Anim.als . 

A variety of mtethods are available for the production 
of transgenic animals associated with this invention. DNA 
can be injected into the pronucleus of a fertilized egg 
before fusion of the male and fem.ale pronuclei, or injected 

30 into the nucleus of an embryonic cell (e.g., the nucleus of 

a tv/o-cell emJoryo) follcwing the initiation of cell division 
(Brinster et ai . , Proc . Nat. Acad. Sci . USA 82: 4438-4442, 
1985) . Emibryos can be infected v/ith viruses, esoeciallv 
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r"e c roviruses , modified zo carrv inorganic - ion receotcr 
nucleoside sequences of the inv^enticn, 

Pluripotent seem cells derived from che inner cell mass 
of the embryo and scabilized in culcure can be manipulated 
5 in culture to incorporate nucleotide sequences of the 

invention, A transgenic animal can be produced from, such 
cells chrough im.plantat ion into a blastocyst tha- is 
implanted into a foster mother and allowed to com^e to term*. 
Animals suitable for transgenic experiments can be obtained 
10 from standard commierciai sources such as Charles River 

(Wilmington, MA) , Taconic (Germantown, NY) , Harlan Sprague 
Dawley (Indianapolis, IN), etc. 

The procedures for m^anipulat ion of the rodent: embryo 
and for miicroxn j eccion of DNA into the pronucleus of the 
15 zygote are well known co chose of ordinary skill in the art 

(Kogan et al . , supra). Microinjection procedures for fish, 
amphibian eggs and birds are detailed in Houdebine and 
Chcurrout (Experientia 47: 857-905,- 1991) . Other procedures 
for introduction of DNA' into tissues of animals are 
20 described m U.S. Patent No., 4,945,050 (Sandford etai., 

■ July 30 , 1990) . 

By way of example only, to prepare a transgenic mouse, 
fem;ale mdce are induced to superovulane . Fem^ales are placed 
winh males, and the mated femiales are sacrificed bv CO2 
25 asphyxiation or cervical dislocation and embryos are. 

recovered -from excised oviducts.- Surrounding cumulus cells 
are removed. Pronuclear embryos are then washed and scored 
until the timie of injection. Randomly cycling adult fem.ale 
m.ice are paired v/ith vaseccom.ized miales . Recioient fem.ales 
30 are mated at the sam:e tim.e as donor femiales. Embryos then 

are transferred surgically. The procedure for generatinc 
transgenic rats is simiilar co thac of mice (Ham.mer et al . , 
Cell 63:1099-1112, 199 0). 

I3S 
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iMechods for the cuicuring cf embrycnic stem (ES) cells 
and the subsequent: production of nransgenic anirrials by zhe 
inrroduct ion of DNA into ES cells using methods such as 
eleccroporacion, calcium phosphate/DMA precipitation ana 
direct injection also are well knov;n to those of ordinary 
skill in the arz ( Teratocarcinomas and Embryonic Stem Cells, 
A Practical Approach, E.J. Roberrson, ed . , IRL Press, 19S7) . 

In cases involving random gene incegration, a clone 
containing the sequence (s) of the invention is co- 
cransrecced wich a gene encoding resistance. Alcernacivelv, 
the gene encoding neomycin resistance is physically linked 
to the sequence (s) of the invention. Trans fecc ion and 
isolation of desired clones are carried out bv anv one of 
several methods v/ell known to those of ordinarv skill in the 
1^ art (E.J. Robertson, supra) . 

DNA molecules introduced into ES cells can also be 
integrated into the chromosome through the process of 
homologous recombination (Capecchi, Science 244: 1283-1292, 
1989) . N!ethcds for posicive selection of the recombination 
20 event (i.e., neo resistance) and dual posi t ive - nega t ive 

selection .(i.e., neo resistance and gancyclovir resistance) 
and the subsequent identification of the desired clones by 
PGR have been described by Capecchi, supra and Joyner et al . 
(Nature 338: 1 53 - 156 , ' 1 98 9 ) , the Teachings of which are 
incorporated herein in their entirety including any 
drawings. The final phase of the procedure is co inject 
targeted ES cells into blastocysts and to transfer the 
olastocysrs into pseudopregnant females. The resulting 
chim.eric animals are bred and the offspring are analyzed bv 
Southern blotting to identify individuals that carry the 
transgene. Procedures for the production of non-rodent 
mammals and ouher animals have been discussed by others 
(Houdebine and Chourrout , supra/ Pursel et ai . , Science 

/3b 
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244:1251-1283, 19S9; and Simms et al , , 3io/Techncl ogy 6 : ITS- 
IS 3 , 1 9 3 S ) . 

Thus, the invencion provides transgenic, nonhuman 
mammals containing a transgene encoding a kinase of che 
invention or a gene effecting the expression of zhe kinase. 
Such transgenic nonhuman mammals are particularly useful as 
an in vi^/o test system for studying che effects of 
intrcduccion of a kinase, or regulating the expression of a 
kinase (i.e., through the introduction of additional genes, 
antisense nucleic acids, or ribozym.es) . 

A "transgenic animal" is an animal having cells chat 
contain DNA which has been artificially inserced into a 
ceil, v;nich DMA becomes pare cf the genome of the -anim.al 
which develops from thau cell. Preferred transgenic animals 
are prim^ates, mice, rats, cows, pigs, horses, goats, sheep, 
acgs and cats. The transgenic DNA m^ay encode human STE2Q- 
related kinases. Native expression in an anim^al may be 
reduced by providing an amounc of ant i- sense RNA or DNA 
effective to reduce expression of the receptor. ■ 

IX . Gene Theraov 

STE20-related kinases- or cheir genetic sequences will 
also be useful in gene therapy (reviewed m Miller, Nature 
357:455-460, 1992). Miller states that advances have 
resulted in practical approaches to human gene therapy that ' 
have demonstrated positive initial resulus. The basic 
science of gene therapy is described in Mulligan (Science 
260 : 926-931 , 1993) . 

In one preferred emibodimient , an expression vector 
containing STE2 0 - related kinase coding sequence is inserced 
into cells, the cells are grown in -icro and then infused in 
large numJoers into pacients. In another pref erred emibodi - 
ment,. a DNA segment containing a promoter cf choice (for 



BNSDOCID- <WO 9953036A2_1A> 



wo 99/53036 PCT/US99/08 1 50 

example a strong promccer) is cransferred intc cells 
containing an endcgencus gene encoding kinases of the 
mveneion in such a manner nhac the pronioter segmenc 
enhances expression of the endogenous kinase gene (for 
5 example, the promoter segment is transferred co che cell 

such that it becomes directly linked to che endogenous 
kinase gene) . 

The gene therapy may involve the use of an adenovirus 
containing kinase cDNA targeted to a tumor, systemic kinase 
10 increase by implantation of engineered cells, injection v;ith 

kinase-encoding virus, or injection of naked kinase DMA into 
appropriate tissues. 

Target cell populations may be modified by introducing 
altered forms of one or more components of the protein 
15 complexes in order to modulate the activity of such 

complexes. For example, by reducing or inhibiting a complex 
component activity within target cells, an abnorm.al signal 
transduction event (s) leading to a condition may be 
decreased, inhibited, or reversed. Deletion or missense 
20 mutants of a component, that retain the ability to interact 

with other components of the protein complexes but cannot 
function in signal transduction may be used to inhibit an 
abnormal, deleterious signal transduction event. 

Expression vectors derived from viruses such as 
2^ retroviruses, vaccinia virus, adenovirus, adeno-associated 

virus, herpes viruses, several RNA viruses, or bovine 
papilloma virus, may be used for delivery of nucleotide 
sequences (e.g., cDNA} encoding recombinant kinase of the 
invention protein into the targeted cell population (e.g., 
30 tumor cells) . Methods which are well known to' those skilled 

m tne art can be used to construct recombinant viral 
vectors containing coding sequences (Maniatis et al . , 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 

133 
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Laboratory, N.Y., 19S9; Ausubel ec al . , Currenc Frc-cccls ir. 
Molecular Biolcg;,', i3rsene Publishing Associates and VJiley 
Interscience , N,Y., 1989). Alternatively, recorrbinant: 
nucleic acid molecules encoding protein sequences can be 
used as naked DNA or in a reconsci tuned system e.g., lipo- 
soines or other lipid systems for delivery to target cells 
(e.g., Feigner et al . , Nature 337:337-3, 19S9), Several 
ocher methods for che direct transfer of plasmid DMA into 
cells exist for use in human gene therapy and involve 
targeting the DNA t:o receptors on cells by corr.plexing the 
plasmid DNA to proteins (Miller, supra) . 

In ics simplest form, gene transfer can be performed by 
sim.piy injeccmg minute amounts of DNA into the nucleus of a 
cell, through a process of. miicroinj eccion (Capecchi, Cell 
15 22:479-88, 1980}. Once recombinant Qenes are introduced 

into a cell, they can be recognized by the cell's normal 
m.echanisms for transcription and cranslacion, and a gene 
product v;ill be expressed. Other methods have also been 
attempted for introducing DNA into larger numb-erscf cells, 
20 These methods include: t ransf ect ion , v/herein DNA is 

precipitated v/ich CaPC. and taken into cells by pinocy-osis 
{Chen et al . , Mol . Cell 3icl . 7:2745-52, 1937); 
elect rcporac ion, wherein cells are exposed to large voltage 
pulses to introduce holes into the mtembrane {Chu et ai . , 
25 Nucleic Acids Res. 15:1311-26, 1987); lipof ect lon/liposome " 

fusion, wherein DNA is packaged into lipophilic vesicles 
which fuse with a carget cell (Feigner et al . , ' Proc . Natl. 
Acad. Sci. USA. 34:7413-7417, 1987); and'parcicle 
bombardm.enr using DNA bound tc- sm.all projectiles (Yang ec 
^0 ^I'f Proc. Narl. Acad. Sci 37 : 9558 - 9572 , 1990). Another 

method for introducing DNA into cells is to couple the DNA. 
to chemically mccdified proteins. 

/59 
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It has also been shown thac adenovirus proceins are 
capable of destabilizing endosomes and enhancj-ng che uccake 
of DNA into cells. The admixture of* adenovirus to solutions 
containing DNA complexes, or the binding of DNA to 
5 po^yiysine covalently attached to adenovirus using protein 

crosslinking agents substantially improves the uptake and 
expression of the recombinant gene (Curiel et al . , Am. J. 
Respir. Cell. Mol . Biol., 6:247-52, 1992). 

As used herein "gene transfer" means the process of 
10 incroducing a foreign nucleic acid molecule into a cell. 

Gene transfer is comtm.only performed to enable the expression 
of a particular product encoded by the gene. The product 
may include a protein, polypeptide, ant i- sense DNA or RNA, 
or enzymatically active RNA . Gene transfer can be performed 
15 in cultured cells or by direct administration inno animals. 

Generally gene transfer involves the process of nucleic 
acid contact wich a cargec cell by non-specific or receotor 
mediated interactions, uptake of nucleic acid incc the cell 
through the memb:-rane or by endocytosis, and release of 
20 nucleic acid into the' cyuoplasm from the plasm.a m.embrane or 

endosome. Expression may require, in addition, movement of 
the nucleic acid into the nucleus of the cell and binding to 
appropriate nuclear factors for transcripcion . 

As used herein "gene therapy" is a form of gene 
2^ transfer and is included within the definition of gene 

transfer as used herein and specifically refers co gene 
transfer to express a therapeutic product from a cell in 
v-iv-c or in vicro. Gene transfer can be performied ex \rivc on 
cells which are then transplanted inco a patient, or can be 
30 perform.ed by direct adminis- rat ion of the nucleic acid or 

nucleic acid-protein complex into the patienc . 

In another preferred em±)odiment , a vector having 
nucleic acid sequences encoding a STE2 0 - relaced kinase 

1^0 
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polypeptide is provided in v/hich the nucleic acid sequence 
is expressed only in specific tissue. Methods of achievir.o 
t issue - speci f i c gene expression are sec forch in 
International Publication No. WO 93/09236, filed NcverrJcer 3, 
5 19 92 and published May 13, 19 93. 

In all of che preceding vectors set forth above, a 
further aspect of the invention is that the nucleic acid 
sequence contained in the vector may include additions, 
deletions or modifications to some or all of the seauence of 

10 the nucleic acid, as defined above. 

In another preferred embodiment, a miethod of gene 
replacement is set forth. "Gene replacement" as used herein 
means supplying a nucleic acid sequence which is capable of 
being expressed in vivo in an anim^al and thereby providing 

15 or augmenting the function of an endogenous gene which is ' 

missing or defective in the animal. 

X . Administration of Substances 

Methods of determ.ining ' the dosages of compounds to be 

10 admiinis tered to a patient and modes of adrndnistering 

compounds to an organism are disclosed in U.S. Application 
Serial No. 08/702,282, filed August 23, 1995 and 
International patent publication number WO 96/22976, 
published August 1 1996, both of which are incorporated 

!5 herein by reference in their entirety, including anv 

drawings, figures, or tables. Those skilled m the art will 
appreciate that such descriptions are applicable to the 
present invention and can be easily adapted to it. 

The proper dosage depends on various factors such as 

'0 the type of disease being treated, the particular composi- 

tion being used, and the size and physiological condition of 
the patient. .Therapeutically effective doses for the 
compounds described herein can be estim.ated initiallv from: 

HI 
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ceil culture and animal models. For example, a dose can be 
formulated in animal models co achieve a cix-cula-in'^ 
concencracion range chat inicially cakes incc accounc che 
ICse as determined in cell culture assays. The animal m.odel 
data can be used to more accurately determ.ine useful doses 
in humans . 

Plasma half-life and biodist ribut ion of the drug and 
mietabolites in the plasma, tumors, and m.ajor organs can be 
also be determined co facilicate the selection of drugs most 
appropriate to inhibit a disorder. Such measurements can be 
carried out. For example, HPLC analysis can be performed on 
the plasp.ia of animals treated v/ith the drug and the location 
or radiolabeled compounds can be determined using detection 
methods such as X-ray, CAT scan, and MRl . Compounds that 
show potent inhibitory activity in the screening assays, but 
have poor pharmacokinetic characteristics, can be ootim:ized 
by altering the chemical structure and retestmg. In this 
regard, compounds displaying good pharmacokinetic character- 
istics can be used as a model. 

Toxicity studies can also be carried out by m.easuring 
che blood ceil composition. For example, toxicity studies 
can be carried out in a suitable animal model as follov/s: 1) 
the comipound is adm.inistered to mice (an untreated control 
mouse should also be used) ; 2) blood samples are 
periodically obtained via the tail vein from one m.ouse in 
each treatment group; and 3) the samples are analyzed for 
red and white blood cell counts, blood cell com.posi tion, and 
tne percent of lymphocytes versus polymorphonuclear cells. 
A comparison of results for each dosing regime v/ith the 
controls indicates if toxicity* is Dresent . 

At the termination of each toxicitv studv, further 
studies can be carried out by sacrificing the animals 
(.preferably, in accordance v/ith the Am.erican Veterinary 
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Medical Associatiion guidelines Report of the American 
Veterinary Medical Assoc. Panel on Euchanasia, Journal of 
American Veterinary Medical Assoc., 202:229-245^ 1993) 
Representative animals from each treatment grouc; can then be 
examined by gross necropsy for imm.ediate evidence of 
m.ecascasiS; unusual illness, or toxicity. Gross 
abnormalities in tissue are noted, and tissues are examined 
histologically. Compounds causing a reduction in body 
weight or blood components are less preferred, as are 
compounds having an adverse effect on major organs. In 
general, the greater the adverse effect the less preferred 
the comipound . 

For the treatm:ent of cancers the expected daily dose of 
a nydrophobic pharmtaceut zeal agent is between 1 to 500 
m.g/day, preferably 1 to 250 mg/day, and most preferably 1 to 
50 m^g/day. Drugs can be delivered less frequent Iv provided 
plasm;a levels of the active moiety are sufficient to 
miaintain therapeutic effectiveness . 

Plasma levels should reflect the potency of the druQ . 
Generally, the more potent the compound the lower the plasma 
levels necessary to achieve efficacy. 

EXAMPLES 

The examples below are not lim.iting and are m.erely 
representative of various aspects and features of the 
present invention. The examples below demonstrate the 
isolation and characterisation of the STE2 0 - related kinases 
of the invention. 
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EXAMPLE l:Isola-ion of cDNAs E nccaina Marr.nialian 

STE20-related Protein Kinases 
N^auerials and Mghhod5=; 
Identificatio n of novel clones 

Tocal RNAs were isolated using the Guanidine 
Sales/Phenol extraction prococol of Chomczynski and Sacchi 
(?, Chomczynski and N. Sacchi, Anal. Bicchem. 162, 156 
{1987}) from primary human tumors, normal and cumor cell 
lines, normal human tissues, and sorted human hematopoietic 
cells. These RNAs were used to generate single- stranded 
cDNA using the Superscript Preamplif ication System. (GIBCO 
ERL, Gaichersburg, MD ; Gerard, GF et ai . (19S9), FOCUS 11, 
56) under conditions recomm.ended by the m.anuf ac turer . A 
typical reaction used 10 ug total RMA with 1.5 |.ig oligoidT),, 
= 3 in a reaction volum;e of 60 }.iL . The product was treated 
with RNaseH and diluted to IOC j.lL with H.C. For subsequent 
PGR amplification, 1-4 uL of this sscDNA was used in each 
reaction. 

Degenerate oligonucleotides were synthesized on an 
Applied Biosystems 3943 DMA synthesizer using established 
phcsphoramidite chemistry, precipitated Vvith ethanoi and 
used unpurified for PGR, The sequence of some of the 
degenerate oligonucleotide prim.ers and the amino acid motif 
they encode is as follows: 

TRKl 5 ' - CTGAATTCGGNGCMTTYGGNAARGT - 3 ' GAFGKV ( sense ) 
TRK4 5 ' -GCTGGATCCYTCNGGNGGCATCCA-3 ' WMPPS (antisense) 
ROS 1 5 -GCNTTYGGNGARGTNTAYGARGG - 3 ' AFGEVYEG ( sense ) 
CCK4h 5 ' -GCTGGATGCYTCMGGNSWGATGGA- 3 ' WMSPE (antisense) 
CCK4C 5 ' -GAGTTYGGNGARGTNTTYYTNGC-3 ' EFGEVYEG (sense) 

These prim.ers were derived from, the sense and antisense 
strands of conserved miotifs within the catalytic domain of 
several protein kinases. Degenerate nucleotide residue 

H4 
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dssignacions are: N = A; C, G, or T; R = A cr G; Y = C or 
T; K = A, C cr T not G; D = A, G or T not C; S = C or G ; and 
W - A or T . 

PGR rsaccions were performed using degenerate prirr.ers 
5 applied to multiple single - stranded cDNAs . The primers were 

added at a final concentration of 5 uM each to a mixture 
containing 10 mM TrisHCl, pK S . 3 , 5C maM KGl , 1.5 rC'N MgCl., 
200 LiM each deoxynucleoside triphosphate, 0.001% gelatin, 
1,5 U AmpliTaq DMA Polymerase ( Perkin-Elmer/Getus ) , and 1-4 

10 cDNA. Fcilov/ing 3 min denaturation at 95^0/ the cycling 

conditions v^ere 94 ""C for 30 s, 5 0 ^G for 1 mm, and 72 ^G for 
1 min 45 s for 35 cycles. PGR fragments m.igrating bet^veen 
2G0-35C bp were isolated from 2% agarose gels using the 
GeneGlean Kit (EiolOl) , and T-A cloned into the pGRII vector 

15 (Invitrogen Gorp . U.S.A.) according to the manufacturer's 

protocol . 

Golcnies were selected for mini plasm.id DNA- 
preparations using Qiagen columns and the plasmid DNA was 
sequenced using a cycle sequencing dye - terminator kit with 

20 AmipliTaq DNA. Polymerase, ?S (A3I, Foster Gity, GA) . 

Sequencing reaction products were run on an A3I Prism. 377 
DNA Sequencer, and analyzed using the BLAST alignm-ent 
algorithm. (Altschul, S.F. et al . , J.Mol.Biol. 215: 403-10). 
Additional PGR strategies were employed to connect 

25 various PGR f ragm.ents or ESTs using exact or near exact 

oligonucleotide primers as detailed in the results section 
for each cDNA. . PGR conditions v;ere as described above 
except the annealing temperatures were calculated for each 
oligo pair using the form.ula: Tmi = 4 (G-^G) ^2 (A-hT) . 
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Isolation of cDMA clones: 

Human cDMA libraries were probed v/ith PGR or EST 
fragments corresponding to STE2 0 - related genes. Probes were 
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'^^"i^beled by random priming and used ac 2xlG' cc"/rriL 
lollowing standard techniques for libi-arv screenina. ?re- 
nybridizaticn (3 h) and hybridization (overnight) were 
ccnducced at 42 'C in 5X SSC, 5X Denhart ' s solution, 2.5% 
5 dextran sulface, 50 mT^l Na.?0,/NaH?C, , pH 7.0, 50% formamide 

v/ich 100 mg/m.L denatured salmon sperm DNA. Scringent v.^ashes 
were performed at 65 °C in 0 . IX SSC and 0.1% SDS . DNA 
sequencing was carried out on both strands using a cycle 
sequencing dye- terminacor kit wich Am.pl iTac Dr-TA Polymerase, 
^5 {A.3I, Foster City, CA) . Sequencing reaction products 
were run on an ABI Prism 377 DNA Sequencer. 

Makeqene 3ioinf ormac ics EST assembler 

The EST reports were downloaded from ncbi 
(wwv/ . ncbi . nlm . nih . aov ) . After uncompressing the files,- ehe 
program 'reporn2esc' was scripted co extract the following 
inform.ation: 1) EST names, 2) GenBank Accession numbers, 3) 
GenBank gi numbers, 4) Clone Id numbers, 5) the nucleotide 
sequences of the ESTs 6) the organism, 7) the librarv nam:-, 
8) the name of the lab, and 9) the insritution. The output 
of ^report.2est ' is a file in FASTA format wirh all of the 
information listed above in the first line of each entrv 
except the sequence, which is listed in the second line of 
each entry. The resulting file is form.atted for BLAST using 
25 'pressdb' (available as part of the ncbi tool kit) 

To build a gene or part of a gene from: ESTs, the 
program ^m.akegene' was developed. Input to this program is 
a query sequence and the organism/species for which a gene 
is to be built. An initial search of the formatted EST 
database described above is performed using BLAST (blastn) 
Any results that contain warnings, such as polyA tails or 
other repeat elem^ents, are elimiinated from future queries/ 
e program. ' bias t_parse_report s ' v^as developed to extract 
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zhe PASTA header line trovr: the search resales and zhe ouzx:)uz 
is Chen filtered to extracc only FASTA header lines for che 
desired species. 

The initial results, having been filtered for warnings 
5 and species, go into a loop in which searches against the 

database are repeated until no new ESTs are found. The looo 
consists of zhe following steps: 1) when possible the names 
of boch ends of the ESTs are extracted frorr^ the database by- 
searching using the 'Clone Id' field or the part of the ^ EST 
10 name' field before the , r or .s postscript, 2) any ESTs that 

have been used as queries in previous loops are removed from 
the current query by the program^ 'subtract', 3} the 
resulting list of ESTs is used to extract the sequences from 
tne database by the program bat ch_parse_f as t a , 4) BLAST is 
1^ run against the database using each sequence, 5) the output 

rij_es from: "BLAST containing warnings are rem.oved, 6} the 
results are filtered by species, and 7) the loop is 
reentered if there were new ESTs found in the previous pass 
through the loop. 

ESTs chosen by 'makegene' are used as input for the 
program ' m.pd2_cluster ' (Hide, W., Burke, J, and Davison, D. 
U. of Houston, unpublished) which clusters overlapping • 
sequences. The programs ^contig' (Kerlavage, T., TIGR, 
unpublished), 'gde2miult' and 'gde2sing' (Smith, S.W., ec 
al., CA3IOS 10, 671-675 (1994)), are used to make an 
alignment and consensus sequence of the overlaoping ESTs. 



RES'JLTS 

cDImA c lonmo and characterization of STLK2 

The human STLK2 cDNA sequence is composed of two 
overlapping EST fragments, AA191319 and Wlo504, that were 
laentified using a Smith- Waterm.an search of the EST database 
with STLPCl fM5T3 GB:AF024636) as a query. The com.plete 
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sequence of boch clones was deterrr^ined and used co generace 
the full-length hurr-an STL2 seauence . 

EST clone AA191319 contains a 1327 bp mserc and an OR? 
of 1145 bp (382 amino acids) . EST clone W16504 contains a 
5 24 74 bp inserc (not including the poly- A tail) and an ORF of 

637 bp (332 amino acids) . 

The full-lengch human 3TLK2 cDNA (SSQ ID NO. 1) is 3263 
bp long. AA191319 spans positions 1-1327 and W16504 
positions 743-3216. The overlap between chese tv/o clones 
10 exhibits 100% sequence identity. The human STLK2 cDNA 

constains a 1243 bp ORF flanked by a 181 bp 5' UTR (1-131) 
and a 1784 bp 3' UTR (1433-3216) that is followed by a 52 
nucleotide polyadenyla^ed region. A polyadenylat ion sicrnal 
(AATAAA.) is found a* positions 3193-319S. The sequence 
flanking the first ATG conforms to the ?:czak consensus for 
an initiating m.echionine, and is believed to be the 
translat ional start site for STLK2 . Furthermore, human 
3TLK2 , and the related SOK-1 and MST3 proteins, conserve the 
am.^no acid sequence immediacely following this presumed 
20 initiating methionine. 

Several EST fragments span the complete STLK2 sequence 
with AA191319 at the 5' end and W16504 at the 3' end. 

All searches against the public nucleic acid database 
(NRN) and protein database (MRP) were conducted using the 
Smith-Waterman gap alignment program ((Smith, TF and 
Waterman, MS (1981) J. Mol , Biol, 147, 195-197).) with the 
PAMIOO matrix and gap open and extension penalties of 14:1, 
resoectivelv . 



cDNA C lonmo and Characterization of STL:K3 

A mamm;alian STLK3 clone, 155-31-19, v/as first 
identified fromi a ?CR screen with the degenerate olicos, 
TRKl and TRK4 , applied to a sscDMA generated from adult rat 
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brain substantia nigra. Sequence analysis of che 4 57 bo 

inser" indicated that it represented a novel rr^ea^cer of che 
STE20 -subfamily of STKs . 

A Smith- Waterman search of the EST database wich the 
5 rat 3TLK3 fragment and human STLKl (MST3 G3:AF02463 6) as 

queries identified several overlapping ESTs soannmo most c 
the humian STLK2 cDNA sequence. A Makegene analvsis 
generaced a 3037 bp concig fromi approximat el v 4- EST 
sequences. Since the 3' ESTs were not comimex-cially 
10 available, a pair of primiers ( 5 ' -CACAGAAACGGTCAGA.TTCAC- 3 ' 

and 5 ' -GATCAGGGTGACATCAAGGGAC-3 ' ) were derived from this 
region to generate FCR clone 3xR21-2C-6 from human fetal 
liver 3ScDNA_ This clone and EST AA27S967 v;ere fullv 
s^querlc^d to generate the full-lengch 3TLK2 cDMA seouence . 

AA273 967 is a S37 bp EST isolated by the I^LAGE 
consortiumi from. cDNA made from CD20-f/IgD- germ.inal Center B 
cells sorted from human tonsillar cells, 

FCR clone 3R21-20-6 was isolated from, humian fetal 
sscDNA and contains a 1116 bp insert, .including a 1036 bp 
20 ORF encoding the 362 C-terminal ammo acids of- STLP:3 : 

The full-length hum.an STLrC3 cDNA (SEQ ID MO. 2; "is 3030 
bp long. A^.278967 spans positions 1-S14 and 3R21-20-6 spans 
positions 464-1579. The overlap between these two clones 
exhibits 100% sequence identity. The rem.aininq 1452 bp of 
-5 UTR is-derived from an assemJ^'ly of m;ultipl-., unconfirmed' 

EST fragm^ents. 

The near full-lengch human STLK3 cDMA (SEQ ID NO . 2 ) is 
3030 bp long and consists of a 1548 bp OR? flanked by a 1476 
bp 3' UTR (1550-3025) and a 5 nucleotide pel yadeny laced 
region. A polyadenylacion signal (AATAJLA) begins at 
position 3004. Since the coding region is open throughout 
tne 5' extent of this sequence:, this is apparently a partial 
cDNA clone lacking the M-cerm^inal stare m^et hionine*. Six 

m 
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copies of a '^GGCCCC" repeac were observed in posi-ions 21- 
67. Five independenc ESTs (AAi50333, AA2S6379, AA251679, 
AA2520C4, AA278967) shewed che same repear, suggesring chat 
chis sequence may be an integral region of che human STLK3 
gene. Stronger evidence for this being the case is provided 
by che sequence of the m.urine orthologue of 5TLrC3 
represented by a 376 bp EST W20737. 

Multiple EST fragments span che complete STLfCB seauence 
v/ich AA278967 at the 5' end and AA628477 and others at the 
3 ' end . 



cDMA C loning and Characterization of STLK4 

The human STLK4 cDNA sequence is composed of cwo 
overlapping EST fragments, AA297759 and AA1004S4, chac were 
15 identified using a Smi th-Wacerm.an search of che EST database 

wich STLKl (iMST3 G3:AF024635) as a query. The complete 
sequence of both clones was aetermiined and used to oenerate 
the near full-length hum^an STLK4 sequence. 

AA100484 is an IMAGE consortium cDNA_ clone isolated 
from the T-84 colonic epithelium cell line. It has an 
insert of 3694 bp and a coding region of 1146 bp (382 amino 
acids) . A Sm.i th-Waterm;an sequence alignment against che NRN 
dacabase showed chis EST co be 71.4% identical co the human 
STE20-like kinase (GB:X99325). 

W16504 is an IMAGE consortium: clone isolaced from a 
human fetal hearc cDNA library. It has an insert lencch of 
2474 bp (not including che poly-A tail) and a coding region 
or 6c7 bp (229 am^ino acids) . A Smiich- Watermian .sequence 
alignment of Wlo504 against che NRN database showed this EST 
CO be 6 9.2% identical co che hum.an STE2G-like kinase 
(GB :X9 932 5) . 

The full-length hum.an STLK2 cDNA (SEQ ID NO. 1) is 3 2 63 
bp long. AA191319 spans positions 1-1327, and W16504 
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positions 743-3216. The overlap between these zvjo clones is 
535 bp long with 100% sequence identity. 

AA100484 is an IMAGE consortiurrt cDNA clone isolated 
from the T-o4 colonic epithelium cell line. AA100434 covers 
the bulk of Human STLK4 wich its 3 6 94 bp, which spans 
positions 146-3839 of SEQ ID NO : 3 . A second EST, AA297759, 
isolated from a Jurkat T cell cDNA library, spans -posicions 
1-271 of the human STLK4 contig. The two ESTs overlap over 
a 126 bp screech that has only one nucleotide discrepancy at 
position 149 (G in AA297759 and T in AA100484) . A T at this 
position was chosen for the SEQ ID NO : 3 based on sequence 
daca generated from A100484 . The 5' 145 bp of hum.an STLK4 
contains three sequencing ambiguities (N' s in SEQ ID NO : 3 ) 
arising from, sequence errors in the GenBank entry for 
AA297759. Three amino acid sequence ambiguities in the N- 
cerm.inus of humian STLK4 are present also in SHQ ID NO : 7 as* a 
consequence of the sequence inaccuracies from che EST entry. 

The coding region of human STLK4 is 1242 bp long (2- 
1243), capable of encoding a 414 amine acid polypeptide, and 
is followed by a 2596 nucleotide 3' UTR (1244-3339). Human 
STLK4 ends in a polyadenylat ed strecch that has 18 adenines 
(3840-3857) . A polyadenylat ion signal (AATAAA) xs found 
between positions 3822-3827. Targeted-PCR cloning 
identified one rat orthclogue of human STIjK4 , clone 1-35-31- 
19. In addition, one murine orthologue of human STLK4 was 
recognized in the EST database as AA117483. None of these 
orthologues add additional N-termiinal sequence to the human 



The near full-length human STLK4 cDNA (SEQ ID NO . 3 } is 
3557 bp long and consists of a 1242 bp ORF flanked by a 2596 
bp 3' UTR (1244-3339) and an 18 nucleotide polyadenylated 
region. Polyadenylac ion signals ; AATAAA) begin ac positions 
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2181 and 3822. Since the ceding region is open chroughouc 
che 5' excenr of this sequence, this is apparent: Iv a oartial 
cDNA clone lacking the terminal scare methionine. A near 
full-length murine STLK4 cDNA is represented in zhe 1773 bp 
SST AA117438. It extends an additional 21 nucleotides 5' of 
the human STLK4 consensus, but since its coding reqion is 
open throughout the 5' extent of the sequence, this is also 
probably a partial cDNA clone lacking the M-terminal start 
methionine . 

Several EST fragments span the complete STLK3 sequence 
with AA297759 at: the 5' end and AA100484 and ochers at the 
3 ' end . 

cDNA C loning and Characterization of STLPC5 

'^^s human STLK5 cDNA sequence is com.pcsed of four 
overlapping sequences, AI418298, 2R96-13-1, 3R25-45-3 and 
R46685. A hum.an STLK5 clone, F07734, was first identified 
using a Smith-Waterman search of the EST database with 
SPS__3C (U33057) as a query. 

AI4132 9S is an IMA.GE consortium cDNA clone with ah 895 
bo insert 

PCR clone 2R96 - 13 - 1 " was isolated from human brain 
sscDNA using primers 5 ' -CTCA.TCTGTACACACTTCATGG and 5'- 
GATTCCCACACTGTAGATGTC derived from F07734 . 2R96-13-1 
con-ains a 330 bp insert and an ORE of 330 bp (llC amino 
acids) . 

EST clone R46685 was :Ldentified using a Smd th- Wacerman 
search of the EST database with the C-cerminus of SPS sc 
(G3:U33057) as query. Sequence analysis of che 1047 bp 
insert identified this EST to contain an ORE of 23 5 bp (95 
amino acids) encoding the C-term.inus of human STLK5 . 

PCR clone 3R25-45-3 was isolated from human fetal brain 
sscDMA using primers 5'- GGCCCTCGACTACA.TCCACCA.CAT and 5'- 

152. 
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CAACGAAACTAACACAGCATAAGG derived from 2R96-13-1 and R466S5, 

respectively. 3R25-45-3 contains a 330 bp i.nserc and an ORF 

of 750 bp (250 amino acids) . 

The full-length human STLK5 cDNA (SEQ ID NO:96) is 2110 
5 bp long and consists of a 1119 bp ORE flanked by a 22 9 bp 5' 

UTR and a 762 bp 3' UTR . The sequence flanking the first 

ATG conforms to the Kozak consensus (supra) for an 

initiating methionine; and is believed to be the 

translational snart site for STLK5 . 
10 Several EST fragm.ents span the complete STLK5 sequence 

with AA297059 and F07734 at the 5' end and R4o686 and F03423 

and ochers at the 3' end. 

STLK5 displays a 100% match over a 41 bp stretch 

(position 2-42, SEQ ID NO. 97) to a human CpG island repeat. 
15 {Z612 77) . 

cDNA Cl oning and Characterization of STLK6 

Humian STLK6 was first identified in the -translaced* EST 
database (AA219667) as a novel serine threonine kinase, 

-0 The partial human STLPC6 cDNA (SEQ ID NO: 93) is 2,001 bp 

long and consists of a 1,2 54 bp ORE flanked by a 75 bp 5 ' 
UTR and a 673 bp 3* UTF:, The sequence flanking the first 
ATG conforms to the Kozak consensus (Kozak, M., Nucleic 
Acids Res, 15, 8125-8148 (1987)) for an initiating 

^5 methionine, and is believed to be the translational start ' 

sice for SThKG , 

At the time of filing, inventors believe that STLK6 
does not have any significant m.atch in the nucleic acid 
database . 

10 

cDNA C loning and Characterization of 5TLK7 

Human STLK7 was first identified in the translated EST 
database (AA9S8 954) as a novel serine threonine kinase. The 
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original clone was noc available chrough public sources, so 
a PGR fragment amplified from the sequence of AA98S95-1 
yielded 5R54 -21 -2 . 

The partial human STLK7 cDNA (SSQ ID K/O:100) is 311 bp 
5 long and consists of a 309 bp ORF . Since the coding reaion 

is open throughout the 5' and 3' excent of this sequence, 
chis. appears to be a partial cDKiA clone lacking the N- 
terminal start methionine and C-t:erm:inal stco codon. 

STLK7 shares 80% sequence identity to human 3?AK 
10 {AF0999S9) over a 167 bp region and 50% nucleotide sequence 

identity to SLTK7 (SEQ ID NO. 101) over 3 91 nucleotides. 

cDNA Cloning and Characterization of ZCl 

The human ZCl cDNA sequence is composed of two 
15 overlapping PGR clones, 3R25-24-2 and R65-12-2. 

A hum.an ZCl clone, 125-33-5, was first identified from 
a PCR screen with degenerate oligos, TRKl and TRK4 , applied 
to sscDNA generated from, human small airway epithelial cells 
(Clontech) . Sequence analysis of the 503 bp insert 
20 identified a 501 bp ORF (167 amino acids) with the potential 

to encode a novel human STK related to the C. elegans 
ZC504.4 gene product. 

PCR clone 3R25-24-2 was isolated from human SNB19 
giioblas tomia sscDNA using prim.ers 5 ' - ATGGCGAJ^.CGACTCTCCCGCGAA 
25 and 5 ' -ACACCAAAATCAACAAGTTTCACCTC derived from the N- 

terminus of a murine orthologue of ZCl (KIK, GB:U889S4) and 
the original human ZCl clone 125-33-5, respectively. 3R25- 
24-2 contains a 527 bp insert and an ORF of 519 bo (173 
amino acids) . 

^'"^^ Clone R55-12-2 was isolated as follows: A Smith- 
Watermian search of the EST database with the C. elegans 
ZC504.4 gene (GB:Z50029) as a query identified a hum.an EST 
(we 165 6) whose ORF is related to the C. elegans aene and 
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cerminaces in an idencical residue (Trp) . A primer was 

designed 3' to this stop codon { 5 ' - AGTTACz^GGA.2^TTCCAAGTTCT; 
and used in a PGR reaction wich a primer derived from the 
original human ZCl clone 125-33-5 (5'- 

ATGAAGAGGAAGAAATGAAACTG) using sscDNA from numan SNB19 
glioblastoma as a template. PGR clone R65-12-2 was 
identified and was found to contain a 3611 bp insert with a 
3534 bp ORF encoding the G-cerminal portion of human ZGl 
(117S amino acids) . 

The full-length human ZCl cDNA (SEQ ID NO. 9) is 379S 
.bp long. Clone 3R25-24-2 spans positions 1-527, and clone 
^^--i2-2 spans positions 138-379S. The overlap between 
these two clones exhibits 100% sequence identity. The human 
ZCl contains a 3717 bp ORF (17-3723) flanked by a 6 bp 5 ' 
15- UTR and a 75 bp (3724-3793) 3' UTR . No polyadenyl at ion 

signal (AATAAA) or polyadenylated region are present m the 
3 'UTR. The sequence flanking the first ATG conforms to the 
Kozak consensus for an initiating mechionine, and is 
believed to be the translat ional start site for human ZCl. 

Multiple EST fragments (W81656) match the 3' end of the 
human ZCl -gene, but at the time of filing, the inventors 
believe that none exist in GenBank or the EST database that 
match ics 5' end. 

cDNA C loning and Characceri zat ion of ZC2 

The human ZC2 cDNA sequence is composed of four 
overlapping PGR clones, G75-31-17, R55-24-6, 2R23-8-1, and 
R99-6-1C . 

A human ZC2 clone, G75-31-17, was first identified from 
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a PGR screen with degenerate oligcs, R031 (5'- 
GCNTTYGGNGARGTNTAYGARGG) and CGK4b (5'- 

GCTGGATGCYTCNGGNSWCATGGA) , applied to sscDNA generated from 
the human HLT3S3 primary non-small, cell, lung cancer tissue. 
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Sequence analysis of the 4 92 bp insert identified a 492 ORF 
(164 amino acids) v;ith che pocential to encode a novel human 
3TK relaced co the C. elegans ZC504 . 4 gene product. 

PGR clone R99-6-10 was isolated as follows: A Srrdch- 
5 Waterman search of the EST database w±zh C. el&gans ZC504.4 

gene (GB: Z50029) as a query identified cv/o overlapping human 
EST fragments (AA115.844 and R51245) whose ORFs were related 
to the C. eiegrans gene and terminate in an identical residue 
(Trp) . A primer was designed 3' to the stop codon found in 
10 R51245 (5' -AGATGGACTGTACTGGGAGG) and used in a PGR reaction 

v/ith a prim.er derived from AA115844 (5'- 

AGTTTGTGGAGCTGTGTGGG) using human fetal brain sscDNA as a 
template. ?CR clone R99-6-10 was identified and was found 
to contain a 1095 bp insert with a 93 0 bp ORF encoding the 
C-term.inal portion of hum.an ZC2 (310 amino acids) 

PGR clone R65-24-6 was isolated from human HT29 colon 
cancer cell line sscDMA using primers 5'- 

AAGGTTATGGATGTGACAGGG and 5 ' -AGATGGACTGTACTGGGAGG derived 
from G75-31-17 and R51245, respectively. The 3' prim.er used 
m this PGR reaction misprim.ed between positions 1634-1653 
of this gene leading to the formation of a truncated 
product. R65-24-6 contains a 1593 bp insert and an ORF of 
1593 bp (531 amino acids) . 

PGR clone 2R28-8-1 was isolated from human colon cancer 
cell line HT29 sscDNA using primers 5 ' - CTCACAAGGTTGCCAACAGG 
and 5' -AGTCCCCACCAGAAGGTTTAC derived from R65-24-6 and R99- 
6-10, respectively. 2R28-8-1 contains a 1533 bp insert and 
an ORF of 1536 bp (512 am.inc acids) . 

The partial human ZC2 cDNA (SEQ ID MO. 10) is 4055 bp 
-ong. Clone G75-31-17 spans positions 1-492, clone R55-24-6 
spans positions 58-1650, clone 2R23-S-1 spans positions 
1466-3003 and clone R99-5-10 spans positions 2961-4055. The 
overlaping regions between these clones exhibit 100% 
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sequence identity except for a single guanine (G75-31-17) zo 
adenosine (R55-24-5) mismatch at: position 230 resulting in a 
Glu to Lys change. Based on the presence of a.n acidic 
residue in this position in human ZCl and 2C3 and C. elegans 
5 ZC504.4, the sequence encoding the Glu is probably correct. 

The human 2C2 gene contains a 3 8 91 bp ORF (1-3S91) flanked 
by 164 bp (3892-4055) 3' UTR . No polyadenylac ion signal 
(AATAAA) or polyadenylated region is presenc in rhe 3 ' UTR . 

Multiple EST fragments (R51245) match the 3' end of the 
10 hum.an ZC2 gene, buc at the nime of filing, the inventors 

believe that none exist in GenBank or the EST database that 
match its 5' end. 

cDNA Cloning and Characc er i zac ion of ZC3 
15 The human ZC3 cDNA sequence is composed of four 

overlapping PGR clones, G75-30-30, 3R33-5-3, 3R19-17-6, and 
R99-43-11 . 

A human ZC3 clone, G75-30-30, was first ident i f led ' from 
a PGR screen with degenerate oligos, ROSl and CCK4b, applied 

20 no sscDNA generated from a human HLT3 7C primary non-smiall 

cell lung cancer tissue. Sequence analysis cf the 4 92 bp 
insert identified a 492 ORF (164 amino acids) with che 
potential to encode a novel human STK related to the C. 
elegans ZC504.4 gene product. 

25 PGR clone R99-43-11 was isolated as follows: A Smith- 

Waterman search of the EST database with the C. elegans 
ZG504.4 gene (GB:Z50029) as a query identified a human EST 
(R54563) v/hose ORF is reiaced to the C. elegans gene and 
terminates in an identical residue (Trp) . A primmer was 

30 designed 3' co the stop codon found in R54563 (5'- 

TGAGGGGTGAGAGGTGAGG) and used in a PGR reaction with a 
prim.er derived from the 5' end of R54563 (5'- 
CGGAAAGGGTAGGAGAAATTG) using sscDMA from hum.an fetal brain 
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as a template. ?CR clone R99-43-li v;as idencified and was 
founa to contain a 719 bp inserc with a 564 bp ORF encodino 
the C- terminal portion of human ZC3 (133 amino acids) . 

?CR clone 3R19-17-C was isolated from human A54 9 lunc 
5 cancer cell line sscDNA using primers 5'- 

CCCCCGGGAAACGATGACCA and 5 ' -AGCCGCTGCCCCTCCTCTACTGT derived 
from G75-20-3C and R99-43-11, respectively. The 3' crimer 
used in zh±s PGR reaccion misprimed leading co nhe formation 
of a truncaced product. 3R19-17-6 contains a 1172 bp insert 
10 and an ORF of 1170 bp (390 am.ino acids) . 

?CR clone 3R33-5-3 was isolated from human A549 lung 
cancer cell line sscDNA using primers 5'- 

AGGGGAACATCGCGACCTAGTAC and 5 ' - GTCGAGGTGGTGGACGACC derived 
from^ G75-30-30 and 3R19-17-6, respect ivelv . 3R33-5-3 
15 contains a 2465 bp insert and an ORF of 2463 bp (821 amino 

acids) . 

The full-length hum.an ZG3 cDNA {SEQ ID MO. 11) is 4133 
bp long. Glone G75-30-30 spans positions 1-4S3, clone 3R33- 
5-3 spans positions 134-2598, clone 3R19-17-6 spans 
20 positions 2356-3 512 axnd clone R9::--43-ll spans positions 

3415-4133. The overlaps becween these clones exhibit 100% 
sequence idencicy. The human ZG3 gene contains a 3 978 bp 
ORF (1-3978) flanked by a 152 bp 3' UTR (3979-4133). No 
polyadenylation signal • (AJ\TAAA) or polyadenyl at ed region is 
25 present in the 3 ' UTR . 

Multiple EST fragm.ents (R54563) match the 3 ' end of the 
human ZC3 gene, but at the time of filing, the inventors 
believe that none exist in GenBank or the EST database that 
match its 5' end. 
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cDNA ClonincT and Characrerizacion of ZC4 

The human 2C4 cDNA sequence, represented hv ?CR 
fragment 3R25-27-1, was firsc identified in the human 
genomic cosmid 32J11 {GE:ZS33S50) containing exon sequences 
that displayed surong homology to the ZC504.4 C. elegans 
gene . 

PGR clone 3R25-27-1 was isolated from human fetal live: 
sscDNA and primers 5 ' -CAATGTTAACCCACTCTATGTCTC and 5'- 
AGTTTGGCGATGTTTTTGTTTTG derived from a pocencial ORF 
(positions 25729-25352) from the 82J11 cosmid and from an 
EST (R98571) encoding che G-cerminus of the human 3G4 gene, 
respectively . 

The partial human ZG4 cDNA (SEQ ID NO. 12) is 1459 bo 
long and consists of a 1047 bp ORF (2-1048) flanked by a 411 
bp (1049-1459) 3'UTR region. No polyadenylat ion signal 
iPJKTAAA) or polyadenyla ted region is present in the 3 ' UTR . 

• The N- terminal coding- sequence for ZG4_h was extended 
by building a contiguous ■ DNA sequence of 233,137 be 
containing ZS385C and four other sequences: cU84E10 and 
CU230B10 (from the Sanger Human Genome Sequencing Project, 
htcp : / /www . san oer . ac . uk/nGP/ ) and Z97356 and Z69734 
(available from zhe National Ins.ticute for Biotechnology. 
Information, 

ncto : / / www . n cbi . nlm . nih . aov/Ent rez/nucleoc ide . html 'j . The 
position of each sequence in the contig is represented in ' 
the table below. 
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The 233,137 bp concig was analyzed fcr exons usino the 
programs FGENES 1,5 and FGENESH , huTian gene scruccure 
predictzion software available from The Sanoer Centre 
ihztLO: / / genomic . sancrer . ac . uk/af /gf . shcml ) . 

The resulting human ZC4 coding sequence (SSQ ID NO; 104) 
is 3,631 bp long (excluding the stop codon) and encodes for 
a STE2 0 kinase of 1227 amino acids . 

cDNA C loning and Characterization of KH32 

The human KHS2 cDNA sequence is comiposed of four ' 
overlapping clones, 3R25-51-2, 3R16-34-2, 3R16-31-2, and 
T79915 , 

A hum.an KHS2 clone, AA250S55, was firsc identified 
using a Smith- Waterman search of the EST database with KHSl 
15 (GB :U77129) as a query. Sequence analysis of the 1112 bp 

insert identified a 618 bp ORF (206 ammo acids) with the 
potential to encode a novel STK related to the human KKSl 
gene product. Using AA250855 as a query, a second EST 
(AA446022) was found whose sequence was shown to contain the 
initiator methionine fcr hum.an KHS2 based on a comparison 
with KKSl, 

PCR clone 3R25-51-2 was isolated from human testicular 
cancer sscDNA using primers 5 • -CCGCCATGAACCCCGGCTT and 5'- 
CGATTGCCAAAGACCGTGTCA derived from A.A446C22 and AA250855, 
respectively. 3R25-51-2 contains an 850 bp insert and an 
ORF of 849 bp (233 amino acids) . 

EST clone, T7991c, was identified using a Sm.ith- 
Waterman search of the EST database with the C- terminus of 
KHSl (G3:U77129) as a query. Sequence analysis of the 2107 
op insert identified this EST to contain an ORF of 34 5 bp 
(115 amino acids disrupted by a single stop codon) encoding 
Che C- terminus of hum.an KHS2 , follov;ed by 1762 bp 3 ' UTR . 
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PGR clone 3R16-34-2 was isolated from hu-an testis 
sscDMA using primers 5 ' - AGAAGTTGCAGCTGTTGAGAGGA and 5'- 
TATGGCCCGTGTAAGGATTTG derived from AA2 5 0835 and T7 9 916, 
respectively. 3R16-34-2 contains an 1516 bp insert and an 
5 ORF of 1123 bp (376 ammo acids) . 

PGR clone 3R16-31-2 was isolated from normal human 
colon sscDNA using primiers 5'- GTGGGAGAAGTGTTGTGTTGT.A^ and 
5 ' -TATGGGGGGTGTAAGGATTTG derived from EST T7 9916. 3R16-31-2 
contains a 728 bp insert and an ORF of 669 bp (223 am.ino 
10 acids) . This clone lacked the stop codon present v/ichin EST 

T79916 (postion 2662 in the KKS2 sequence) . 

The full-length hum-an KHS2 cDNA (SEQ ID MO, 17) is 4023 
bp long. Glone 3R25-51-2 spans positions 1-855, clone 
AA250SS5 spans positions 336-923, clone 3R16-34-2 spans 
positions 545-2061, and clone T79916 spans positions 1917-' 
4023. The overlaping regions between these clones exhibit 
100% sequence identity, except for 4 nucleotide differences, 
two of which are silent, a third corrects the internal stop 
codon at position 2662, and the fourth at position 247 (T to 
C change) results in a Pro to Leu . change . The human KHS2 
cDNA contains a 2682 bp ORF (6-26S7) flanked by a 5 bp (1-5) 
5'UTR and a 1336 bp (2688-4023) 3' UTR . A potential 
polyadenylation signal (.^w^TAAA) is found at positions 4008- 
4013. No polyadenyiated region is present m the 3 ' UTR . The 
25 sequence flanking the first ATG is in a poor context .for 

translational initiation, however, a 134 bp 5'UTR sequence 
from EST AJ\446022 did not reveal any additional ATG's and 
displayed two in- frame stop codons 5' to the putative start 
ATG for hum.an KHS2 . 

Multiple EST fragmients match the 5 ' end (AA44b022) as 
well as the 3' end (R37625) of the hum.an KHS2 aene . 
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cDNA Cloning and Character! zac ion of SULul 

The human SULUl cDNA sequence is corriposed of chree 
overlapping clones, N40051, 2R90-1-1 and R90907. 

A human SULUl clone, N40091, was first idennified using 
5 a Smich-Waterman search of che EST database wich zhe C. 

elegans 3ULU gene (GB: U3 22 75) as a query. Sequence 
analysis of the 1321 bp insert identified a 906 bp ORE (302 
amino acids) with the potential to encode a novel human STK 
related to the C. elegans SULU gene product. 
10 EST clone R90907 was first identified using a Smith- 

Waterman search of the EST database with the 3' end of che 
C. elegans SULU gene (GB : U322 75) as a query. Sequence 
analysis of che 1647 bp insert identified a 573 bp ORF (192 
ammo acias) v/ith the potential to encode the C- terminus of 
15 che human SULUl gene product. 

FCR clone 2R90-1-1 was isolated from human HT2 9 colon 
cancer cell sscDNA using primers 5'- TATTGAATTGGCGGAACGGAAG 
and 5'- TTGTTTTGTGCTCATTCTTTGGAG derived from N40091 and 
R90907, respectively. 2R90-1-1 contains a 1625 bp insert 
20 and an ORF of 1623 bp {541 amino acids) . 

The full-length human SULUl cDNA (SEQ ID NO. 19) is 4177 
bp long Clone N40091 spans positions 1-1321, clone 2R9C-1-1 
spans positions 1048-2671, and clone R90907 spans positions 
2531-4177. The overlaping regions becv^een these clones 
25 exhibit 100% sequence identity. The human SULUl cDNA 

contains a 2694 bp ORF (415-3103) flanked by a 414 bp (1- 
414) 5'UTR and a 1069 bp (3109-4177) 3' UTR followed by a 19 
nucleotide polydenylat ed region. A pocential 

pclyadenylacion signal (AATAAA) is found at positions 4164- 
30 4169. The sequence flanking che first ATG conforms co the 

Kozak consensus for an initiating m.ethionine, and is 
believed co be che cranslac ional start sice for hum^an SULUl. 
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Multiple EST fragmencs match the 5 ' end (N27153 ) as v/ell 
as Che 2' end {R9090Sj of the human SULUl gene. 

cDMA CloninQ and Characterization of Murine SULU3 
5 The murine SULU3 cDNA sequence is represented by ?CR 

f ragmenc 2R92-1-6. 

A murine SULU3 clone, G83-4-5, was first identified 
from a PGR screen wich degenerate oligcs, CCK4c and CCK4b; 
applied to sscDNA generated from murine day- 12 embryos. 

10 Sequence analysis of the 473 bp insert identified a 471 ORF 

(157 amino acids) wich the potential to encode a novel human 
STK related to the C. elecfans SULU gene (GB: U32275) 
produce. The antisense strand of G83-4-5 is identical ac 
the nucleic acid level co the 5 ' UTR of the murine etsl 

15 protooncogenic transcription factor (GB:X53953). This 

horrtolcgy is likely the result of a cloning arcifact attached 
to the 5' -end of the database entry for mui^ine etsl. 

PGR clone 3R19-17-6 was isolated from, hum.an A54 9 cell 
sscDNA using primers 5 ' -CGGCGGGGAAAGGATGACGA and 5'- 

20 AGGCGCTGCCCGTGCTGTACTGT derived from G75-30-30 and R99-43- 

11, respectively. The 3' prim.er used in this PGR reaction 
misprimed leading co the formation of a truncated product. 
3R19-17-6 contains a 1172 bp insert and an OR? of 1170 bp 
(390 amino acids) . 

-5 PGR clone 2R92-1-6 was isolated from murine do em.bryo 

sscDNA using primers 5 ' - AGGGGAAGATGGGCAGGTAGTAG and 5'- 
GATTGGTTTGTGGTGATTGTTTGG derived from the 5' UTR of the etsl 
gene and the human EST AA234623, respeccively . The latcer 
(shown herein) encodes che G-cerminus of human SULU3 . 2R92- 

30 1-6 concains a 224 9 bp insert and an OR? of 2244 be (74S 

amino acids) . 

The partial m.urine 5ULu3 cDNA (5EQ ID NO . 2 1 ) is 2 24 9 bo 
long and consists of a 2244 bp ORE (6-2249) flanked by a 5 

l<^3 
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bp {1-5)'5'UTR. The sequence flanking the first ATG 

confcrms to rhe Kozak consensus for an iniciacing 
rr:ethionine , and is believed cc be the transl at ional scart 
site for murine SULiU3 . 
5 One EST fragment (AA446022) marches the 3' end of the 

partial murine SULU3 gene, but an the time of filing, the 
inventors believe that none exist in GenBank or the EST 
database that match its 5' end. 

10 cDNA Cloning and Characterization of Human 5ULU3 

The human SULU3 cDNA sequence is composed of civo 
overlapping clones, 2R90-22-1 and AA234623. 

A human SULU3 clone, AA234623, was firsc identified 
using a Smith- Waterman search of the EST database with the 
15 C. elegans SULU gene {G3: U32275) as a query. Sequence 

analysis of the 2552 bp insert identified a 11S5 bp ORF (395 
amino acids) with the potential to encode the C-terminus of 
a novel human STK related to the C. elegans SULU gene 
product . 

20 PCR clone 2R90-22-1 was isolated from hum.an SKMell28 

melanoma cell line sscDNA using primers 5'- 

TATTGAATTGGCGGAACGGAAG and 5 ' - TTGTTCTAJ\GAGTGCCCTCCG derived 
from the murine SULU3 2R92-1-6 clone and from AA234623, 
respectively. 2R92-1-6 contains a 1397 bp insert and an OR? 

25 of 1896 bp (632 am.ino acids) , 

The partial human SULU3 cDNA (SEQ IB NO.20) is 3824 bp 
long. Clone 2R90-22-1 spans positions 1-1897 and clone 
AA234623 spans positions 1173. The overlaping region 
between these clones exhibits 100% sequence identitv. The 

30 human 5ULU3 cDNA contains a 2 3 58 bp ORF (2-2 359) flanked bv 

a 1465 bp (2350-3324) 3 ' UTR followed by a 19 nucleotide 
pclydenylated region. A potential polyadenylat ion signal 
(AA.TAAAJ is found at positions 2602-2607.. Since the codinq 
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region is open "hr-oughout the 5' excent of chis secuence , 

this is appar-encly a partial cDNA clone lacking che M- 
terminal start methionine. 

Multiple EST fragments (R02283) match the 3 ' end of zhe 
human SULU3 gene, buc at the time of filing, the invencors 
believe that none exist m GenBank or the EST database that 
match :Lts 5' end. 

cDNA Cloning and Characterization of GSK2 

The hum.an GEK2 cDNA sequence is composed of three 
overlapping clones, AA45944S, 3R25-48-1 and GEK2_h#3 . 

A human GEK2 clone, AA45944S, was first identified 
using a Smi t h-V7acerman search of the EST database vvith the 
hum.an SLK gene (GB : AB0C2804) as a query. Sequence analvsis 

the 1286 bp insert: identified a 1227 bp ORx^ (409 amino 
acids) with the potential co encode the N-terminus of a 
novel human STK related ro the human SLK gene product. An 
additional Smi th- Waterman search using the C- terminus of the 
SLK gene as a query yielded three additional EST's, 
AA3236S7, AA330492 and .2VA168869, that encode the C-terminal 
region of human GEK2 . 

PGR clone 2R98-41-17 was isolated from human testis 
sscDNA using primers 5'- AAGACCATGCCGTGCGCCG and 5'- 
ATTCCTTCAGGTTCTGGTTATGG derived from AA3 236S7 and from 
25 AA380492, respectively. 2R98-41-17 contains a 851 bp insert 

and an ORE of 849 bp (283 amino acids) 

FCR clone GEK2_hl^3 was isolated from human sscCMA made 
fromi che H23 cumber cell line using prim.ers 5'- 

GCAGC.2^.GTGGAGAAGATGG and 5 ' - GGAAGCATCCCCAGAGCTGTAG derived 
from the sequence of clone 3R2 5-4S-1 and from- the 3' end of 
murine LOK (GB:D8972S;, respectively. GEK2_h#3 concains a 
1042bp insert and an OR? of 1041 bp (347 amino acids) . 



20 



30 
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The full-length human GEK2 cDMA (3EQ ID NO: 106) is 295 
bp long. Clone AA4 5 94 4S spans posicions 1-1286, clcne 3R25 
48-1 spans positions 1100-2449 and clone GEK2_h#3 spans 
posicions 1920-2362. The overlapping regions between these 
clones exhibit 100% sequence identity. 

The human GEK2 cDNA contains a 2 904 bp OR? (59-2962) 
flanked by a 5 3 bp (1-58) 5 ' UTR . The sequence flanking -he 
first ATG conforms to the Kozak consensus for an iniciatinq 
m.ethionine, and is believed to be the translat: ional scart 
site for human GEK2 . 

Multiple EST fragments (AA465671) macch the 5 ' end of 
the sequence, bun only one (AA3S0492) matches the 3' end of 
the human GEK2 gene . 

cDNA Cloning and Characterization of ?AK4 

The human PAK4 cDNA sequence is represented by clone 
SNB2#1 . 

A human PAK4 clone, R88460, was firsc identified usinq 
a Smich-Wacerm.an search of the EST database with che human 
PAK gene (GB: U24152) as a query. Sequence analysis of che 
2332 bp insert idencified a 930 bp ORF (310 amino acids) 
v/ith the potential to encode the C-terminus of a novel human 
STK related to the humian PAK gene produce, 

cDNA clone 3MB2#1 was isolated from, human glioblastoma 
ceil line SNB75 cDNA library using a probe derived from 
R8S460. SNB2#1 contains a 36C4 bp insert and an ORE of 2043 
bp (681 amino acids) . 

The full-length hum.an PAK4 cDNA (SEQ ID NO. 27) is 3 604 
bp long and consists of a 2043 bp ORF (143-2185) flanked by 
a 142 bp (1-142) 5'UTR and a 1419 3' UTR followed by a 22 
nucleocide pclydenylated region. A potential 

polyadenylation signal (AATTAA.^^) is found at posic:Lons 3 582- 
3583. The sequence flanking che firsc ATG conforms no the 
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Kozak consensus for an initiating methionine, and is 

believed to be che translat ional start site for hurr.an PAK4 . 

The 3' UTR of the ?AK4 gene contains a GT dinucleotide 
repeat prone to undergo expansion based on the number of 
repeats found in clones SN3#1 and R88460, 32 and 23, 
respectively. Several neurologic disorders have been 
correlated v;ith the expansion of di- or cri - nucleot ide 
repeacs similar to those found in the PAK4 seaue^nce, 
suggesting PAK 4 may also be a disease targec and that this 
repeat in ics 3 ' uTR may serve as a diagnoscic marker. 

Multiple EST fragments (AA535791) match the 3 ' end of 
the hum.an PAK4 gene, but at the time of filing, che 
inventors believe chac none exist in GenBank or the EST 
database that match its 5' end. 



c_DNA Cloning and Characterization of PAK5 

The full-length human PAfCS cDNA sequence is composed of 
two overlapping clones, H450#l-1 and SNB8#5 . 

A human PAK5 clone, R18825, was first idencified usmc 
a Smi-h-Waterman search of the EST database wich the human 
PAK4 gene as a query. SequBncQ analysis of the 124 8 bp 
insert identified a 420 bp ORE (140 ammo acids) with the 
potential to encode the C-term.inus of a novel human STK 
related to the human PAK4 gene product. 

-^^-^ clone SNBS#5 was isolated from hum.an SNB75 cDNA 
library using a probe derived from. R18825. 5NB2#1 contains 
a 2028 bp insert and an ORE of 1194 bp (398 ammo acids) . 

The partial hum.an PAK5 cDNA (SEQ ID NO. 28) is 2028 bp 
long and consists of a 1194 bp ORE (2-1195) flanked by an 
30 323 bp (1196-2C2S) 3 ' UTR followed by a 22 nucleotide 

polydenyiated region. A potential polyadenylat ion signal 
(AATTAA^^) is found at positions 2004-2010. Since the codina 
region is open tnroughout che 5' extent of this sequence. 
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chis IS apparently a parcial cDNA clone lacking che N- 
uerminal scar-i methionine. 

Clone H4bO#l-l was isolated from a hurran lung H450 cDNA 
liorary using a probe derived from che partial SNB2U1 cDNA 
D clone described above. Sequence analysis of che 2526 bo 

insert identified a 1773 bp ORF (592 amino acids) wi-h che 
potential to encode a full-length PAK5 . 

The human PAK5 cDNA (SEQ ID NO:102) is 2,SC6 bp long 
and consiscs of a 1,773 bp ORF flanked by a 201 bp 5' UTR 
10 and a 833 bp 3' UTR. The sequence flanking the first ATG 

conform.s co che Kozak consensus (Kozak, M., Nucleic Acids 
Res. 15, 3125-8148 (1937)) for an iniciatino mechionine, and 
is believed to be che translat ionai start sice for ?AK5 . 

PAK5 shares 99% sequence identicy over 2 7 95 bp to a 
recenc database entry, AF005C46. These sequences are 
presumed co be from che same gene, with minor polymorohic 
variat ions . 



EXAMPLE 2 : Expression Analysis of Mamm.alian 5TE2 0 - related 

Protein Kinases 
Materials and Methods 
Northern blot analvsiQ 

Northern blocs were prepared by running 10 g tocal RNA 
isolaced from; 50 human .tumor cell lines (hoP-92, EKVX, NCT- 
H23, NC1-H226, NCI-H322M, NCI-H460, NCI-H522, A549, HOP-62, 
OVCAR-3, OVCAR-4, OVCAR-5, OVCAR-S. IGROVi , SK-OV-3, SNE - 1 9 , 
SNB-75, U251, SF-268, SF-295, SF-539, CCRF-CEM, K-562, MOLT- 
4, HL-60, RFMI 3226, SR, DU-145, PC-3, HT-29, HCC-2998, HCT- 
116, 3^/^620, Colo 205, HTC15, KM-12, UC-31, SN12C, A49S, 
30 CaKil, RXF-393, ACKN, 7Sc-0, TK-10, LOX iMVI , Malme-3M, 3K- 

MEL-2, SK-MEL-5, SK-MEL-23, UACC-52, UACC-257, M14 , MCF-7, 
MCF-7/ADR RES, Hs57ST, MDA-ME-231, MDA-M3-435, MDA-N, BT- 
549, T47D) , fromi 22 human adult tissues (chym.us, lunc. 
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duodenum, colon, testis, brain, cerebellunn, cortex, salivary 

gland, liver, pancreas, kidney, spleen, stomach, ucerus, 
prostate, skeletal muscle, placenta, m.ammary gland, bladder, 
lymph node, adipose tissue), and 2 human fetal normal 
5 tissues (fetal liver, fetal brain ) , on a denaturing 

f ormtaldehyde 1.2% agarose gel and transferring to nylon 
membranes . 

Filters were hybridized with random prim.ed [a''?jdCTP- 
labeled probes synthesized from the inserts of several of 

10 the STS20-relaced kinase genes. Hybridization was performed 

at 4 2 °C overnighu in bX SSC, 0.1% SDS , IX Denhardt ' s 
solution, 100 ug/m.L denatured herring sperm DNA with 1-2 x 
10^ cpm./ mL of ^^F- labeled DNA probes. The fillers were 
v;ashed in 0 . IX S3C/0.1% SDS, 65 °C, and exposed on a 

15 Molecular Dynamrcs phosphcrimager . 

Ouancitative ?CR analysis 

RNA v;as isolated from a variety of normal human tissues 
and cell lines. Single stranded cDNA was synthesized from 

20 10 ug of each RNA as described above using the Superscript 

Preamplif ication System^ (GibcoBRL) . These single strand 
templates were then used in a 25 cycle PCR reaccion v;ich 
primers specific to each clone. Reaction products were 
electrophoresed on 2% agarose gels, stained with ethiaium 

25 bromide and photographed on a UV light box. The relative 

intensity of the STK-specific bands were estimated, for each 
samiDle 



30 



DMA Arrav ^ased Expression Analysis 

Plasmiid DNA array blots v/ere prepared by loading 0.5 yia 
denatured plasmiid for each STE2 0 - related kinase on, a nylon 
m.emjorane . The [a^'P] dCT? labeled single stranded DNA probes 
v/ere synthesized from: the total RNA isolaced from several 
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human immune tissue sources or tumor cells (rhvmus, 

dendrocyces , mast cells, monocytes, B cells (primary, 
Jurkac, RPMI8226, SR) , T cells (CD8/CD4-r, THl, TH2 , CEM, 
MOLT4) , K5 62 {megakaryocyces ) . Hybridization was performed 
an 42 ^^C for 16 hours in bX SSC, 0.1% SDS , IX Denhardt ' s 
solution, 100 |.tg/mL denatured herring sperm DNA wich IC"^ 
cpm/mli of [a'-?]dCTP labeled single scranded probe. The 
filters were washed in C . IX SSC/O.1% SDS, 65 ""C , and exposed 
for quancicat ive analysis on a Molecular Dynamics 
phosphor imager . 



RESULTS 

Discributi.on of STE2 0 - Rel at ed Gene Transcripts in Normal 
Tissues and Tumor Cell Lines 

ZCl , ZC2 , and ZC3 RNA expression was analysed by 
quanticacive PGR from mult iple ■ human normal tissues, 
cultured primary epithelial and endothelial cells, and tumor 
cell lines. The results are summ.arized in Tables 1 and 2, 
with relative expression values ranging from 0 

(undetectable) to 23 (very strong) . An "x^' refers to sample 
not -ested. ZCl, ZG2 , and ZC3 were all expressed at very 
low levels in most normal hum.an tissues, however ZCl and ZC2 
were more abundant in cultured epichelial cells and ZC3 in 
normal kidney and breast tissue. 

Expression of chese 3 genes was also examined in a 
panel of human tum^or cell lines representing a diverse 
sampling of tumor -ypes (Table 2) . ZCl and ZC2 showed 
strong expression in cell lines from most m.elanomas and 
renal cumors and fromi some non- small cell lung cancers and 
-:>0 colon tumors. ZC3 expression w*as consistently lower in the 

cum.or cell lines except for high expression in m.ost breast 
cancers and leukemias . The robust: cverexoression ZCl, Z^2 

1-70 



BNSD0C;D: <W0 9953036A2JA> 



wo 99/53036 PCT/US99/08150 

and ZC3 in tunior cells versus normal tissues may provide an 
actirac^tii ve cargec for oncology drug developmenc . 

Expression of all tihe novel STE2 0 - related kinases was 
examined in a panel of human -immune tissues/cells bv 
5 hybridization to a DNA array blot containing plasmids 

encoding each of these genes. STLK2 was broadly expressed 
in all 14 imm.une samples, v;hereas STLK4 and PAK4 v;ere highly 
expressed in a subset of 6-7 of the samples (Table 3). 
Several other kinases (SULU3, ZC4 , KKS2) had more restricted 
10 expression, while others v;ere expressed in only a single 

immune source (STLK3, thymus; ZCl , dendrocytes; ZC3 , 
mtonocytes; ?AK5 , miast cells and MCLT4 ) , and several more 
v;ere acsenc from all the immune sources assayed (GEK2, 
SULUl , ZC2 , STLK5) . These expression patterns 'were quite 
15 distinct among menabers of the same subfamily (i.e., ZCl, 

2C2, ZC3 and ZC4 , or PAK-1 , PAK2 , PAK3 , ?AK4 , PAK5 ) . This 
analysis suggests that some of these kinases may be 
candidate targets for various imm.une disorders, and chat 
som^e, which are more broadly expressed, may mediate 
^0 functions vital to the basic biology of most proliferating 

cells . 

TABLE 1 

ZCl, ZC2 and ZC3 Expression 
in Normal Human Tissues and Cells 



Sample 




ZCl 


ZC2 


ZC3 


NORMAL 










Brain 


Tiss 


2 . 8 


0 . 6 


0 . 9 


Duod 


Tiss 


3 . 8 


1 . 5 


0 . 3 


Heart 


Tiss 


1 . 2 


0 . 3 


0 . 0 


Kidney 


Tiss 


C . 7 


0 . C 


7 . 0 


Lung 


Tiss 


1 . 6 


0 . 2 


0 . 0 


Pancreas 


Tiss 


2 . 0 


0 . 4 


2 . 5 
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Placenca 


Tiss 


1 . 4 


0 . G 


0 . 0 


Sai gl . 


Tiss 


3 . 0 


0.3 


3 . 2 


Sk mus . 


Tiss 


2 . 3 


0 . 1 


0 . 1 


Spleen 


Tiss 


0 . 4 


0 . 0 


X 


Stomach 


Tiss 


0 . 8 


0 . 0 


0 . 0 


Thymus 


Tiss 


3 . 5 


0 . 4 


1 . 5 


Cereb 


Tiss 


2 . 8 


1 . 1 


4 . 4 


Liver 


Tiss 


1 . 8 


0 . c 


0 . 4 


Uterus 


Tiss 


1 . 6 


0 . 0 


1 . 4 


Prostate 


Tiss 


1 . 4 


0 . 0 


1 . 6 


Testis 


Tiss 


X 


X 


5 . 8 


f Brain 


Tiss 


X 


^- 


3 . 1 


Mam gl 


Tiss 


X 


X 


7 . 2 


HCAEC 


ENDO 


1 . 0 


0.0 


0 . 0 


HMVEC-d 


ENDO 


0 . 7 


0 . 0 


0 . 4 


HMVEC-L 


ENDO 


2 . 2 


1 . 6 


1 . 8 


HPAEC 


ENDO 


9 . 3 


5 . 3 


6.4 


HMEC 


EPI 


4 . 1 


2 . 3 


1 . 9 


RPTEC 


EPI 


3 . 6 


2 .2 


0 , 2 


HRCE 


E?l 


5,3' 


3 . 5 


1 . 3 


HSAE 


EPI 


0 . 9 


3 . 3 


4 . 8 



TABLE 2 

ZCl, ZC2 and ZC3 Expression in Tumor Cell ILnes 



Sample 


Origin 


2C1 


ZC2 


ZC3 


HO?- 9 2 


Lung 


9 . 3 


7 . 2 


3 . 3 


EKVX 


Lung 


10 . 7 


3 . 7 


— 


NCI-H23 


Lung 


5 . 8 


6 . 3 


4 . 1 


NCI-H226 


Lung 


6 . 5 


6 . 8 


3 . 3 



Sample 


Origin 


ZCl 


ZC2 


ZC3 


HOC -2 993 


Colon 


2 . 4 


3 . 8 


3 . 0 


KCT 116 


Colon 


2 . 2 


2 . 


5 . 4 


SW-620 


Colon 


7.3 |l2.1 


3 . 1 


COLO 205 


Colon 


9.1 jl6 . 2 


3 . 0 
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MCI -K322M 


Lung 


3 . 5 


5 . 3 


4 . 9 




HCT-15 


Colon 


13 .3 


4 . 9 


2 . 5 


NCI -H460 


Lung 


4 . 5 


3 . 7 


2 . 9 




KM- 12 


Colon 


"7 . 0 


13 . 2 


3 . 1 


NCI -H522 


Lung 


^ 7 


3 . 3 


4 . 6 




UO-31 


Colon 


10.4 


10 . 6 


G . 9 


A54 9/ATCC 


Lung 


3 . 8 


3 . 6 


4 . 1 




SN12C 


Renal 


8 . 1 


3 . 4 


2 . 8 


HOP-62 


Lung 


4 . 3 


3 . 8 


4 . 2 




A4 96 


Renal 


5.2 


3 . 1 


2 . 9 


OVCAR-3 


Ovary 


2 . 9 


3 . 1 


1 . 5 




Caki-1 


Renal 


9.2 


14 . 4 


2 , 3 


OVCAR-4 


Ovary 


3 - 3 


1 . 0 


3 . 8 




RXF 393 


Renal 


10.6 


4 . 8 


2 . 8 


OVCAR-5 


Ovary 


2 . 6 


3 . 6 


2.2 




ACKN 


Renal 


9 . 3 


6 . C 


3 . 9 


OVCAR-8 


Ovary 


3 , 6 


2 . 0 


4 . 7 


786-C 


Renal 


3 . 8 


15 . 6 


5 . 6 


IGROVl 


Ovary 


3 . 8 


1 . 7 


3 . 2 


TK-10 


Renal 


20.9 


21 . 2 


5 . 0 


SK-OV-3 


Ovary 


4 . 9 


0 . 0 


3 . 5 


LOX IMVI 


Mel 


2 . 3 


2 . 4 


3 .3 


SN3 -19 


CNS 


5 . 1 


5 . 4 


4 . 2 


Malme- 31V1 


Mel 


X 


X 


'J 


SNB - 7 5 


CNS 


2 . 5 


0 . 9 


0 . 7 


SK-MEL-2 


Mel 


15 . 7 


14 . 1 


2 . 9 


U251 


CNS 


1 . 5 


1 . 2 


0 . 6 


SK-MEL-5 


Mel 


7 . 9 


7 . 0 


0 . 0 


SF-268 


CNS 


5 . 8 


2 . 7 


3 . 0 


SK-MEL-28 


Mel 


15 . 5 


23 . 1 


0 . 0 


SF-295 


CNS 


6 . 4 


1 . 1 


3 .2 


UACC-62 


Mel 


12 . 1 


18 .3 


5.3 


SF-539 


CNS 


5 . 1 


2 . 9 


4 . 3 


UACC-257 


Mel 


10 . 8 


9 . 4 


6.2 


CCRF-CEM 

I 


Leuk 


3 . 4 


2 . 7 


3 . 1 


M14 


Mel 


4 . 4 


0.9 


7 . 9 


K-562 


Leuk 


4 . 1 


6 . 3 


4 . 3 


MCF7 


Breasc 


4 . 8 


1 . 3 


7 . 7 


MOLT- 4 


Leuk 


7.1- 


3 . 4 


4 . 2 


MCF- 7/ADR 


Breasc 


8 . S 


■— 

3 . 4 


7 . 7 


HL-60 


Leuk 


X 


X 


0 , 4 


Hs 57ST 


Breasc 


6.9 ■ 


2 . 6 


5 . 7 


RPMI 82 2 5 


Leuk 


0 . 5 


0 . 2 


1 . 4 




MDA-MB-231 


Breast 


5 . 7 


1 . 9 


6 . 4 


SR 


Leuk 


3 . 5 


7 , 2 


5 , 4 




MDA-MB-435 


Breast 


4 . 8 


6 . 7 


9,1 


DU- 14 5 


Pro 


X 


X 


3 . 4 




MDA-N 


Breasr 


7 . 3 


6 . 3 


9.1 


?C-3 


Pro 


X 


X 


3 . 4 




3T-549 


Breasc j 


3 . 6 


1 . 9 


3 , 0 


HT-2 9 


Colon 


2 . 4' 


5.9 i6 . 6 

1 




T-47D 


Breasc 1 0 . 4 
i ^ ' "* 


12 , 3 


9 . 3 
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Table 3: STE2 0 - related kinase expression in a hurrtan immune 

oane i 







Dendro- 


Mast 


Mono- 


B 


CD8 + 


1 




KINASE 


thymus 


cytes 


cells 


cytes 


cells 


CD4 + 


THl 


TH2 


GEK2 


3 5 0 


350 


350 


350 


350 


350 


3 50 


350 


SULUl 


350 


350 


350 


350 


350 


350 


3 50 


350 


SULU3 


350 


350 


350 


350 


12149 


350 


5115 


350 


STLK2 


117770 


13771 


27620 


92036 


18305 


39109 


5403 


3 564 


STI.K3 


3624 


350 


350 


350 


350 


350 


350 


350 


STLK4 


3524 


350 


350 


350 


350 


8685 


5642 


350 


STLrCB 


XXX 


XXX 


XXX 


XXX 


350 


350 


350 


XXX 




350 


. . 3377 


350 


3 50 


350 


350 


3 50 


350 


ZC2 


350 


350 


350 


350 


3 50 ! 

i 
1 


350 


350 


3 50 


ZC5 


350 


350 


350 


20156 


350 


350 


3 5 0 


350 


ZC4 


XXX 


XXX 


XXX 


XXX 


350 


350 


350 


XXX 


KKS2 


8766 


2508 


350 


56575 


350 


350 


350 


350 


PAK4 


32658 


7684 


3729 


100948 


350 


350 


350 


1604 


PAK5 


3 50 


350 


4905 


350 


350 


350 


350 


350 



KINASE 


CEM 

(T cell) 


MOLT4 
(T cell) 


JURKAT 
(B cell) 


jRPMI8226 
(B cell) 


SR 

(B cell) 


K562 
(MO) 


GEK2 


350 


350 


350 


350 


350 


350 


SULUl 


350 


350 


350 


350 


350 


350 


SULU3 


350 


350 


350 


350 


350 


350 


STLK2 


47236 


53252 


47605 


22560 


6 5 936 


3 0 3 9 0 


STLK3 


350 


350 


3 5.0 


350 


350 


3 50 


STLK4 


3648 


350 


26772 


157 0 


350 


3 50 


STLK5 


350 


350 


3 5 0 


XXX 


350 


350 


ZCl 


350 j 350 


35 0 


3 50 


350 


3 50 
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35C 


3 50 


350 


350 


3 50 


350 


ZC3 


350 


350 


350 


350 


350 


350 


ZC4 


1094 


7813 


14945 


XXX 


350 


6385 


KHS2 


350 


350 


350 


350 


350 


350 


PAK4 


350 


10246 


350 


3229 


350 


350 


pa:k5 


35 0 

1 


12672 


350 


350 


350 


350 



Transcript size from Northern data 



Kinase j (k±») 


STLK2 


3 , 8 


STLK4 i 5.0 


ZCl 1 6.9/4.7 


ZC2 1 6.0/8.0 


ZC4 




KHS2 


4 . 4 


SULUl 


4 . 5 


SULU3 


10 . 0 


GEK2 


5 . 5 


PAK4 


4 . 8 


PAK5 


3 . 5 



STLK2is widely expressed; the highest expression levels 
5 were found in placenra, spleen and PEL. 

STLK4 is also widely expressed in normal tissues 
including heart:, brain, piacenra, lung, liver, skeletal 
muscle, kidney, pancreas, spleen, nhymus, prostace, testis, 
ovary, small intestine, colon, and peripheral blood . . 
10 lymphocytes. STLK4 was also detected in Jurkat T cells. 

ZCl is highly overexpressed in the following human 
cancer cell lines: HOP-92,- EKVX, NCI-H23, NCI-H226, NCI - 
K322M, ,MCI-H522 , A549, KOP-62 (lung); OVCAR-3, OVCAR-4, 
OVCAR-5 (ovary); SNB-19, U251, SF-268, SF-295, SF-539 (CNS) ; 
15 K-562, RPMI-8226 (leukemia); DU-145, PC - 3 (prostate); HT-29, 

HCC-2998, HCT-115, SW620, COLO-2G5, HCT-15, KM-12 (colon); 
UO-31, CAKi-1, RXF-393, 786-0, TK-10 (renal); LOXIMVI , 
Malme-3M, SK-MEL-2, SK-MEL-23, UACC-62, UACC-257, M14 

^15 
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(melanoma); and MCF-7, MCF-7/ADR, His 573T, MDA-ME-231, MDA- 
MB-431, MDA-N, 3T-549, T-47D (breast:) . 

2C2 is expressed in brain and tescis. Ic is highly 
overexpressed in the follov;ing human cancer cell lines: TK- 
10 (renal); SK-MEL-28, UACC-62 (melanoma); T47D (breast). 

Moderate expression in HOP92 (lung); OVCAR4 , IGROVl 
(o'^/ary) ; DNB75, U251 (brain); K-562 (leukemia); andCOL02C5 
(colon) . 

SULUl is overexpressed in the following human cancer 
cell lines: KOP-92, EKVX, NCI-H23, NCI-H226, NCI-H322M, NCI - 
H522, A549, HOP-62 (lung); OVCAR - 3 , OVCA.R-4, OVCAR-5. SK-OV- 
3 (ovary); 3NB-19, U251; 3F-26S, SF-295, SF-539 (CMS); K- 
562, RPMI-B226 (leukemia); DU-145, PC-3 (proseace); HT-29, 
KCC-299S, HCT-116, SW6 2 0 , COLO - 2 0 5 , KCT-15, KM-12 (colon); 
UO-31, CAKi-i, RXF-393, 736-0, TK-10 (renal); LOX, IMVI , 
Maime-3M, 3K-MEL-2, SK-MEL-28, UACC-62, UACC-257, M14 
(melanoma); MCF-7, MCF-7/ADR, HIS 57ST, MDA-MB-231, MDA-MB- 
431, MDA-N, BT-549, T-47D (breasc) 

SULU3 shov/ed a broad pactern of expression in the 
normal tissue panel of RNAs . 

GEK2 -was expressed in spleen, thymus and testis. 
Expression was high in the cell lines R3L-2H3 and K441. 

PAK4 was expressed in the normal tissues: brain, testis 
and prostate, and in the human cancer cell lines: HMCI-H23 
(lung); OVCAR-3 (ovary); SN3-19, U251 (CNS) ; RPMI-8226 
(leukemia); DU-145 (prostate); COLO-205, KCT-15 (colon). 

PAK5 shov/ed weak expression levels in zhe normal 
tissues: brain, testes, bladder, colon, adrenal m.edulla, 
spleen, fetal liver, breast, cerebral cornex, cerebellum, 
thymius, salivary gland, lung, si:om.ach, duodenum, uterus, 
prostace, skeletal muscle and placenta. PAK5 was 
overexpressed in the human cancer cell lines: HOP- 92, EKVX, 
NCI-H23, NCI-H226, NCI-H322M, MCI - H52 2 , . A54 9 , HOP-62 (lung); 
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OVCAR-3, CVCAR-4, OVCAR-5, SK-OV-3 (ovary); SNE-19, U251, 

SF-26S, SF-295, SF-539 (CNS); K-562, RPMI-322o ( leukerrtia ) ; 
DU-145, PC-3 (proscace) ; KT-29, HCC-2993, HCT-li6, SW620,. 
COLO-205, HCT-i5, KM-12 (colon); UO-31, CAKi - i , RXF-393, 
5 786-0, TK-10 (renal); LOXIMVI, Malme-3M, SK-MEL-2, SK-MEL- 

23, UACC-62, UACC-257, M14 (melanoma); MCF- 7 , ' MCF- 7/ADR, KIS 
57ST, MDA-MB-231, MDA-fvIB - 4 3 1 , MDA-N, BT-549, T-47D (breast). 

EXAMPLE 3 : STS2 0 - related Protein Kinase Gene Expression 
10 Vector Con 5 n rue c ion 

Materials and Methods 
Expression Vector Construction 

Several expression constructs were generated for some 
of the human STE2 0 - relaced cDNAs including: a) full-length 

15 clones m a pCDNA expression vector; b) a GST- fusion 

construcc concaining the catalytic domain of the novel 
STE20 - related kinase fused zo the C-term.inal end of a GST 
expression cassette'; and c) a full-length clone containing a 
Lys to Ala (K to A) mutation at the predicr^ed ATP binding 

20 sice v/ithin che kinase domain, inserted in the pCDNA vector. 

The CO A'' mutancs of the STS2 0 - relaced kinase might 

function as dominant negative conscruccs, and will be used to 
elucidate che function of chese novel STKs . 

25 RESULTS 

Constructs for ZCl, ZC2 , ZC3 , 3ULU1 , 3ULU3 , PAK4 and PAK5 
have been generated. 

Numerous additional constructs have been generated for 
the various STE2 0 - subfamily kinases, including full length, 
30 kinase inactive and tagged versions. In addition, the 

following three constructs were designed for specific 
appl icat ions, based on their unique domain structure: 
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Constiruct 1; SULL-l -coiled- coil2 

Vector: pGEX-4T 
• Inserc : Coiled-coil2 

Sequence: Amino acids 752-398 
5 Purpose: phage display 

Resulc: Interacts wich GEK2 CCl 

Construct 2 : SULU3 -coiled- coil 2 
Veccor : pGEX4T 
10 Inserc: coiled-coii 2 domain fused to GST 

Sequence range of insert: amino acids 802-898 of SEQ 
Purpose: phage display 

Result: Interacts v/ith coiied-coiled region of huinan SLK 

15 Construct 3: PAK5 Dominant Negacive 

Vector: pGANS 

Inserc : Full length coding sequence of human PAK5 containing 
the following mutation: K350,351A (Lys at aa positions 35C 
and 351 changed to Ala) . 
10 Purpose: no determine role of human PA.K5 kinase acnivitv in 

cell growth and transformation. 
Result: Interferes wich Ras transformation. 



25 



30 



EXAMPLE 4: Ge neration of Specific Immunoreaaent s r,o 
STE20-Related Protein Kinases 

Materials and Menhods 

Specific im.munoreagents were raised in rabbits aqainst 
KLH- or MJ=i.?-ccnjugated synthetic peptides corresponding to the 
human STE2 0 - related kinases. C-cerminal peptides were 
conjugated to KLH with glun araldehyde , leaving a free C- 
cerminus . Internal pepcides were MA?-ccni ugaced ivith a 
blocked N-terminus. Additional immunoreagencs can also be 
generated by immunizing rabbics v/ich the bacteriallv expressed 
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GST- fusion proteins oon^aining the oytoplasnriic doraains of ea: 

novel STK. 

The various imrrune sera are first tesced for reactivity 
and selectivity to recombinant protein, prior to testing for 
endoQenous sources . 



10 



Western blots 

Proteins in SD3 PAGE are transferred to immobilon 
membrane. The washing buffer is PBST (standard phosphate- 
buffered saline pH 7.4 -r 0.1% triton x 100) . Blocking and 
antibody incubation buffer is PBST ^5% milk. Antibody 
dilutions varied from 1:1000 to 1:2000. 



RESULTS 

15 Three SULUl antisera (against both 539A and 540A; and 

two SULU3 antisera (542A) reacted specifically with the 
peptide antigens. Antisera binding was competable with 
peptide. Experiments with extracts from ceils transfected 
with epitope- tagged SULUl and SULU3 genes are underv;ay. 

20 Antisera against the PAK4 C-terminal peptide 554A 

reacted with purified Gst-PAK4 and detected a protein. of the 
correct molecular v/eight from tissue culture cells. 
Specific immunoprecipi tat ion experiments are ongoing to 
determ^ine the reactivity with native protein. ■ • 

25 Similar im.m.uni za t ion and antisera testing experiments 

are underway for each of the other novel STE2 0 -kinases . 

STE2 0 - related protein kinase peptide immunogens and their 
specificity in recognizing endogenous protein by Western 
blots or immunoorecioi tat ions . 



; Protein ' 

i i 


Sequence 


; Aa 

1 positions 


Con j 


West . 


i 

! 

1 


1 STLK2 


EKFQKC5ADESP 


i 405-416 


KLH 


Y i Y 


1 STLrC4 1 


SISM3ELEPTTDPVGT 


1 252-267 


klh 


Y 


1 -,7 

' L 


j SULUl 1 


ldfpp:edyr 


j 3 5 0-39 8 


KLH 






1 SULUl 1 


HGDPRPEPRPTQ 


! 4 0 9-420 


KLH 


Y 1 Y 
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! SULU3 




1 2 - 


- 1 a 


ITT 




1 SULU3 




1 411 


- 4 9 Q 






PAK4 


CLVPLIQLYRKQTSTC 


o 6 o 


- A A n 

O O vj 






PAK5 


PLMRQNRTR 


1 390 


-398 


KLK 


1 Y i V 


PAKE- 


SGDRRRAGPEKRPKSS 




148 


-163 


KLH 


! V ! V 
1 - 1 - 


PAK5 


( C ) xRRKSLVGTPYWMAPE 


471 


-485 


KLH 


! Y 1 ND 



3TS20 -related protein kinase GST fusion prccein 
immunogens and their specificity in recognizing endogenou: 
protein by Wescern blots or imnrtunoprecipicat ions . 



i Protein 


j domain 
i 


Aa 

posi t ions 


W e :=) c . 


\ i 


i ZCl 


j Coiled-coil/pro/5/C 


350-367 


Y 




! ZCl i B 


615-732 


Y 


Y j 


; ZC2 


! Coiled-coil /pro/B 


348-762 1 


ND 


ND ' 


1 ZC2 


i B 


6 5 S - 7 6 2 


Y 


Y 1 


j PAK4 


1 . . . Nterm 


.252-426 1 


ND 


ND i 


1 PAK4 


j Kinase/Cuerm 


350-631 


ND 


\' ' 


j PAK5 


1 A/ Nterm 


53-330 i 


ND 1 


ND 1 


! PAKE 


1 A/Nterrr. 


53-309 ; 


ND j 


ND i 



10 



15 



20 



The 50kD STLK2 protein was expressed highlv in several 
hematopoietic cell lines including Jurkac, pGDlO, Ramos, 
A20, WEHl-231, K562, HEL and freshly isoiatied thymocytes 
from C57/EL6 m.ice . High levels of STLK2 expression were 
aisc detected in several cumor cell lines including Calu6, 
Colo205, LSISO, MDAM231 and A549, 

The IbO kD ZCl procein was detected m Jurkat T cells, 
Colo205, KCTllc, RIE-1, 293T, MDAMB2 2 1 , and SK-MSL23. 

The 170 kD ZC2 protein was detected in 3K-Iviel28 and 
UACC-62. 

Elevated levels of the 64 kD PAKE protein were 
confirm.ed m the breasc cancer cell lines MDA-231 and viCF-7, 
and in the lung cancer cell line A54 9. 
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Sxamole 5 : Recorr.binant Expression and Biological Assav^s 



Macerials and Methods 

Transient Expression of the Sce2 0 - relaced Kinases in 
Mamrr.alian Cells 

The pcDMA expression plasmids (10 i-ig DNA/100 mm plate) 

containing the STE20 - related kinase constructs are 

introduced into 293 cells with 1 ipcfect amine (Gibco 3RL) . 

After 72 hours, the cells are harvested in 0.5 mL 

solubilization buffer (20 nriM HEPES, pH 7.35, 150 miM NaCl , 

10% glycerol, 1% Triton X-IOC, 1.5 mM MgCl., 1 rrJv] EGTA , 2 mT-i 

phenylmethylsulf onyl fluoride, 1 (Ag/m.L aprotinin). Sample 

aliquot s . were resolved by SDS polyacrylamide gel 

electrophoresis (PAGE) on 6% acrylamide/ 0 . 5% bis - acrylamide 

gels and electrophoret ically transferred to nitrocellulose. 

Non-specific binding was blocked by preincubac ing blots in 

Blotco (phosphate buffered saline containing 5% w/v non-fat 

dried milk and 0.2% v/v nonidet P-4C (Sigma)), and 

recomibinant protein was detected using the various' anti- 

peptide or ant i -GST- fusion specific antisera. 

In Vitro Kinase Assavs 

Three days after cransfection wich the STE2 0 - related 
kinase expression contruccs, a 10 cm plate, of 2'93 cells was 
washed with PBS and solubilized on ice with 2 mL PBSTDS 
containing' phosphatase inhibitors (10 mM MaHPO^, pH 7.25, 150 
vm NaCl, 1% Triton X-lOG, 0,5% deoxychol ate , 0.1% SDS, 0.2% 
sodium azide, 1 wM NaF, 1 rnl^A EGTA, 4 niM sodiumi 
orthovanadate , 1% aprcrinin, 5 ug/mL leupeptin) . Cell 
debris was removed by cencrif ugat ion (12000 x g, 15 min, 4 
*^C) and the lysate v/as precleared by two successive 
incubations with 50 uL of a 1:1 slurry of protein A 
sepharose for 1 hour each. One -half mL of the cleared 



for STE2 0 - related Protein Kinases 




BNSDOCID: <WO 9953036A2_IA> 



wo 99/53036 PCT/US99/08I50 

supernarant was reacted v;ith 10 ).iL of pronein A purified 
kinase - speci f ic antisera (generaced from the GST fusion 
prorein or antipeptide ancisera) plus 50 ).iL of a 1 : 1 slurry 
of protein A-sepharose for 2 hr at 4 ^C. The beads were 
5 then washed 2 times in P3STDS , and 2 times in HNTG (20 mM 

HEPE3, pH 7.5/150 mT^ NaCl , 0,1% Tricon X-100, 10% glycerol). 

The immuncpurif led kinases on sepharose beads were 
resuspended in 2 0 )aL HNTG plus 3 0 wM MgCl. , 10 mi^ iMnCl., and 

10 20 uCi [a^"P]AT? (3000 Ci/mmoi). The kinase reactions were 

run for 3 0 min at room temperacure, and sccpped by addition 
of HNTG supplem.ented with 50 rriM EDTA. . The samples were 
washed 6 times in HNTG, boiled 5 m.in in SDS sample buffer 
and analyzed by 6% 3DS-PAGE followed by aut oradiograohy . 

15 Phosphoaminc acid analysis was performed by standard 2D 

methods on ^^?-labeled bands excised fromi the SDS-PAGE gel. 

Similar assays v/ere perform.ed on bacterially expressed 
GST-fusion constructs of the kinases. 

20 ZCl Assay buffer : 2 0 rv^ Tris pH 7 . 4 , 200 rrJ^l NaCl , 0.5 

miM DTT, 3 xvM MgGl2, 0.3 mJ^^ MnCi2 , 100|.iM '^PyATP. 

Substrates: myelin basic protein (MB?) ac 0.2S mtg/miL 
and phosphorylated ZCl peptide RTVGRRNTFIGT- PPYWMAPE at 17 
jaM {bold underlined residue shows site of phosphorylation) . 

-5 At higher concentrations of MgCl., (3 mM) , the activity 

of ZCl (both full-length and recombinant kinase domain) is 
up to 10 -fold greater toivards exogenous substrate MBP. In 
contrast, the autophosphory la t ion and the phosphorylation of 
tne activation loop peptide substrate are both inhibited. 

30 Mn-r-r does not inhibit the autophosphorylat ion and the 

peptide phosphorylation by the tr^uncated kinase domain form.. 
However, both the MBP phosphorylation, Mn + -i- -preferring 
activity AND the autophosphory lat inc , Mg -r-f -oref err inp 
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activity are eliminaced with mutation of the ATP-bindmc 
lysine in ZCl (Lys54Ala) indicating that both activities are 
attributable to the ZCl kinase domain. 

^ SULUl Assay buffer : This buffer is identical to that 

for ZCl, except for 5 mM MgCl2 . Under these conditions, 
ocher STE2C family members (PAK4, ZCl) were inhibited for 
autophosphorylat ion and required reducing the [Mn] cc <0.3 
miM for an efficient autophosphorylat ion reaction. 
'0 Substrates: MBP, phosvicin, or a-casein at 0.2S mg/mL, 

?AK4 . PAK5 Assa y Buffer : 2 0mM Hepes pH 7 . 2 , 130 mM KCl , 
10 mM MgCl2, 1 miM Na?, 20 m.M E -glycerclphospha t e , C.5 vxM 
DTT, 50 |_iM ATP, 0.5 |.iCi '^PyATP. 

Subscrates: MBP at 0.2S mg/mL and peptide substrates 
derived from PAK5 activation loop at 2 . 5 |.iM . 

STLK2 Assay buffer: Similar to that described above, 
except for the inclusion of 5 mM MgCl,, 5 niM MnCl... and 5 fiCi 
20 ''PyATP. 

Trans formation ( PAK exoer im.ent s ) 

Lovy-passage NIK3T3 fibroblasts displaying normal 
morphology (flat, non-ref ractile cellular morphology), as 

25 well as low rates of spontaneous transformation, were used 

in transformation assays. N"IH3T3 cells were maincamed in 
Dulbecco's modified Eagle's m.edium supplemented with 10% 
(v/v) fecal calf serum, penicillin (100 U/mh) and 
streptomycin (IOC U/miL) and kept m an humidified incubator 

30 at' 37 °C and 5% CO.. 

Cells were transfected with DNA- lipid complexes. As 
per manufacturer instructions, 1 ipof ectam.ine was utilized to 
transfect NIK3T3 cells. All trans fee t ions were with equal 

1^^ 
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amounts of plasmia DNA (DNA frorr- the appropria-e expression 
vector v/ithout inserc v;as used to give equivalenc arr.cunts of 
DMA per cransf eccion) . 1 fig of activated allele of H-Ras 
v/as CO- transf ected v/ith increasing amouncs of various 
5 alleles of PAK5 . 

Foci were scored after 3 weeks by -fixing 10 min in 10% 
methanol, 10% acetic acid for 10 min, followed by staining 
with 0.4% (v//v) crystal violet in 10% methanol for 10 min, 
and W'ashing v;ith deionized water and drying at room 
10 temperature. 

Transf ect io ns , stimulations, and luciferase assavs (ZCl 

exoer imenrs ) 

Cells do") were cransiently transfecced by 
15 electroporacion using a Gene Pulser (Bio-Rad Labs) wich the 

setting of 960 _? and 250 V. 20-40 hours later, transfected 

cells (about 10') were stimulated with various stimuli. 

Arter a 6-hour stimulation, cells were lysed, and luciferase 

activities were mieasured using the MicroLum.at Plus (EGScG 
10 Ber::hold) . (J. Exp. Med. 183:611-620, 1996, hereby 

incorporated by reference .herein in its entirety includina 

any drav/ings, tables, or figures.) 



RESULTS 

Protein expression and kinase activity of novel STE20- 
related protein kinases 
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size 


1 Size(kD) 
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50 
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1 


1 STLK4 
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i ZCl 
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FAK4 


30 


! 75 


y 


i y 


?AK5 i 


64 


! 64 


y 


! y i 



ZCl : ReQulation of kinase activity 

ZCl IS constitucively active as a fuli-lengch kinase 
when expressed either in vitro (TNT rabbit reticulocyte 
5 systerr.) or in NIH 3T3, 293T, or H1299 tissue culture cells. 

The endogenously expressed kinase is also active when 
irTLnn.uncprecipitated from carcinoma cell lines. 

ZCl signaling Pathwavs 

'0 Using human leukemic T cell line Jurkat as a model 

system, the impact of cctransf ected wild-type ZCl on the 
activation of two reporter genes, RE/AP-lucif erase and NFkB 
luciferase, was examined, RE/AP is a composite in the IL-2 
gene prom.oter containing both a NFKB-like site and an AP-1 

15 site. 

Optimal activation of both RE/A?- lucif erase and NFkB- 
luclferase reporter genes in Jurkat T cells reouires signals 
generated from stimulation of both T cell receptor and the. 
costimulator receptor CD26. Cotransf ect ion of wild-type ZCl 

20 with either the RE/A? - luci f erase or che NFkB- lucif erase 

reporter results in the activation of RE/AP or NFk3 v/hen 
costimulated with the anti-T cell receptor monoclonal 
antibody or the pharmacological reagents Pf4A and ionomLvcm 
that bypass proximal T cell receptor. No activation was 

25 seen when costimulated with an anti-CD2S monoclonal 

ant ibody . 

These results suggest that v/ild-type ZCl, when 
overexpressed, v/as replacing a CD2 8 - specif ic signal to 
activate RE/AP or NFkE . These results imply that ZCl is 
30 involved in the CD2S signaling pathway. Since MFkE is one 

of the major pathways also activated by the pro- inflammatory 
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cytokine TNr-a signaling, ic is also likely chac ZCl may be 
a componenc in the TN?-a signaling pathways. 

PAK5 : Design of specific peptide subscrates 
5 To aid in the developmenc of in \/itra kinase assays for 

screening small molecule libraries to identify specific 
inhibitors, the search for specific peptide substrates for 
PAK5 was undertaken. 

The rationale used to design such peptides is based on 

10 the hypothesis that upon binding activated small G protein, 

?AK5 undergoes a conformational change that results in 
derepression of its kinase activity followed by 
autophosphorylat ion on the activation loop resulting in a 
fully active kinase. The site of autophosphorylat ion for 

15 related family m.embers has been identified by biochemical 

and/or genetic m.eans (e.g. Wu, C, et al . J.Biol.Chem 
270:15984-15992 and Szczepanowska , et- al . Proc . Nat 1 . Acad . Sci 
94, 8503-8508, 1997). Specific peptide substrates for PAK5 
were designed from the sequence of the activation loop of 

20 this kinase. 

An activation loop ?AK5 peptide phosphorylated on the 
Thr residue of the T?Y motif served as a high-affinity 
substrate for PAK5 . 

25 ?AK5 activation loop peptides as kinase substrates 



Peptide 


Kinase ! 

i 


Sequence 


Aa 


SEQ 
ID 


Kinas 


1 substrate 


1 1 


PAK5 ; 


( C ) RRKSLVGTF YWMAPE ; 


4 7 1 - 4 S 5 


10 2 


PAK5 


i ves 


2 ! 


PAK5 i 


(C) RRKSLVGTPYWyiAPE ] 


471-485 


102 


PAK5 


1 yes 


3 ! 


PAK5 i 


(C) RRKSLVGTPYWMAPE 


4 71-485 


102 


PAK5 


1 no 


j 


KHSl i 


KRKSFIGT? YV; M A P E 1 


171-185 


U77129 


PAK5 


1 yes 


^ ! 


STLK2 ; 


KRNT FVGT ? F WMAPE i 


175-189 


5 


PAK5 


1 poor 


6 ! 


SULUl t 


PANS FVGT ? YWMAPE 


174-188 


22 


PAK5 


j poor 


7 j 


ZCl ■ 


RRMT F I GT? YWMAPE 


184-198 


13 


?AK5 


i poor 


3 i 


ZCl : 


RRNTF I GT? YWMAPE i 


184-198 


13 


PAK5 


! poor 


9 ! 


3TLK4 I 


RNK^/RKTFVGTPCWMAPE 


66-83 


7 


PAK5 


! poor 


10 1 


PAKE- : 


(C) RRKSLVGTPYWNL-.PE \ 


471-485 


102 


?AK4 


! ves 1 



Note: underlined/ bold reside was phosphorylated 
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Pspcide j Kinase 



No c e s 









xrr-ii\D 


c-C[Uci±iy we XI i^lriiD 


2 










b IS cne s 1 1_ e Oi_ 

p n O S p 11 0 x y JL a. i_ X o n 


4 


KHSl 


Similar to peptide 1 




STLK2 




6 


SULUl 




7 


ZCl 




s 


ZCl 


Beccer than 7 ; 


9 


3TLK4 


t 


10 


?AK5 


Same Km as phosoh, by FAK5 \ 



10 



PAK5 : Transformation 

Transf ormacion of lov;-pa3sage NIH3T3 cells by ras in 
the presence or absence of various alleles of PAKE shov/ed 
char che dominant negative, kinase-dead eillele of FAK5 was 
able to block ras transformation of NIH3T3 cells. Th\is , PAK5 
activity is required for ras transformation of NIH3T3 cells. 

Inhibition of ?AK5 activity may have therapeutic value as 
an anti -pi-ol i f erat ive agent for treating cancer. 



15 



20 



PAK4 and PAK5 : interaction with Cdc42 

PA:< 4 interacts w^ith CDC42 small G-protein but not Rac, 
RhoA, or Ras as determined by co- transf ect ion of recombinant 
genes and detection by kinase assays. PAK5 also interacts 
with Cdc42. Coding sequences of activated alleles of small 
G proteins (ras, Cdc4 2, Rac, Rho) tagged v/ith a Myc epitooe 
were transiently expressed in 293T cells, various alleles of 
35£-labeled PAKE- tagged with HA epitope w^ere expressed in 
vitro with the reticulocyte (TNT) system. 
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Examole 6 : Chromosomal Loc alization of 5t:e2 0 -Relared 
Prorein Kinases 

Macerials And Methods 

■ STH20 protein kinases STLK3 , STLK4 , ZCl , ZC2 , ZC3 , 
KHS2, SULUl, PAK4, and PAK5 were mapped using che GeneEridge 
4 Radiation Hybrid Panel, RH02.05 (Research Genetics.;. The 
GeneEridge 4 Panel consists of 91 hybrid panel samples, m 
addicion to one human positive concrol (HFL) , and one 
hamscer negative control (A23). The standard reaction 
conditions used to cesc and conduce PGR reaccicns using che 
GeneEridge 4 Panel are available from Research Genetics. 

Oligonucleotide sequences (all 5' to 3 M used for PGR 
mapping were : 



15 



20 



50 



3TLK3 : CTCCCATTTCCTAGCAJLAATCA , 
STLK4 : CCACACATGCGTATGTGTGTTG ; 

zcl : a.tgcctggatcagactgcttct, 
agatggagtgtactgggaggg , 
catgatgaagtggtgacggg , 
sulul : gaaaacgtggccgtctcttctatt , 
k:-is2 : gaatagcggtaccatgatagaata! 

GTCAGTATTCTCTGCAAAGATTG , 
CATCAGTGGAAGTCTGCAGTG, ■ 



ZC2 
ZC3 



PAK4 
PAK5 



AG AG G GAG TA.T TG T C AG ATGTA 
TTGGTAG.AATTCACATCAGGTACA 
CAAGGTGTTCTTTGCCTCTGTT 
AGAJ=lGAGGACTTGGCACTTATG 

ATTTGTGGTACTGGGATTCTGTG 
TACCAJ=lA_AAGAjGCCAAAAGTGTG 
G ATG TT C T CT CCATTCTG T AAAG 
C AGG TG C A G tag T CAT T TG C 



Positive reactions were assigned a score of ^^1", 
negative reactions are assigned a score of "0", and 
ambiguous reactions are assigned a score of "2''. Results 
v/ere submitted to the Whitehead Institute 

{www©genom.e. wi.mit.edu) for position analysis. Chromosomal 
localizations for ZC4 , SULU3 , STLK2 , STLK5 and 3TLK5 were 
available publicly (for example, from Unigene) . The 
chromosomal locations of GEr:2 and STLK7 have not been 
determined . 



i STLK2 h 


Xq25 -2 7 . 1 


{Publlc^ 1 


1 STLK3 


2q31.3 


'Suoen; 


j STLK4 h 


3p22.3-p22.2 


( Suoen 


! 
1 


i STLK5 h 


17c23 .2-24.2 


(Public. 


I 
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: bTljKb n 


2q 3 ^ , 2 -Q J 3 , o 


( FUP^ 1 C / 


b i ^ K / il 


NA. 






t z p 1 i , z 


( Suoen } 




3q-:iD . oi-3a*^^ - 


( Suqen ) 


Zi'w j» n 


1 /pi3 .^-IJ , o 


{ Sugen ) 






1 r U 0 1 X C / j 




0-»-^00 Ov-\00 O 


(Sugen) | 




1 ^ Cjz 'a . -^i 1 


(Sugen) \ 


I— ' J—' W O 11 




\ tr Li-h^' X J. w y 


GEK2 h 


NA 




; ?AK4 h 


15ql4 


(Sugen) 


?AK5 h 


19qi3 . 2-ql3 . 3 


(Sugen) 



Many of che STE 2 0 kinases were mapped to regions 
associated v;ich various human cancers, as shov/n belov;. 

The regions were also cross-checked with che Mendalian 
5 Inheritance in r4an database, v;hich tracks genetic 

information for many human diseases, including cancer. 
Preferences fc^r association of the mapped sites v/ich 
chromosomal abnormalities found in human cancer can be found 
in: Knuutila, et al . , Am J Pathol, 1993, 152:1107-1123, 
1(J hereby incorporated herein be reference in its entirety 

including any figures, tables, or drawings. Associacion of 
these mapped regions with other diseases is documented in 
the Online Mendalian Inheritance in Man 
(OMIM) ( httP: / /wwv;.ncbi . nl m . ni h . oov/ ht bin - oos t /Omim j . 

15 

STLK2_ji . Xq25-27.1, (Public) 
Osceosarcom.a , Xq25-qter, 2 of 31, 

Lym.phoprolif erative syndromie , X- linked (OMIM No. 3 0S24 0) 
human STLK3 . 2q31.3, (Sugen) 
20 Squamous cell carcinom.a o*f Head and Neck, 3 of 30. 



STLK4_h . 3p22 . 3 -p22 . 2 , (Sugen) 

Mancle cell lymphoma 3pl4-p22 1 of 27 

Squamious cell carcinom.a of Head and Neck 3p2 2-p24 1 of 14 
25 Cardiomyopathy, dilaced (OMIM 501154) 
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STL.K5_h , i7q23 . 2 -24 . 2 , (Public) 
Cervical cancer, i7q, i of 30 

Gascroesophageal junction adenocarcinorr.a xenograft, 17q, 1 
5 of 5 

Breast carcinoma, 17ql2-qter, 1 of 16 
Bladder carcinoma, 17q22-q23, i of 14 
Breast carcinoma, 17q2'2-q2 5, 8 of 101 
Non-srr.all cell lung cancer, 17q24-q25, 6 of 50 
10 Testis, i7q24-qter, 2 of 11 

Malignant peripheral nerve sheath tumors, 17q24-qter, 5 of 7 
Alzheimer disease, susceptibility to (OMIM 106130) 



15 



^£_h, 2q32.2 -q33.3, (Public) 
Non- small cell lung cancer, 2q31-q32, 1 of 50 
Squamous cell carcinoma of Head and Neck, 2q31-q33, 3 of 30 
Small cell lung cancer, 2q32-q35, 1 of 22 

ZCl_h, 2pil.2, (Sugen) 
20 non-small cell lung cancer, 2pter-ql3, 1 of lo 

non- small cell lung cancer, 2pter-q21, l of lo 
Pulmonary alveolar proteinosis, congenital (OMIM 173640). 



25 



30 



2C2_h, 3q26 . 31-3q26 . 32 , (Sugen) 

Non-small ceil lung cancer, 3q2 6 . 1 -q2 S . 3 , 26 of 103 

Cervical cancer, 3q26.1-q2 7, 4 of 30 

Small cell lung cancer, 3q26.3-qter, 3 of 35 

Squamous cell carcinoma of Head and Neck, 3q25.3-ccer, 
13 

Marginal zone B-cell lymphoma, 3q2c-c2~, l of 2 5 
Parosteal osteosarcoma, 3q26-q2S, 1 of 1 
Gascroinrescinal scromal tumor, 3q26-q23, l of 16 
Mantle ceil lymphoma, 3q26-q29, l of 5 

\ ^0 
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10 



ZC3_h 17pl3.2-13.3 (Sugen) 

Maligrxant fibrous histiocycoma of soft tissue, 17p, 2 of 5S 

Leiorrivosarcoma , 17p, 7 of 2 9 

Non-small cell lung cancer, 17p, i of 50 

ZC4_h . Xq22, (Public) 

Diffuse large cell lymphoma, Xq22-ter, 1 of 22. 
Deafness, X-linked 1, progressive. (OMIM 304700). 



KHS2_h . 2p22-2p22.2, (Sugen) 
Synovial sarcoma, 2p21-ql4, l_of_D7 
Follicular lymphoma, 2p22-p24, l_of_46 

Colorectal cancer, heredicary, nonpolyposis , t'/oe 1, Ovarian 
15 cancer (MSH2, COCAl , FCCl) , (OMIM 12C435) . 

SULUl__h . 12q24.21 (Sugen) 

Neuroglial cumors, 12q22-qter, l_of__15 

Gastroesophageal junction adenocarcinomLa , 12q23-a-ier, 1 of 
20 5. 

Non-small cell lung cancer, 12q2 4 . 1 - 24 . 3 , 2 of 50. 
SULU3_h 17pll.2 (Public) 

Malignant fibrous hist iocytomia of soft tissue, 17p, 2 of 53 
25 Leiomyosarcoma, 17p, 7_of__2 9 

non-sm.all cell lung cancer, 17p/ l_of_50 . 
Diffuse large cell lymphoma, 17pll.2, l_of.32 
Osteosarcoma, 17pii.2-pl2, 4_of_31 

30 PAK4_h: 15ql4 (Sugen) 

Schizophrenia, (OHIM 113511) . 
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PAK5 h : 19crl5.2-ol3.3 (Sug^n) 

Follicular lymphoma, 19ql3, 1 of 46- 
Mantle cell lymphoma, 19ql3, 1 of 5 
Hepacocel lular carcinoma, 19ql3.1, 2 of 50 
5 Small cell lung cancer, 19ql3.1, 10 of 35 

Breasc carcinom.a, 19ql3 . .1 -qter , 1 of 33 
cervical csincer , 19ql3 , 1 -qter , 1 of 30 
Testis, 19ql3 . 1-qter, 1 of 11 
Chondrosarcoma, 19ql3.2, 1 of 2 9 
10 Malignant fibrous hisciocytoma of soft tissue, 19ql3.2-qcer 

2 of 53 

Non-small cell lung cancer, 19qcen-ql3 . 3 , 6 of 104 



15 



Exam.ole / : Demonstration Of Gene Amol i f ica t ion Sv 
Sou c hern Blor. t ing 



Materials and Merhoda 

Mylon mtemjDranes v;ere purchased from. Boehrinaer 
Mannheim;, Denaturing solution contains 0.4 M NaOH and 0.6 M 
NaCl. Neucralization solution contains 0.5- M Tris-HCL, pH 

'^-^ ^ ^^'^ 1.5 M NaCl . Hybridization solution contains 50% 

formamide, GX SS?E, 2 . 5X Denhardt ' s solution, 0.2 mg/mL 
denatured salm.on DNA, 0.1 mg/miL yeast tRNA, and 0.2 % sodium 
dodecyl sulfate. Restriction enzym.es were purchased from 
Boehringer Mannheim. Radiolabeled probes v/ere prepared 

25 using the Prime-it II kit by Stratagene. The beta act in DNA 

fragmtent used for a probe template was purchased from; 
Clontech. 

Genom.ic DNA was isolated from^ 2 0 different tumor cell 
lines: MCF-7, MDA-MB-231, Calu-6, A549, HCT-15, HT-29, Colo 
30 205, LS-180, DLD-1, HCT-116, ?C3 , CAPAIxT-2, MIA-?aCa-2, PANC- 

1, AsPc-l, BxPC-3, OVCAR-3, SKOV3 , Sw 626 and ?A-1, and from 
two norm-al cell lines: hum.an mjamtm.ary epithelial cells and 
human umbilical vein endothelial cells. 
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A 10 ug aiiquoc of each genomic DNA sarr.pl e was digested 

with EcoR I r-estriccion enzyme and a separate 10 ug samole 
was digested wich Hind III restriction enzyme. The 
rescriccion-diges t ed DMA samples were loaded onto a 0.7% 
agarose gel and, follov;ing elect rophoret ic separacicn, the 
DNA v.'as capillary- transferred to a nylon membrane bv 
standard m.ethods (Sambrook, J. et al (1989) Molecular 
Cloning: A Labcratory Manual, Cold Spring Harbor 
Laboratory) . • 



PAK5 Amolicon : 

A 60 0 base pair fragment {EcoR I - Sac I) of che PAK5 
gene v/as ,used as a template for a radiolabeled BNA probe 
which v/as hybridized to the blots at 42 ^C for 43 hours in 

15 hybridization solucion using standard methods (supra) . The 

blots were exposed to a phosphorimager screen' for 4 days/ 
then scanned and analyzed using a Molecular Dynamacs Storm 
840 phosphorimager. The relative mass and gene copy number 
values of the PAK5 DNA fragments were calculated from the 

20 band density values obtained. The blots were re-hvbridized 

v;ith a radiolabeled probe copied from a fragment of hum^an 
beta actin DMA and developed as above to confirm the sam^ple 
mass loading equivalency. 

RESULTS 

-5 The PAK5' gene was determined to exhibit 3 -fold 

amplification compared to the normal DMA copy num.ber in 
PANC-1 (pancreatic epithelioid carcinoma) and OVCAR-3 
(ovarian adenocarcinom:a ) humtan cell lines, and approximately 
2 times -ihe norm.al copy number in the 3x?C-3 (primiarv 

30 pancreatic adenocarcinomia ) hum.an cell line. 

Sim.ilar Southern analyses can be performied for other 
STE20 kinases. 
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r;xamole S: Detection Of r-rocem- ir-rotein mceracrion 
Through Phage Display 

Materials And Meuhods 

5 Phage display provides a mechod for isolating molecular 

inceraccions based on affinity for a desired bait. cDNA 

fragmencs cloned as fusions to phage coat proteins are 
displayed on the surface of the phage. Phage (s) interacting 

v/ith a bait are enriched by affinity purification and the 

10 insert DMA from individual clones is analvzed. 



T7 Phage Disolav Libraries 

All libraries were construcced in the T7Selectl-lb 
veccor (Movagen) according co the manuf accurer ' s direccions. 

15 

Baic Presentation 

Protein domains to be used as baics were generated as 
C-"erminal fusions to GST and expressed in E. coli. 
Peptides v/ere chemically synthesized and biocinylated at the 
20 N-cerminus using a long chain spacer biotin reagent. 

Selection 

Aliquocs of refreshed libraries (lO^'-lO'- pfu; 
supplemented v/ith PanMix and a cocktail of E, coli 
inhibitors (Sigmia P-3465) were incubaced for 1-2 hrs at room. 
25 temperature wich rhe " imim.obilized bains. Unbound phage was 

ex-ensively washed (at least 4 cimes) with wash buffer. 

After 3-4 rounds of selection, bound phage v/as eluted 
in 100 ^iL of 1% SD5 and pla::ed on agarose plaiies zo obtain 
single plaques. 

30 

Identification of inserc DNAs 

Indian dual plaques v/ere picked inco 2 5 uL of 10 rr^A EDTA 
and che phage was disrupced by heacing at 70 for 10 min. 

1^1M 
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2 uL of the disrupced phage v;ere added co 5C uL PGR 
reaccion mix. The insert DNA was amplified by 35 rounds c-f 
thermal cycling (94oC, 50sec; 50oC, Imin; 72cC, Imin) . 

5 Composition of Buffer 

lOx PanMix 
5% Triton XlOO 

10% non-fat dry milk (Carnation) 
10 mM EGTA 
10 2 50 rriiM NaF 

2 50 Lig/rriX. Heparin (sigma) 

250 ug/mL sheared, boiled salmon sperm DNA (sigma) 
0.. 05% Na -azide 
Prepared in PBS 

15 

Wash Buffer 

PBS supplemented v/ith: 

0.5% MP- 4 0 

2 5 |L.il g/mL heparin 

20 

PGR reaction mix 

-•^ lOx PGR buffer ( Perkrn-Elmer , v^ieh 15 w^i Mg) 

0.2 mL each dNTPs (10 rvM stock) 

^ • ^ T7U? primer (15 pmol/|.iL) GGAGCTGTCGTATTCCAGTC 

^-^ T7DN prim.er (15 pm.ol/uL) AACCCCTCAAGACCCGTTTAG 

0.2 mL 25 m^M MgCl, or MgSO, to compensate for EDTA 

Q.S. to 10 mL v;ith distilled water 
Add 1 .unit of Taq polymerase per 50 uL reaction 

30 LIBRARY : T7 Selectl-H441 
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RESULTS 



Phage display bales and inceractors 



1 Bale 
i 


Domain 


Aa 


Patenc 
SEQ ID 


CDNA 
library" 


Interaccor j Sequence 
j Range 
1 Sc SEO ID 


i SULUl 
1 


Coiied- 
coil2 


752-S96 


22 


H44 1 


GEK2 cc dom (1) 


677-820 
SEQ #26 


i SULU3 ! Coiled- 
i 1 ccil2 


755-39S 


23 


K441 


SLK isofcrm 


M33780 i 



10 



15 



20 



25 



30 



iateracned ro a lesser extent: with the coiied-coil domain of an SLK 
isof orm . 

The phage display daca suggest potencial inceracnions 
of SULU3 wich SLK and SULUl with GEK2 through their coiled- 
coil - domains. ■ ■ Therefore two members of che SuLU subfamilv 
of STE20 kinases interact wich two members of a separate 
STE20 family, the prototype being SLK. 

These results suggest a specificity in the interaction, 
and imply thac these STE20 kinases may interact with each 
other 'through hom.o- and heterc-dimerizatxon . Alternatively 
SULU-related kinases could acc immediately up- or dov/n- 
stream. .of the SLK-related kinases in a signaling cascade. 

One skilled in the arc would readily appreciate that 
the present invention is well adapted to carry out the 
objeccs and obtain the ends and advantages mentioned, as 
well as those inherent therein. The molecular complexes and 
the methods, procedures, creatm.ents, molecules, specific 
compounds described herein are presently representative of 
prererred embodimencs are exemplary and are not intended as 
-imitations on che scope of the invention. Changes therein 
and other uses will occur to those skilled in the art which 
are encompassed within the spirit of the invention are 
defined oy the scope of the ciaim.s. 

It will be readily apparent to one skilled in the art 
tnat varying substitutions and modifications may be made to 
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the invention disclosed herein without departing from the 
scope and spirit of the invention. 

All patents and publications mentioned in the 
specification are indicative of the levels of those skilled 
in the art to which the invention pertains. 

The invention illustratively described herein suitably 
may be practiced in the absence of any elemient or elements, 
lim.itation or lim.itations which is not specificallv 
disclosed herein. Thus, for example, in each instance 
herein any of the terms "comprising", "consisting 
essentially of" and "consisting of" may be replaced with 
either of the other two terms. The terms and expressions 
which .have been employed are used as terms of descriotion 
and not of lim.itation, and there is no intention that m the 
use of such term;s and expressions of excluding any 
equivalents of the features shown and described or portions 
thereof, but it is recognized that various modifications are 
possible v/ithin the scope of the invention claimied. 

In particular, although som.e formulations described 
herein have been identified by the excipients added to the 
formulations, the invention is meant, to also cover the final 
formrulation form.ed by the combination of these excipients. 
Specifically, the invention includes formulations in "which 
one to all of the added excipients undergo a reaction during 
2^ formulation and are no longer present in the final 

f orm^ulation, or are pre-'ent m modified forms. 

In addition, v/here features or aspects of the invention 
are described in terms of Markush groups, those skilled in 
the art will recognize that the invention is also thereby 
described m terms of any individual member or subgroup of 
members of the Markush group. For example, if X is 
described as selected from the group consisting of brom.ine, 



20 



30 



BNSDOCID: <WO 9953036A2JA> 



wo 99/53036 PCT/US99/08 1 50 

chlorine, and iodine, claims for X being bromine and claims 
for X being broimne and chlorine are fully described. 

Other embodiments are v/ichin the followinq claims. 
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CLAIMS 

1. An isolaced, enriched, or purified nucleic acid 
molecule encoding a kinase polypeptide selected from the 
group consisting of 3TLK2 , STLK3 , STLK4 , STLIKB , STLK6 , 
STLK7, ZCl , ZC2, ZC3 , ZC4 , KHS2 , SULUl , SULU2 , GE?:2 , PAK4 , 
and ?AK5 . 

2. The nucleic acid molecule of claim. 1, wherein said 
nucleic acid molecule comprises a nucleoside sequence that: 

(a) encodes a polypeptide comiprising rhe amiino acid 
s^qy.^nc'- 'set forch in SEQ -ID NOrS, SEQ ID NO : 6 , SEQ ID NO : 7 , 
SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO : 1 5 , SEQ ID NO 1 S , SEQ 
ID NO: 22, SEQ ID NO : 2 3 , SEQ ID NO : 2 4 , SEQ ID NO : 2 9 , SEQ ID 
NO: 31, SEQ ID NO: 97^ SEQ ID NO: 99, SEQ ID NO : 1 0 1 , SEQ ID 
NO: 103, or SEQ ID NO: 10 7; 

(b) is che complem.ent of the nucleotide sequence of 

( a } ; 

(c) hybridizes under highly stringenc conditions co 
the nucleotide molecule of (a) and ^ncad^s a nacurally 
occuirring kinase polypeptide ; 

(d) encodes a kinase polypeptide having the amiino acid 
sequence of SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID NO : , SEQ ID 
NO: 13, SEQ ID NO: 14, SEQ ID NO : 1 5 , SEQ ID NO : 1 £ , SEQ ID 

NO: 22, SEQ ID NO : 2 3 , SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 
NO: 31, SEQ ID NO: 97, SEQ ID NO: 99, SEQ ID NO: 103, or SEQ ID 
NO: 107, except chac ir lacks one or more, but noc all, of 
the follcv;ing segm.encs of. am.ino acid residues: 1-21, 22-274, 
or 2 75-416 of SEQ ID NO : 5 , 1-31, 32 -3 OS, 309-439 or 490-516 
of SEQ ID N0:6, 1-17S or ,179-414 of SEQ ID NO : 7 , 1-22, 23 - 
239, 290-526, 527-640, 641-395, or 397-1239 of SEQ ID NO : 1 3 , 
1-255, 256-442, 443-626, 627-954, or 955-1297 of SEQ ID 
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NO:14, 1-255, 256-476, 477-6S0, 631-933, or 9S4 -1326 oi: SEQ 
ID NO:i5, 1-13, 14-273, 274-345, 347-534, or 535-894 of SEQ 
ID NO:lS, 1-21, 22-277, 27S-427, 423-537, 6 3 3-751, or 752- 
393 of SEQ ID NO:22, 1-66, 67-215, 216-425, 426-539, 540- 
736, or 787-887 of SEQ ID NO:23, 1-25, 26-273, 274-422, 423- 
532, or 533-743 of SEQ ID KT0:24, 1-51, 52-224, 225-393, 394- 
65S, or 659-6S1 of SEQ ID NO : 2 9 , 1-25, 26-281, 232-430, 431- 
640, 641-754, 755-901, or 902-1001 of SEQ ID NO : 3 1 , 1-10, 
11-321, or 322-373 of SEQ ID NO:97, 1-57, 58-369, or 370-413 
of SEQ ID MC:99, 1-52, 53-173, 174-307, 308-572, or 573-591 
of SEQ ID MO:103, or 1-33, 34-294, 295-337, 338-472, 473- 
724, or 72 5-968 of SEQ ID N'C:107; 

(e) IS the complenfien- of che nucleocide sequence of 

(d) ; 

(f) enzodes a polypeptide having the amino aoid 
sequence set forth in SEQ ID NO : 5 , SEQ ID NO : 5 , SEQ ID NO : 7 , 
SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO : 1 5 , SEQ ID NO: 18, SEQ 
ID NO: 22, SEQ ID NO : 2 3 , SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 
NO:31, SEQ ID NO:97, SEQ ID NO : 9 9 , SEQ ID NO:103, or SEQ ID 
NO:107 from amino acid residues 1-21, 22 -274, or. 275-416 of 
SEQ ID NO: 5, 1-31, 32-303, 309-489 or 490-516 of SEQ ID 
N0:6, 1-173 or 179-414 of SEQ ID NO : 7 , 23-289, 290-526, 527- 
540, 641-896, or 897-1239 of SEQ ID N0:13, 1-255, 256-442, 
443-625, 627-954, or 955-1297 of SEQ ID NO:14, 1-255, 256- 
476, 477-6SC, 631-983, or 934-1326 of SEQ ID NO : 1 5 , 1-13, 
14-273, 274-346, 347-534, or 535-894 of SEQ ID MO : 1 3 , 1-21, 
22-277, 273-427, 428-637, 633-751, or 752-393 of SEQ ID 
NO:22, 1-66, 67-215, 216-425, 426-539, 540-736, or 787-837 
Of SEQ ID MO:23, 1-25, 26-273, 274-422, 423-532, or 633-743 
of SEQ ID NO:24, 1-51, 52-224, 225-393, 394-65S, or 659-631 
of SEQ ID NO:29, 1-25, 26-231, 232-430, 431-640, 641-754, 
755-901, or 902-1001 of SEQ ID NO : 3 1 , 1-10, 11-321, or 322- 
373 of SEQ ID NO:97, 1-57, 53-369, or 370-418 of SEQ ID 

^00 
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NO:99, 1-52, 53-173, 174-307, 30S-572, or 573-591 of SEQ ID 
NO:i03, or 34-294, 295-337, 338-472, 473-724, or 725-963 of 
SEQ ID NO : 107 ; 

(g) is the complement of the nucleotide sequence of 

( f ) ; 

(h) encodes a polypeptide comprising the ammo acid 
sequence set forth in SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID NO : 7 , 
SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 13, SEQ 
ID NO: 22, SEQ ID NO : 2 3 , SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 
NO: 31, SEQ ID NO: 97, SEQ ID NO : 9 9 , SEQ ID NO: 101, SEQ ID 
NO: 103, or SEQ ID NO: 107, except thac it lacks one or more, 
bur noc all, of the domains selected from, che group 
consisting of a C-terminal domain, a catalytic domiain, an N- 
terminal dom.ain, a spacer region, a proline-rich region, a 
coiled-coil structure region, and a C- terminal tail; or 

(i) 13 che ccmiplement of the nucleotide sequence of 

(h) . 

3. The nucleic .acid molecule of claim 1, further 
comprising a vecror or promioter effective to initiate 
transcription in a host cell. 



4. The nucleic acid molecule of claimt 1, '.vherem said 
nucleic acid molecule is isolated, enriched, or purified 

25 from a m.ammial . 

5. The nucleic acid molecule of claim 4, wherein said 
mam.mal is a human. 

nucleic acid probe for the decection of nucleic 
acid encoding a kinase polypeptide in a sample, v/herein said 
polypeptide is selected from, the group consisting of STLK2 , 



^0/ 
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STLK3, STLK4 , STLK5 , STLK6 , STLPC? , ZCl , ZC2 , ZC3 , KHS2 , 
SULUl, SULU3, GEK2, ?AK4 , and PAK5 . 



7. The probe of claim 6, wherein said polyoepcide is 
5 a fragmenc of the protein encoded by the amino acid sequenc: 

set forth in 3EQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID NO : 7 , SEQ ID 
NO: 15. SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO : 1 3 , SEQ ID 
NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 
NO:31, SEQ ID NC:97, SEQ ID NO:9S, SEQ ID NO:lCl, SEQ ID 
10 NO: 103, or SEQ ID NO: 107. 

S. A recombinant cell comprising a nucleic acid 
molecule encoding a kinase polypeptide seiecced from the 
group consisting of 3TLK2 , "STLK3, STLK4 , STLK5 , STLK6 , 
15 3TLK7, ZCl, ZC2, ZC3 , PCHS2 , SULUl , SULU3 , GEK2 , ?AK4 , and 

PAK5 . 



9. The ceil of claim 3, wherein said polyoeocide is a 

fragment of the protein encoded by the amino acid sequence 

set forth in SEQ ID NO : 5 , SEQ ID NO : 6 , SEQ ID NO : 7 , SEQ ID 

NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO : 1 S , SEQ ID 

NO: 22, SEQ ID NO : 2 3 , SEQ ID NO: 24, SEQ ID NO : 2 9 , SEQ ID 

NO: 31, SEQ ID NO: 97, SEQ ID NO: 99, SEQ ID NO : 1 0 1 , SE( 
NO: 103, or SEQ ID NO:. 107. 



20 



25 



30 



- W -Li-, 



10. A.n isolated, enriched, or purified kinase 
polypeptide selected from the gi-oup consiscing of STLK2 , 
STLK3, STLK4, STLK5 , STLK6 , STLK7 , ZCl , ZC2 , ZC3 , ZC4 , KHS2 
SULUl , SULUS , GEP.:2 , ?ArC4 , and PA.:<5 . 

11. The polypeptide of claim 10, wherein said 
polypeptide is a fragment of zhe procein encoded bv the 
amino acid sequence set forth in SEQ ID NO : 5 , SEQ ID NO : 5 , 
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£EQ ID NO: 7, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO : 1 5 , SEQ 

ID NO: IS, SEQ ID NO: 22, SEQ ID NO : 2 3 , SEQ ID NO : 2 4 , SEQ ID 
NO: 29, SEQ ID NO : 3 1 , SEQ ID NO: 97, SEQ ID NO : 9 9 , SEQ ID 
NO:101, SEQ ID NO:103, SEQ ID NO:105, or SEQ ID NO:107. 

5 

12. The polypeptide of claim 10, wherein said 
polypeptide comprises: 

(a) the amino acid sequence set forth in SEQ ID NO : 5 , 
SEQ ID NO: 6, SEQ ID NO : 7 , SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID 

10 NO: 15, SEQ ID NO: 13, SEQ ID NO : 2 2 , SEQ ID NO : 2 3 , SEQ ID 

NO: 24, SEQ ID NO : 2 9 , SEQ ID NO : 3 1 , SEQ ID NO: 97, SEQ ID 
NO: 99, SEQ ID NO: 101, SEQ ID NO: 103, SEQ ID NO : 1 0 5 , or SEQ 
ID NO : 107 ; 

(b) the amino acid sequence sec forth rn SEQ ID NO : 5 , 
15 SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID 

NO: 15, SEQ ID NO: IS, SEQ ID NO : 2 2 , SEQ ID NO : 2 3 , SEQ ID 
NO: 24, SEQ ID NO : 2 9 , SEQ ID NO : 3 1 , SEQ ID NO: 97, SEQ ID 
NO: 99, SEQ ID NO: 103, SEQ ID NO : 1 0 5 , or SEQ ID NO : 1 0 7 , 
except that it lacks one or more, but not all, of- the 

20 following segments of amdno acid residues: 1-21, 22-274, or 

275-416 of SEQ ID NO : 5 , 1-31, 32-30S, 309-489 or 490-515 of 
SEQ ID NO:6, 1-17S or. 179-414 of SEQ ID'NO:7, 1-22, 23-2S9, 
290-526, 527-640, 641-S96, or 397-1239 of SEQ ID NO : 1 3 , 1- 
255, 256-442, 443-626, 627-954, or 955-1297 of SEQ ID NO:14, 

25 1-255, 256-476, 477-6SC, 5S1-983, or 984-1326 of- SEQ ID 

N0:15, 1-13, 14-273, 274-345, 347-534, or 535-894 of SEQ ID 
NO:18, 1-21, 22-277, 278-427, 423-637, 638-751, or 752-S9S 
of SEQ ID NO:22, 1-66, 57-215, 216-425, 426-539, 540-786, or 
7S7-S87 of SEQ ID NO : 2 3 , 1-25, 26-273, 274-422, 423-632, or 

30 633-743 of SEQ ID NO : 2 4 , 1-51, 52-224,' 225-393, 394-658, or 

659-631 of SEQ ID NO : 2 9 , 1-25, 26-281, 232-430, 431-640, 
641-754, 755-901, or 902-1001 of SEQ ID NO : 3 1 . 1-10, 11-321, 
or 322-373 of SEQ ID NO:97, 1-57, 5S-369, or 370-418 of SEQ 
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IDi\TO:95, 1-52, 53-173, 174-307, 30S-572, or 5~3-591 of SEQ 
ID MO:103, 1-24, 25-2S9, 290-397, 39S-62S, 623-56=, 559-372, 
or 873-1227 of SEQ ID NO:105, or 1-33, 34-2S4, 295-337, 338- 
472, 473-724, or 725-968 of SEQ ID N0:1C7; 

(c) the amino acid sequence set forth 3.n SEQ ID NO : 5 , 
SEQ ID MO: 5, SEQ ID MO : 7 , SEQ ID MO: 13, SEQ ID MO: 14, SEQ ID 
MG:15, SEQ ID NO:lS, SEQ ID NO:22, SEQ ID MO : 2 3 , SEQ ID 
MO:24, SEQ ID NO : 2 9 , SEQ ID NO : 3 1 , SEQ ID NO : 5 7 , SEQ ID 
NO: 99, SEQ ID NO: 103, SEQ ID MO: 105, or SEQ ID MO: 107 from 
amino acid residues 1-21, 22-274, or 275-416 of SEQ ID NO : 5 , 
1-31, 32-308, 309-4S9, or 490-516 of SEQ ID MO : 6 , 1-178 or 
179-414 of SEQ ID NO : 7 , 23-289, 290-525, 527-640, 641-896, 
or 397-1239 of SEQ ID NO:13, 1-255, 256-442, 443-625, 527- 
954, or 955-1297 of SEQ ID'n0:14, 1-255, 256-476, 477-680, 
581-983, or 934-1326 of SEQ ID N0:15, 1-13, 14-273, 274-346, 
347-534, or 535-894 of SEQ ID NO:18, 1-21, 22-277, 278-427, 
428-637, 638-751, or 752-893 of SEQ ID NO:22, 1-56, 67-215, 
216-425, 425-539, 540-786, or 787-887 of SEQ ID MO:23, 1-25, 
26-273, 274-422, 423-632, or 633-748 of SEQ ID MO:24, 1-51, 
52-224, 225-393, 394-658, or 659-681 of SEQ ID NO:29, 1-25, 
26-231, 2S2-43C, 431-640, 641-754, 755-901, or 902-1001 of 
SEQ ID MO:31, 1-10, 11-321, or 322-373 of SEQ ID NO:97, 1- 
57, 58-369, or 370-413 of SEQ ID MO : 9 9 , 1-52, 53-173, 174- 
307, 308-572, or 573-591 of SEQ ID NO:103, 1-24, 25-239, 
290-397, 393-628, 629-663, 669-372, or 873-1227 of SEQ ID 
MO:105, or 34-2S4, 295-337, 338-472, 473-724, or 725-963 of 
SEQ ID NO: 107; or 

(d) the amino acid sequence set forth in SEQ ID NO: 5, 
SEQ ID NO: 6, SEQ ID NO : 7 , SEQ ID NO : 8 , SEQ ID NO : 1 3 , SEQ ID 
MO: 14, SEQ ID MC : 1 5 , SEQ ID NO : 1 6 , SEQ ID MO: 18, SEQ ID 
NO: 22, SEQ ID MO: 23, SEQ ID NO: 24, SEQ ID MO : 2 9 , SEQ ID 
MO: 31, SEQ ID MO: 97, SEQ ID NO: 99, SEQ ID NO : 1 0 1 , SEQ ID 
NO: 103, SEQ ID MC:105, or SEQ ID NO: 107, except tha" ic 

2 OH 
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lacks one or more, but not all, of the domains seleccec from 
the group consisting of a C-terminal domain, a catalvcic 
domain, an N- terminal domain, a spacer region, a proline- 
rich region, a coiled-coil structure region, and. a C- 
terminal tail . 

13. The kinase polypeptide of claim 10, v/herein said 
polypeptide is isolated, purified, or enriched from a 
m.amm.ai . 

14. The kinase polypeptide of claim 13, wherein said 
mamm£il is a human. 



15. The kinase polypepcide of claim 10, v;herein said 
15 polypeptide is a STLK2 , STLK3 , STLK4 , STLK.B, STLK6 , 3TLK7 

polypeptide . 

16. The kinase polypepcide of claim IG, wherein said 
polypeptide is a ZCl , ZC2 , ZC3 , or 2C4 polypepcide ■ 

20 

17. The kinase polypepcide of claim. 10, wherein said 
polypeptide is a KH52 polypeptide. 

IS. The kinase polypepcide of claim. 10, wherein said 
2^ polypepcide is a SULUl or SULU3 polypeptide. 

19. The kinase polypeptide of claim 10, v/herein said 
pol\^eptide is a GEK2 polypepcide. 

20. The kinase polypepcide of claim 10, v;herein said 
polypeptide is a ?AK4 or PAK5 pol voect ide . 



5.05 
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21. An antibody or antibody fragment having specific 
binding affinity co = kinase polypeptide selected frorr: the 
group consiscmq of STLK2 , STLK3 , STLK4 , STLK5 , 3TLK6 , 
STLK7, ZCl, ZC2, ZC2 , 2C4 , KKS2 , SULUl , SULU3 , GEK2 , ?AK4 , 
5 and PA.K5 or a kinase domain polypeptide selected from the 

group consisting of STLK2 , STLK3 , STLK4 , STLK5 , STLK6 , 
STLK7, ZCl, ZC2 , ZC3 , ZC4 , IvHS2 , SULUl , SULU3 , GEK2 , PAK4 , 
and PAK5 . 

10 22. A hybridoma v/hich produces an ancibody having 

specific binding affinity to a kinase polypeptide selected 
from the group consisting of STLK2 , STLK3 , STLK4 , STLK5 , 
STLK6, STLK7, ZCl, ZC2 , ZC3 , ZC4 , KKS2 , SULUl , SULU3 , GEK2 , 
PAK4 , and PAK5 . 

15 

23. A method for identifying a substance that 
modulates kinase activity comprising the steps of : 

(a; contacting a kinase polypeptide selected from, the 
group consisting of STLF:2 , STLK3 , STLK4 , STLK5 , STLK6 , 
20 STLK7, ZCl, ZC2 , ZC3 , ZC4 , KHS2 , SUI.U1 , 3ULU3 , GEK2 , PAK4 , 

and PAKE with a test substance; 

v'b) m.easuring the activity of said polypeptide; and 
(c) determiining v/hether said substance miodulates the 
activity of said polypeptide. 

25 . 

24 . A method for identifying a substance that 
m.cdulates kinase activity in a ceil comprising the steps of: 

(a) expressing a kinase polypeptide in a cell, v/herein 
said polypeptide is selected from, the group consisting of 
STLK2 , STLK3 , STLK4 , STL?:5 , STLK6 , STLK7 , ZCl , ZC2 , ZC3 , 
ZC4 , ¥1~IS2 , SULUl, SULu3 , GE?:2 , PAK4 , and PAKE; 

(b) adding a test substance to said cell; and 
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(c) monitoring a change in cell phenocvoe cr the 
interaction between said polypeptide and a natural binding 
partner . 



:> 25. A method for treating a disease or disorder bv 

adminiscering . to a patient in need of such treatment a 
substance that modulates the activity of a kinase selected 
from the group consisting of STLI-:2 , STLK3 , 3TLK4 , STLK5, 
STLK5, STLK7, 2C1 , ZC2 , ZC3 , 2C4 , KHS2 , SULUl , SULu3 , GEK2 , 
10 ?AK4 , and PAK5 . 

26. The miethod of claims 25, wherein said disease or 
disorder is selected from, the group consisting of immune- 
related diseases and disorders, organ transplantation, 
15 ■ myocardial infarction, cardiovascular disease, stroke, renal 
failure, oxidative s tress - related neurodegenerative 
disorders, and cancer. 



27. The method of claim 25, wherein said substance 
20 modulates kinase activity in vitjrc. 



ma: 



The method of claim 27, v/herein said substance is 
inhibitor 



29. A m^ethcd for detection of a kinase polypeptide in 
a sample as a diagnostic tool for a disease or disorder, 
wherein said m.ethod com.Drises: 

(a) contacting Sciia sample v;ith a nucleic acid orobe 
which hybridizes under hybridization assay conditions to a 
nucleic acid target region of a kinase polypeptide selected 
from the group consisting of STLK2 , 3TLK3 , STLK4 , STLK5 , 
STLK6, STLK7,. ZCl, ZC2 , 2C3 , ZC4 , KKS2 , SULUl , SULU3 , GSK2 , 
?AK4 , and PAK5 , said probe comiprising the nucleic acid 
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sequence encoding said polypeptide, fragmencs -herecj:, or 
-he complements of said sequences and fragrriencs; and 

(b) detecting the presence or amount of the 
probe: target region hybrid as an indication of said disease. 

30. The m.echod of claim 29, wherein said disease or 
disorder is selected from the group consiscing of imm.une- 
related diseases and disorders, organ transplantation, 
myocardial infarction, cardiovascular disease, stroke, renal 
rai^ure, oxidative scress - related neurodegenerative 
disorders, and cancer. 



20 



31. A method for dececnion of a kinase pclypeotide in 
a sam.ple as a diagnostic tool for a disease or disorder, 
1^ wherein said m:echod comiprises: 

;a) com.paring a nucleic acid target region encoding 
said kinase polypeptide in a sam^ple, wherein said kinase 
polypeptide is selected from che group consis-ing of STLK2 , 
STLK3, STLK4, STLK5 , 3TLK6 , STLK7 , ZCl, ZC2 , ZC3 , ZC4 , ?:HS2 , 
SULUl, SULU3, GEK2, PAK4 , and FAK5 , or one or more fragments 
-hereof, with a control nucleic acid cargec region encoding 
said kinase polypeptide, or one or more fragm.encs thereof ; 
and 

(b.) detecting differences in sequence or amount between 
said target region and said control target region, as an 
indicarion of said disease or disorder. 

32. The merhcd of claim 31, wherein said disease or 
disorder is selected from the group consisting of immune- 
related diseases and disorders, organ ' ransplanrat ion, 
myocardial infarction, cardiovascular disease, stroke, renal 
r allure, oxida-ive s tress - related neurodegenerative 
disorders, and cancer. 



30 
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SEQ ID NO: 5 STLK2 human Nterm=l-21 kln=22-274 
Cterni=275-416 

MAHSPVAVQVPGMQNNIADPEELFTKLERIGKGSFGEVFKGIDNRTQQVVAIKIIDLEEA 
EDEIEDIQQEITVLSQCDSSYVTKYYGSYLKGSKLWIIMEYLGGGSALDLLRAGPFDEFQ 
lATMLKEILKGLDYLHSEKKIHRDIKAANVLLSEQGDVKLADFGVAGQLTDTQIKRNTFV 
GTPFWMAPEVIQQSAYDSKADIWSLGITAIELAKGEPPNSDMHPMRVLFLIPKNNPPTLV 
GDFTKSFKEFIDACLNKDPSFRPTAKELLKHKFIVKNSKKTSYLTELIDRFKRWKAEGHS 
DDESDSEGSDSE5TSRENNTHPEWSFTTVRKKPDPKKVQNGAEQDLVQTLSCLSMIITPA 
FAELKQQDENNASRNQAIEELEKSIAVAEAACPGITDKMVKKLIEKFQKCSADESP 

SEQ ID NO: 6 STLK3 human Nterm=l-31 kin=32-308 Cterm=309-489 
(insert=327-352) tail=490-516 

TAAPAPAAPAAPAPAPAPAPAAQAVGWPICRDAYELQEVIGSGATAVVQAALCKPRQERV 
AIKRINLEKCQTSMDELLKEIQAMSQCSHPNVVTYYTSFVVKDELWLVMKLLSGGSMLDI 
IKYIVNRGEHKNGVLEEAIIATILKEVLEGLDYLHRNGQIHRDLKAGNILLGEDGSVQIA 
DFGVSAFLATGGDVTRNKVRKTFVGTPCWMAPEVMEQVRGYDFKADMWSFGITAIELATG 
AAPYHKYPPMKVLMLTLQNDPPTLETGVEDKEMMKKYGKSFRKLLSLCLQKDPSKRPTAA 
ELLKCKFFQKAKNREYLIEKLLTRTPDIAQRAKKVRRVPGSSGHLHKTEDGDWEWSDDEM 
DEKSEEGKAAFSQEKSRRVKEENPEIAVSASTIPEQIQSLSVHDSQGPPNANEDYREASS 
CAVNLVLRLRNSRKELNDIRFEFTPGRDTADGVSQELFSAGLVDGHDVVIVAANLQKIVD 
PPKALKTLTFKLASGCDGSEIPDEVKLIGFAQLSVS 

SEQ ID NO: 7 STLK4 human Nterm=absent , kin=l-178, Ctail=179-414, 
insertl=198-222, insert2=253-293 

KSGVLDXSTIATILREVLEGLEYLHKXGQIHRDVKAGNILXGEDGSVQIADFGVSAFLAT 
GGDITRNKVRKTFVGTPCWMAPEVMEQVRGYDFKADIWSFGITAIELATGAAPYHKYPPM 
KVLMLTLQNDPPSLETGVQDKEMLKKYGKSFRKMISLCLQKDPEKRPTAAELLRHKFFQK 
AKNKEFLQEKTLQRAPTISERAKKVRRVPGSSGRLHKTEDGGWEWSDDEFDEESEEGKAA 
ISQLRSPRVKESISNSELFPTTDPVGTLLQVPEQISAHLPQPAGQIATQPTQVSLPPTAE 
PAKTAQALSSGSGSQETKIPISLVLRLRNSKKELNDIRFEFTPGRDTAEGVSQELISAGL 
VDGRDLVIVAANLQKIVEEPQSNRSVTFKLASGVEGSDIPDDGKLIGFAQLSIS 

SEQ ID NO: 8 STLK5 human Nterm=absent , kin=l-222CLacks N-term), 
Ctail=224-274 

LICTHFMDGMNELAIAYILQGVLKALDYIHHMGYVHRSVKASHILISVDGKVYLSGLRSN 
LSMISHGQRQRVVHDFPKYSVKVLPWLSPEVLQQNLQGYDAKSDIYSVGITACELANGHV 
PFKDMPATQMLLEKLNGTVPCLLDTSTIPAEELTMSPSRSVANSGLSDSLTTSTPRPSNG 
DSPSHPYHRTFSPHFHHFVEQCLQRNPDARPSASTLLNHSFFKQIKRRASEALPELLRPV 

Fig- 8A 
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TPITNFEGSQSQDHSGIFGLVTNLEELEVDOWEF 

SEQ ID NO: 13 ZCl human 1/5/98 Nterm=l-22 kin=23-289 

coiled-coil=290-526 pro=527-640 8=641-896 Rab/Rac-BD=897-1239 

MANDSPAKSLVDIDLSSLRDPAGIFELVEVVGNGTYGQVYKGRHVKTGQLAAIKVMDVTE 

DEEEEIKLEINMLKKYSHHRNIATYYGAFIKKSPPGHDDQLWLVMEFCGAGSITDLVKNT 

KGNTLKEDWIAYISREILRGLAHLHIHHVIHRDIKGQNVLLTENAEVKLVDFGVSAQLDR 

TVGRRNTFIGTPYWMAPEVIACDENPDATYDYRSDLWSCGITAIEMAEGAPPLCDMHPMR 

ALFLIPRNPPPRLKSKKWSKKFFSFIEGCLVKNYMQRPSTEQLLKHPFIRDQPNERQVRI 

QLKDHIDRTRKKRGEKDETEYEYSGSEEEEEEVPEQEGEPSSIVNVPGESTLRRDFLRLQ 

QENKERSEALRRQQLLQEQQLREQEEYKRQLLAERQKRIEQQKEQRRRLEEQQRREREAR 

RQQEREQRRREQEEKRRLEELERRRKEEEERRRAEEEKRRVEREQEYIRRQLEEEQRHLE 

VLQQQLLQEQAMLLECRWREMEEHRQAERLQRQLQQEQAYLLSLQHDHRRPHPQHSQQPP 

PPQQERSKPSFHAPEPKAHYEPADRAREVEDRFRKTNHSSPEAQSKQTGRVLEPPVPSRS 

ESFSNGNSESVHPALQRPAEPQVPVRTTSRSPVLSRRDSPLQGSGQQNSQAGQRNSTSIE 

PRLLWERVEKLVPRPGSGSSSGSSNSGSQPGSHPGSQSGSGERFRVRSSSKSEGSPSQRL 

ENAVKKPEDKKEVFRPLKPADLTALAKELRAVEDVRPPHKVTDYSSSSEESGTTDEEDDD 

VEQEGADESTSGPEDTRAASSLNLSNGETESVKTMIVHDDVESEPAMTPSKEGTLIVRRT 

QSASSTLQKHKSSSSFTPFIDPRLLQISPSSGTTVTSVVGFSCDGMRPEAIRQDPTRKGS 

VVNVNPTNTRPQSOTPEIRKYKKRFNSEILCAALWGVNLLVGTESGLMLLDRSGQGKVYP 

LINRRRFQQMDVLEGLNVLVTISGKKDKLRVYYLSWLRNKILHNDPEVEKKQGWTTVGDL 

EGCVHYKVVKYERIKFLVIALKSSVEVYAWAPKPYHKFMAFKSFGELVHKPLLVDLTVEE 

GQRLKVIYGSCAGFHAVDVDSGSVYDIYLPTHIQCSIKPHAIIILPNTDGMELLVCYEDE 

GVYVNTYGRITKDVVLQWGEMPTSVAYIRSNQTMGWGEKAIEIRSVETGHLOGVFMHKRA 

QRLKFLCERNDKVFFASVRSGGSSQVYFMTLGRTSLLSW 

SEQ ID NO: 14 ZC2 human Nterm=m-issing kin=l-255 coiled-coil=256-442 
pro=443-526 8=627-954 Rab/RacBD=955-1297 

AFGEVYEGRHVKTGQLAAIKVMDVTGDEEEEIKQEINMLKKYSHHRNIATYYGAFIKKNP 
PGMDDQLWLVMEFCGAGSVTDLIKNTKGNTLKEEWIAYICREILRGLSHLHQHKVIHRDI 
KGQNVLLTENAEVKLVDFGVSAQLDRTVGRRNTFIGTPYWMAPEVIACDENPDATYDFKS 
DLWSLGITAIEMAEGAPPLCDMHPMRALFLIPRNPAPRLKSKKWSKKFQSFIESCLVKNH 
SQRPATEQLMKHPFIRDQPNERQVRIQLKDHIDRTKKKRGEKDETEYEYSGSEEEEEEND 
SGEPSSILNLPRESTLRRDFLRLQLANKERSEALRRQQLEQQQRENEEHKRQLLAERQKR 
lEEQKEQRRRLEEQQRREKELRKQQEREQRRHYEEQMRREEERRRAEHEQEYKRKQLEEQ 
RQAERLQRQLKQERDYLVSLQHQRQEQRPVEKKPLYHYKEGMSPSEKPAWAKEVEERSRL 
NRQSSPAMPHKVANRISDPNLPPRSESFSISGVQPARTPPMLRPVDPQIPHLVAVKSQGP 
ALTASQSVHEQPTKGLSGFQEALNVTSHRVEMPRQNSDPTSENPPLPTRIEKFDRSSWLR 
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QEEDIPPKVPQRTTSISPALARKNSPGNGSALGPRLGSQPIRASNPDLRRTEPILESPLQ 
RTSSGSSSSSSTPSSQPSSQGGSQPGSQAGSSERTRVRANSKSEGSPVLPHEPAKVKPEE 
SRDITRPSRPASYKKAIDEDLTALAKELRELRIEETNRPMKKVTDYSSSSEESESSEEEE 
EDGES ETHDGTVAVSDIPRLIPTGAPGSNEQYNVGMVGTHGLETSHADSFSGSISREGTL 
MIRETSGEKKRSGHSDSNGFAGHINLPDLVQQSHSPAGTPTEGLGRVSTHSQEMDSGTEY 
GMGSSTKASFTPFVDPRVYQTSPTDEDEEDEESSAAALFTGELLRQEQAKLNEARKISVV 
NVNPTNIRPHSDTPEIRKYKKRFNSEILCAALWGVNLLVGTENGLMLLDRSGQGKVYNLI 
NRRRFQQMDVLEGLNVLVTISGKKNKLRVYYLSWLRNRILHNDPEVEKKQGWITVGDLEG 
CIHYKVVKYERIKFLVIALKNAVEIYAWAPKPYHKFMAFKSFADLQHKPLLVDLTVEEGQ 
RLKVIFGSHTGFHVIDVDSGNSYDIYTPSHIQGNITPHAIVILPKTDGMEMLVCYEDE6V 
YVNTYGRITKDVVLQWGEMPTSVAYIHSNQIMGWGEKAIEIRSVETGHLDGVFMHKRAQR 
LKFLCERNDKVFFASVRSGGSSQVFFMTLNRNSMMNW 

SEQ ID NO: 15 ZC3 human kin=l-255 coiled-coil=256-476 pro=477-680 
8=681-983 Rab/RacBD =984-1326 

AFGEVYEGRHVKTGQLAAIKVMDVTEDEEEEIKQEINMLKKYSHHRNIATYYGAFIKKSP 
PGNDDQLWLVMEFCGAGSVTDLVKNTKGNALKEDCIAYICREILRGLAHLHAHKVIHRDI 
KGQNVLLTENAEVKLVDFGVSAQLDRTVGRRNTFIGTPYWMAPEVIACDENPDATYDYRS 
DIWSLGITAIEMAEGAPPLCDMHPMRALFLIPRNPPPRLKSKKWSKKFIDFIDTCLIKTY 
LSRPPTEQLLKFPFIRDQPTERQVRIQLKDHIDRSRKKRGEKEETEYEYSGSEEEDDSHG 
EEGEPSSIMNVPGESTLRREFLRLQQENKSNSEALKQQQQLQQQQQRDPEAHIKHLLHQR 
QRRIEEQKEERRRVEEQQRREREQRKLQEKEQQRRLEDMQALRREEERRQAEREQEYIRH 
RLEEEQRQLEILQQQLLQEQALLLEYKRKQLEEQRQSERLQRQLQQEHAYLKSLQQQQQQ 
QQLQKQQQQQLLPGDRKPLYHYGRGMNPADKPAWAREVEERTRMNKQQNSPLAKSKPGST 
GPEPPIPQASPGPPGPLSQTPPMQRPVEPQEGPHKSLVAHRVPLKPYAAPVPRSQSLQDQ 
PTRNLAAFPASHDPDPAIPAPTATPSARGAVIRQNSDPTSEGPGPSPNPPAWVRPDNEAP 
PKVPQRTSSIATALNTSGAGGSRPAQAVRARPRSNSAWQIYLQRRAERGTPKPPGPPAQP 
PGPPNASSNPDLRRSDPGWERSDSVLPASHGHLPQAGSLERNRVGVSSKPDSSPVLSPGN 
KAKPDDHRSRPGRPADFVLLKERTLDEAPRPPKKAMDYSSSSEEVESSEDDEEEGEGGPA 
EGSRDTPGGRDGDTDSVSTMVVHDVEEITGTQPPYGGGTMVVQRTPEEERNLLHADSNGY 
TNLPDVVQP5H5PTENSKGQSPPSKDGSGDYQSRGLVKAPGKSSFTMFVDLGIYQPGGSG 
DSIPITALVGGEGTRLDQLQYDVRKGSVVNVNPTNTRAHSETPEIRKYKKRFNSEILCAA 
LWGVNLLVGTENGLMLLDRSGQGKVYGLIGRRRFQQMDVLEGLNLLITISGKRNKLRVYY 
LSWLRNKILHNDPEVEKKQGWTTVGDMEGCGHYRVVKYERIKFLVIALKSSVEVYAWAPK 
PYHKFMAFKSFADLPHRPLLVDLTVEEGQRLKVIYGSSAGFHAVDVDSGNSYDIYIPVHI 
QSQITPHAIIFLPNTDGMEMLLCYEDEGVYVNTYGRIIKDVVLQWGEMPTSVAYICSNQI 
MGWGEKAIEIRSVETGHLDGVFMHKRAQRLKFLCERNDKVFFASVRSGGSSQVYFMTLNR 
NRIMNW 

Fig. 8C 
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SEQ ID NO: 16 ZC4 human Nterm kin coiled-coil pro B=missing 
Rab/RacBD=l-349 

NVNPLYVSPACKKPLIHMYEKEFTSEICCGSLWGVNLLLGTRSNLYLMDRSGKADITKLI 
RRRPFRQIQVLEPLNLLITISGHKNRLRVYHLTWLRNKILNNDPESKRRQEEMLKTEEAC 
KAIDKLTGCEHFSVLQHEETTYIAIALKSSIHLYAWAPKSFDESTAIKVFPTLDHKPVTV 
DLAIGSEKRLKIFFSSADGYHLIDAESEVMSDVTLPKNPLEIIIPQNIIILPDCLGIGMM 
LTFNAEALSVEANEQLFKKILEMWKDIPSSIAFECTQRTTGWGQKAIEVRSLQSRVLESE 
LKRRSIKKLRFLCTRGDKLFFTSTLRNHHSRVYFMTLGKLEELQSNYDV 

SEQ ID NO: 18 KHS2 human Nterm=l-13 kin=14-273 A=274-346 
Pro=347-534 RabBD =535-894 

MNPGFDLSRRNPQEDFELIQRIGSGTYGDVYKARNVNTGELAAIKVIKLEPGEDFAVVQQ 

EIIMMKDCKHPNIVAYFGSYLRRDKLWICMEFCGGGSLQDIYHVTGPLSELQIAYVSRET 

LQGLYYLHSKGKMHRDIKGANILLTDNGHVKLADFGVSAQITATIAKRKSFIGTPYWMAP 

EVAAVERKGGYNQICDLWAVGITAIELAELQPPMFDLHPMRALFLMTKSNFQPPKLKDKM 

KWSNSFHHFVKMALTKNPKKRPTAEKLLQHPFVTQHLTRSLAIELLDKVNNPDHSTYHDF 

DDDDPEPLVAVPHRIHSTSRNVREEKTRSEITFGQVKFDPPLRKETEPHHELPDSDGFLD 

SSEEIYYTARSNLDLQLEYGQGHQGGYFLGANKSLLKSVEEELHQRGHVAHLEDDEGODD 

ESKHSTLKAKIPPPLPPKPKSIFIPQEMHSTEDENQGTIKRCPMSGSPAKPSQVPPRPPP 

PRLPPHKPVALGNGMSSFQLNGERDGSLCQQQNEHRGTNLSRKEKKDVPKPISNGLPPTP 

KVHMGACFSKVFNGCPLKIHCASSWINPDTRDQYLIFGAEEGIYTLNLNELHETSMEQLF 

PRRCTWLYVMNNCLLSISGKASQLYSHNLPGLFDYARQMQKLPVAIPAHKLPDRILPRKF 

SVSAKIPETKWCQKCCVVRNPYTGHKYLCGALQTSIVLLEWVEPMQKFMLIKHIDFPIPC 

PLRMFEMLVVPEQEYPLVCVGVSRGRDFNQVVRFETVNPNSTSSWFTESDTPQTNVTHVT 

QLERDTILVCLDCCIKIVNLQGRLKSSRKLSSELTFDFQIESIVCLQDSVLAFWKHGMQG 

RSFRSNEVTQEISDSTRIFRLLGSDRVVVLESRPTDNPTANSNLYI.LAGHENSY 

SEQ ID NO: 22 SULUl human N=l-21 kin=22-277 A=278-427 
coiled-coill=428-637 8=638-751 coiled-coU2=752-898 

MRKGVLKDPEIDDLFYKDDPEELFIGLHEIGHGSFGAVYFATNAHTNEVVAIKKMSYSGK 
QTHEKWQDILKEVKFLRQLKHPNTIEYKGCYLKEHTAWLVMEYCLGSASDLLEVHKKPLQ 
EVEIAAITHGALHGLAYLHSHALIHRDIKAGNILLTEPGQVKLADFGSASMASPANSFVG 
TPYWMAPEVILAMDEGQYDGKVDIWSLGITCIELAERKPPLFNMNAMSALYHIAQNDSPT 
LQSNEWTDSFRRFVDYCLQKIPQERPTSAELLRHDFVRRDRPLRVLIDLIQRTKDAVREL 
DNLQYRKMKKILFQETRNGPLNESQEDEEDSEHGTSLNREMDSLGSNHSIPSMSVSTGSQ 
SSSVNSMQEVMDESSSELVMMHDDESTINSSSSVVHKKDHVFTRDEAGHGDPRPEPRPTQ 
SVQSQALHYRNRERFATIKSASLVTRQIHEHEQENELREQMSGYKRMRRQHQKQLIALEN 
KLKAEMDEHRLKLQKEVETHANNSSIELEKLAKKQVAIIEKEAKVAAADEKKFQQQILAQ 
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QKKDLTTFLESOKKQYKICKEKIKEEMNEDHSTPKKEKQERISKHKENLQHTQAEEEAHL 
LTQQRLYYDKNCRFFKRKIMIKRHEVEQQNIREELNKKRTQKEMEHAMLIRHDESTRELE 
YRQLHTLQKLRMDLIRLQHQTELENQLEYNKRRERELHRKHVMGLRQQPKNLKAMEMQIK 
KQFQDTCKVQTKQYKALKNHQLEVTPKNEHKTILKTLKDEQTRKLAILAEQYEQSINEMM 
ASQALRLDEAQEAECQALRLQLQQEMELLNAYQSKIKMQTEAQHERELQKLEQRVSLRRA 
HLEQKIEEELAALQKERSERIKNLLERQEREIETFDMESLRMGFGNLVTLDFPKEDYR 

SEQ ID NO: 23 SULU3 human Nterm=fiiisstng kin partial=l-66 A=67-215 

coiled-coill=216-425 B=426-539 coiled-coil2=540-786 CtaiU687-786 

lELAERKPPLFNMNAMSALYHIAQNESPTLQSNEWSDYFRNFVDSCLQKIPQDRPTSEEL 

LKHIFVLRERPETVLIDLIQRTKDAVRELDNLQYRKMKKLLFQEAHNGPAVEAQEEEEEQ 

DHGVGRTGTVNSVGSNQSIPSMSISASSQSSSVNSLPDVSDDKSELDMMEGDHTVMSNSS 

VIHLKPEEENYREEGDPRTRASDPQSPPQVSRHKSHYRNREHFATIRTASLVTRQMQEHE 

QDSELREQMSGYKRMRRQHQKQLMTLENKLKAEMDEHRLRLDKDLETQRNNFAAEMEKLI 

KKHQAAMEKEAKVMSNEEKKFQQHIQAQQKKELNSFLESQKREYKLRKEQLKEELNENQS 

TPKKEKQEWLSKQKENIQHFQAEEEANLLRRQRQYLELECRRFKRRMLLGRHNLEQDLVR 

EELNKRQTQKDLEHAMLLRQHESMQELEFRHLNTIQKMRCELIRLQHQTELTNQLEYNKR 

RERELRRKHVMEVRQQPKSLKSKELQIKKQFQDTCKIQTRQYKALRNHLLETTPKSEHKA 

VLKRLKEEQTRKLAILAEQYDHSINEMLSTQALRLDEAQEAECQVLKMQLQQELELLNAY 

QSKIKMQAEAQHDRELRELEQRVSLRRALLEQKIEEEMLALQNERTERIRSLLERQAREI 

EAFDSESMRLGFSNMVLSNLSPEAFSHSYPGASGWSHNPTGGPGPHWGHPMGGPPQAWGH 

PMQGGPQPWGHPSGPMQGVPRGSSMGVRNSPQALRRTASGGRTEQGMSRSTSVTSQISNG 
SHMSYT 

SEQ ID NO: 24 SULU3 murine Nterm=l-25 kin=26-273 A=274-422 
ccl=423-632 6=633-748 cc2=missing 

MPSTNRAGSLKDPEIAELFFKEDPEKLFTDLREIGHGSFGAVYFARDVRTNEVVAIKKMS 
YS6KQSTEKWQDIIKEVKFLQRIKHPNSIEYKGCYLREHTAWLVMEYCLGSASDLLEVHK 
KPLQEVEIAAITHGALQGLAYLHSHTMIHRDIKAGNILLTEPGQVKLADFGSASMASPAN 
SFVGTPYWMAPEVILAMDEGQYD6KVDVWSLGITCIELAERKPPLFNMNAMSALYHIAQN 
ESPTLQSNMNDSCLQKIPQDRPTSEELLKHMFVLRERPETVLIDLIQRTKDAVRELDNLQ 
YRKMKKLLFQEAHNGPAVEAQEEEEEQDHGVGRTGTVNSVGSNQSIPSMSISASSQSSSV 
NSLPDASDDKSELDMMEGDHTVMSN5SVIHLKPEEENYQEEGDPRTRASDPQSPPQVSRH 
KSHYRNREHFATIRTASLVTRQMQEHEQDSELREQMSGYKRMRRQHQKQLMTLENKLKAE 
MDEHRLRLDKDLETQRNNFAAEMEKLIKKHQAAMEKEAKVMANEEKKFQQHIQAQQKKEL 
NSFLESQKREYKLRKEQLKEELNENQSTPKKEKQEWLSKQKENIQHFQAEEEANLLRRQR 
QYLELECRRFKRRMLLGRHNLEQDLVREELNKRQTQKDLEHAMLLRQHESMQELEFRHLN 
TIQKMRCELIRLQHQTELTNQLEYNKRRERELRRKHVMEVRQQPKSLKSKELQIKKQFQD 
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SEQ ID NO: 26 GEK2 human N=l-33 kin=34-294 A=295-337 6=338-472 215 
coiled-coill=473-724 215 coiled-coil2=725-912 

MAFANFRRILRLSTFEKRKSREYEHVRRDLDPNEVWEIVGELGDGAFGKVYKAKNKETGA 
LAAAKVIETKSEEELEDYIVEIEILATCDHPYIVKLLGAYYHDGKLWIMIEFCPGGAVDA 
IMLELDRGLTEPQIQVVCRQMLEALNFLHSKRIIHRDLKAGNVLMTLEGDIRLADFGVSA 
KNLKTLQKRDSFIGTPYWMAPEVVMCETMKDTPYDYKADIWSLGITLIEMAQIEPPHHEL 
NPMRVLLKIAKSDPPTLLTPSKWSVEFRDFLKIALDKNPETRPSAAQLLEHPFVSSITSN 
KALRELVAEAKAEVMEEIEDGRDEGEEEDAVDAASTLENHTQNSSEVSPPSLNADKPLEE 
SPSTPLAPSQSQDSVNEPCSQPSGDRSLQTTSPPVVAPGNENGLAVPVPLRKSRPVSMDA 
RIQVAQEKQVAEQGGDLSPAANRSQKASQSRPNSSALETLGGEKLANGSLEPPAQAAPGP 
SKRDSDCSSLCTSESMDYGTNLSTDLSLNKEMGSLSIKDPKLYKKTLKRTRKFVVDGVEV 
SITTSKIISEDEKKDEEMRFLRRQELRELRLLQKEEHRNQTQLSNKHELQLEQMHKRFEQ 
EINAKKKFFDTELENLERQQKQQVEKMEQDHAVRRREEARRIRLEQDRDYTRFQEQLKLM 
KKEVKNEVEKLPRQQRKESMKQKMEEHTQKKQLLDRDFVAKQKEDLELAMKRLTTDNRRE 
ICDKERECLMKKQELLRDREAALWEMEEHQLQERHQLVKQQLKDQYFLQRHELLRKHEKE 
REQMQRYNQRMIEQLKVRQQQEKARLPKIQRSEGKTRMAMYKKSLHINGGGSAAEQREKI 

KQFSQQEEKRQKSERLQQQQKHENQMRDMLAQCESNMSELQQLQNEKCHLLVEHETQKLK 
ALDESHNQNLKE 

SEQ ID NO: 29 PAK4 human Rac=l-51 A=52-224 Nterm=225-393 
kin=394-658 Ctail=659-681 residues 13-23 
(SAPQNFQHRVH)= Cdc42 /Rac-binding motif 

MFRKKKKKRPEISAPQNFQHRVHTSFDPKEGKFVGLPPQWQNILDTLRRPKPVVDPSRIT 
RVQLQPMKTVVRGSAMPVDGYISGLLNDIQKLSVISSNTLRGRSPTSRRRAQSLGLLGDE 
HWATDPDMYLQSPQSERTDPHGLYLSCNGGTPAGHKQMPWPEPQSPRVLPNGLAAKAQSL 
GPAEFQGASQRCLQLGACLQSSPPGASPPTGTNRHGMKAAKHGSEEARPQSCLVGSATGR 
PGGEGSPSPKTRES5LKRRLFRSMFLSTAATAPPSSSKPGPPPQSKPNSSFRPPQKDNPP 
SLVAKAQSLPSDQPVGTFSPLTTSDTSSPQKSLRTAPATGQLPGRSSPAGSPRTWHAQIS 
TSNLYLPQDPTVAKGALAGEDTGVVTHEQFKAALRMVVDQGDPRLLLDSYVKIGEGSTGI 
VCLAREKHSGRQVAVKMMDLRKQQRRELLFNEVVIMRDYQHFNVVEMYKSYLVGEELWVL 
MEFLQGGALTDIVSQVRLNEEQIATVCEAVLQALAYLHAQGVIHRDIKSDSILLTLDGRV 
KLSDFGFCAQISKDVPKRKSLVGTPYWMAPEVISRSLYATEVDIWSLGIMVIEMVDGEPP 
YFSDSPVQAMKRLRDSPPPKLKNSHKVSPVLRDf LERMLVRDPQERATAQELLDHPFLLQ 
TGLPECLVPLIQLYRKQTSTC 
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SEQ ID NO: 30 PAK5 human Rac A=niissing Nterm partial=l-114 
kin=115-379 Ctail=380-398 

ASGAKLAAGRPFNTYPRADTDHPSRGAQGEPHDVAPNGPSAGGLAIPQSSSSSSRPPTRA 
RGAPSPGVLGPHASEPQLAPPACTPAAPAVPGPPGPRSPQREPQRVSHEQFRAALQLVVD 
PGDPRSYLDNFIKIGEGSTGIVCIATVRSSGKLVAVKKMDLRKQQRRELLFNEVVIMRDY 
QHENVVEMYNSYLVGDELWVVMEFLEGGALTDIVTHTRMNEEQIAAVCLAVLQALSVLHA 
QGVIHRDIKSDSILLTHDGRVKLSDFGFCAQVSKEVPRRKSLVGTPYWMAPELISRLPYG 
PEVDIWSLGIMVIEMVDGEPPYFNEPPLKAMKMIRDNLPPRLKNLHKVSPSLKGFLDRLL 
VRDPAQRATAAELLKHPFLAKAGPPASIVPLMRQNRTR 
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SEQ ID NO: 1 STLK2 HUMAN 

TAACAGCCCACCTCCTAGCCCCGGGCTACGCGCCGCCAGCCCAGTAACCCCACTTTTGTG 
TGTCCTCCCAGGCCCCGATCGAAAAGCCTGGGAGGGCCGCCGAACTACCCCCGGAGGGAG 
GAGCCAGTCCGAACCCAAGGCGCCACCGCCGCAGAAGCGGAGCGAGGCAGCATTCGCCTC 
CATGGCCCACTCGCCGGTGGCTGTCCAAGTGCCTGGGATGCAGAATAACATAGCTGATCC 
AGAAGAACTGTTCACAAAATTAGAGCGCATTGGGAAAGGCTCATTTGGGGAA6TTTTCAA 
AGGAATTGATAACCGTACCCAGCAAGTCGTTGCTATTAAAATCATAGACCTTGAGGAAGC 
CGAAGATGAAATAGAAGACATTCAGCAAGAAATAACTGTCTTGAGTCAATGTGACAGCTC 
ATATGTAACAAAATACTATGGGTCATATTTAAAGGGGTCTAAATTATGGATAATAATGGA 
ATACCTGGGCGGTGGTTCAGCACTGGATCTTCTTCGAGCTGGTCCATTTGATGAGTTCCA 
GATTGCTACCATGCTAAAGGAAATTTTAAAAGGTCTGGACTATCTGCATTCAGAAAAGAA 
AATTCACCGAGACATAAAAGCTGCCAATGTCTTGCTCTCAGAACAAGGAGATGTTAAACT 
TGCTGATTTTGGAGTTGCTGGTCAGCTGACAGATACACAGATTAAAAGAAATACCTTTGT 
GGGAACTCCATTTTGGATGGCTCCTGAAGTTATTCAACAGTCAGCTTATGACTCAAAAGC 
TGACATTTGGTCATTGGGAATTACTGCTATTGAACTAGCCAAGGGAGAGCCACCTAACTC 
CGATATGCATCCAATGAGAGTTCTGTTTCTTATTCCCAAAAACAATCCTCCAACTCTTGT 
TGGAGACTTTACTAAGTCTTTTAAGGAGTTTATTGATGCTTGCCTGAACAAAGATCCATC 
ATTTCGTCCTACAGCAAAAGAACTTCTGAAACACAAATTCATTGTAAAAAATTCAAAGAA 
GACTTCTTATCTGACTGAACTGATAGATCGTTTTAAGAGATGGAAGGCAGAAGGACACAG 
TGATGATGAATCTGATTCCGAGGGCTCTGATTCGGAATCTACCAGCAGGGAAAACAATAC 
TCATCCTGAATGGAGCTTTACCACCGTACGAAAGAAGCCTGATCCAAAGAAAGTACAGAA 
TGGGGCAGAGCAAGATCTTGTGCAAACCCTGAGTTGTTTGTCTATGATAATCACACCTGC 
ATTTGCTGAACTTAAACAGCAGGACGAGAATAACGCTAGCAGGAATCAGGCGATTGAAGA 
ACTCGAGAAAAGTATTGCTGTGGCTGAAGCCGCCTGTCCCGGCATCACAGATAAAATGGT 
GAAGAAACTAATTGAAAAATTTCAAAAGTGTTCAGCAGACGAATCCCCCTAAGAAACTTA 
TTATTGGCTTCTGTTTCATATGGACCCAGAGAGCCCCACCAAACCTACGTCAAGATTAAC 
AATGCTTAACCCATGAGCTCCATGTGCCTTTTGGATCTTTGCAACACTGAAGATTTGGAA 
GAAGCTATTAAACTATTTTGTGATGGCGTTTATCATTTTATATTTTGAAAGGATTATTTT 
GTAAGGAATAACTTTTAATACTATAGTTTCACCTGTATTCTAGTAAATGTTGAGACACCG 
TTTTGCTTTTAAGTATCCCTATTTCTTAAGTTACGAGGATGAATACCTTTCACATTTTGA 
TCTTTAGTTGACTCTACAGTCATGAAACATACAGGTCTTTCAAAGTCATTCTCAATATTC 
AGCTTTTGTAAATTATCAAGCTTCAAAAAGCTTTTTTTTAAAAAAAAAAACATGCATATT 
CTAAAAATGACTATTGGTGGGGAGGTGTAAATAAGTCATACCTTCTTAAAACAGAAAATT 
TAAGTAAAGTCTTTTAAATGAAACCTGTAAAAGTATTGACTCTTCTACCAAGTTGGTATG 
ATATTCCAGGCAGCTCAATGATTATCACATTTGAGACCCTGTGTTTGAAGCATTTACAGG 
CAATGTACAGCAACAGAGGTACCTCTTGGTGTATAGTATTTACATTCTCTTTTAGGTAGA 
AGAGGCAATTTTACCCTTATTTCACATGGTTAGAAATTTAAAGCAAGATCATTTACCCAA 
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GGATAGGTGTTTGGTAATGTTGAAGGAGTTAGTCTGGCTTCATGTTTTACATCTTCAACT 

AAAATCCCATACTATCTGCTTGGATTTGGAGAGCCAAAAAATAAAGCTGATTGTCATGTG 

ATTAAATATCTGATCAACAGGTATGAATATAACTTAAATCAGCATATTTTTGCCATGGTA 

ATAAATTGTCCTATAAACTATTTATATATTTTTGTTCTTCATAATTATCACTAATAAGCA 

TCAGTTTGTTGTTTTTAAAAGGATATTTAAGTGAGCATTTTCTAGTTCATATGAAAATAA 

CCATAGTACAGGATGATTTCTGTCCACACAAAGGTTAAATTA6ATTGCACAGTTAATTTT 

CACTTATATTTATGGTACTATTATGTGGGTGATGCCTTTTTCTTTTAAGCCCAGTACATA 

TATTATGCCTGCCTAAGTTCTGAACTGGGGCTGTATTTCAGTAGTTGTAGAATTATTGAT 

ATTTAGTTTTGATAGCTAATGTTTAATTGTTTGGATCTGCACAGTTTGGTTTTTGCACAA 

AAGTCATTTAAAAAAATCTGAGTAATTGTCAAATATTAAAAGAAAGATATTCTTCCTGTA 

AGGAATACAGTTTTTAGTCAAAGTGGCCATTACATCCTCTTTTTAATTTACATAATACAG 

ATACTTGAGAAAGTTGTTGTGGTGTTGTATGCCAAGAAAATTCTTTTTATTGGTGCCTAT 

ATTGTAACAATTATTTTTAATGCATTGTATTTTGAAGTAACGGTTCAGTTAAATTTTTCA 

CCTGCTGTGTAACTGAAACACAATTACAGTTTATAATCATCTGTAGAAGTCTGGAGATAA 

TTTTGCAACTCATGTTATGGGTTAAATGAATATTTTTGTAAAAGTAAAAGCAACAAATTT 

ATAAATTGATTATTTGAAACTTTACAACACAATTGCATCCCAAATACAAATTGTATTGCT 

TATTCATTATAGCTATTCGTCCTGTAATCTGTTTCTAGGTGAAGCATACTCCAGTGTTTT 

AGGGGTTTTGAAAATAAATATTTAAATTTCACAGTCAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAA 

SEQ ID NO: 2 STLK3 HUMAN 

GACAGCAGCGCCGGCCCCGGCAGCTCCCGCGGCCCCGGCCCCGGCCCCGGCCCCGGCCCC 
GGCGGCACAGGCTGTCGGCTGGCCCATCTGCAGGGACGCGTACGAGCTGCAGGAGGTTAT 
CGGCAGTGGAGCTACTGCTGTGGTTCAGGCAGCCCTATGCAAACCCAGGCAAGAACGTGT 
AGCAATAAAACGGATCAACTTGGAAAAATGCCAGACCAGTATGGATGAACTATTAAAAGA 
AATTCAAGCCATGAGTCAGTGCAGCCATCCCAACGTAGTGACCTATTACACCTCTTTTGT 
GGTCAAAGATGAACTTTGGCTGGTCATGAAATTACTAAGTGGAGGTTCAATGTTGGATAT 
CATAAAATACATTGTCAACCGAGGAGAACACAAGAATGGAGTTCTGGAAGAGGCAATAAT 
AGCAACAATTCTTAAAGAGGTTTTGGAAGGCTTAGACTATCTACACAGAAACGGTCAGAT 
TCACAGGGATTTGAAAGCTGGTAATATTCTTCTGGGTGAGGATGGTTCAGTACAAATAGC 
AGATTTTGGGGTAAGTGCGTTCCTAGCAACAGGGGGTGATGTTACCCGAAATAAAGTAAG 
AAAAACATTCGTTGGCACCCCATGTTGGATGGCTCCTGAAGTCATGGAACAGGTGAGA6G 
CTATGACTTCAAGGCTGACATGTGGAGTTTTGGAATAACTGCCATTGAATTAGCAACAGG 
AGCAGCGCCTTATCACAAATATCCTCCCATGAAAGTGTTAATGTTGACTTTGCAAAATGA 
TCCACCCACTTTGGAAACAGGGGTAGAGGATAAAGAAATGATGAAAAAGTACGGCAAGTC 
CTTTAGAAAATTACTTTCACTGTGTCTTCAGAAAGATCCTTCCAAAAGGCCCACAGCAGC 
AGAACTTTTAAAATGCAAATTCTTCCAGAAAGCCAAGAACAGAGAGTACCTGATTGAGAA 
GCTGCTTACAAGAACACCAGACATAGCCCAAAGAGCCAAAAAGGTAAGAAGAGTTCCTGG 
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GTCAAGTGGTCACCTTCATAAAACCGAAGACGGGGACTGGGAGTGGAGTGACGACGAGAT 

GGATGAGAAGAGCGAAGAAGGGAAAGCAGCTTTTTCTCAGGAAAAGTCACGAAGAGTAAA 

AGAAGAAAATCCAGAGATTGCAGTGAGTGCCAGCACCATCCCCGAACAAATACAGTCCCT 

CTCTGTGCACGACTCTCAGGGCCCACCCAATGCTAATGAAGACTACAGAGAAGCTTCTTC 

TTGTGCCGTGAACCTCGTTTTGAGATTAAGAAACTCCAGAAAGGAACTTAATGACATACG 

ATTTGAGTTTACTCCAGGAAGAGATACAGCAGATGGTGTATCTCAGGAGCTCTTCTCTGC 

TGGCTTGGTGGATGGTCACGATGTAGTTATAGTGGCTGCTAATTTACAGAAGATTGTAGA 

TGATCCCAAAGCTTTAAAAACATTGACATTTAAGTTGGCTTCTGGCTGTGATGGGTCGGA 

GATTCCTGATGAAGTGAAGCTGATTGGGTTTGCTCAGTTGAGTGTCAGCTGATGTATGTC 

CCTTGATGTCACCCTGATCTGTCATGCCCCACCGCCACCCCTACTCCCTTCAACCCTCCC 

TCTTTCTGCCCATTTCCTCCCACCCCCTCACTCCCATTTCCTAGCAAAATCAGAAGATTG 

TGAAGAGGCCGGCTTCAACAAAATGGGATAAAAAAATAATTTTTTAAAACTTACAACACT 

CCGAGTTCTGCTTTATTCTCTAGCAATCCACAGTACAAGAACAAGCAAATGCCACAGCTG 

CACGACTGTTGCTCATTTTTCCAAAAGCTATTTAATATTCTTAGCAATCAATTTGGATAT 

CCCTTAAGTGAAAAGAATCTGAAATACACTCAGGTGGTCTTATTTATTGGCAACAAAAGG 

AATTTTCTATCCAGAAGCCTATTTCTCCTTTCATTGTTGTTATTTCTGTTATAATACTTT 

AATTGTACATCTGACAATACTGCCTCTTTTATGTTGTATTTAGAAATTAATATACTTATA 

AAATTAAGATTTATTAGCCAAACTTGAATTCTAGTTTTAAAACTGACTGTGAATTTTATT 

TTTCATATATTTATGCATTACACACCTTAGCTATAAGAAAAAAAGG6TTTTGATTATATG 

CTTCTTGCAGTTAATCTCGTTATTTAAACAAAAAGTTTTGGGTCTATCTTTGGAGTATTT 

GTAACTTCTAAATTTTGAAATGACTGAATTAGGAATTTGGATGCTTATTCTTTTAGTCTG 

TTTGCCTAAAAACCAATTTACAATCTGACTGTCTCTTGGGAGAGGGAGGTGCCTTGCAAA 

CTTTCACATTAAGAATGTGCCTGAGGCTGCTTTACTCTGGAATAGTCTCAGATCTAAAAT 

TTCCTCTATATAAGGTGGCATATGTTAAGTTTTGCTTCATTGGACCGTTTAGAATGCTAT 

GTAAAATGTTGCCATTCTGTTAGATTGCTAACTATATACCCATCTCTGATTTGGCTCTCC 

TTAAGTGATAGGATTTGTTATTCTAAAGGTGATAAACTTGAAAATATCAGAATCTGAGTT 

TTACTTGAAATTTTGCAGAATACCCAGGTGGAGTGAAAATTGGAAGGGTTTTGTGCAATG 

ACTAAAAGGTAAAACGCTGTTAAGGTTCAAGAATCAATACTTTCAACCCAAGTAGCCCTC 

TGCTTGACTGTATATTATGGAACTAGTAAACCTTAGGATTTTGAAAATTGGAGTCTAATC 

TTTCAAGGAGGTGGGCTCCCAGGATGGTACCATTGCTCTTTCCTAGCTAACCCTAGATAT 

GGCAGCTCTTTAATGTACTTCAAAAAGCAAATATATATTACTAAGGAAAAAAAGTTATTT 

ATAATTGCCTTGTCATAATTGTTAAGGTGTTCTAGAGCCATTTGCATACAATTTAATGTA 

ATTTCATTCCATTCTATTGTTTACACAACGATTACTCGAAGATGACTGCAAAGGTAAAAG 
GAAAATAAAAGTGTATTGCACAATGAAAAA 

SEQ ID NO: 3 STLK4 HUMAN 

CAAAAGTGGAGTCCTAGATGANTCTACCATTGCTACGATACTCCGAGAAGTACTGGAAGG 
GCTGGAATATCTGCATAAAANTGGACAGATCCACAGAGATGTGAAAGCTGGAAACATTCT 
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TNTTGGAGAAGATGGCTCAGTACAGATTTCAGACTTTGGGGTTAGTGCTTTTTTAGCAAC 
TGGTGGTGATATTACCCGAAATAAAGTGAGAAAGACCTTTGTTGGCACCCCTTGTTGGAT 
GGCACCTGAAGTTATGGAACAGGTCCGTGGTTATGATTTCAAAGCTGATATTTGGAGTTT 
TGGAATTACAGCAATTGAATTGGCTACAGGGGCGGCTCCTTATCATAAATATCCACCAAT 
GAAGGTTTTAATGCTGACACTGCAGAACGATCCTCCTTCTTTGGAAACTGGTGTTCAAGA 
TAAAGAAATGCTGAAAAAATATGGAAAATCATTTAGAAAAATGATTTCATTGTGCCTTCA 
AAAAGATCCAGAAAAAAGACCAACAGCAGCAGAACTATTAAGGCACAAATTTTTCCAGAA 
AGCAAAGAATAAAGAATTTCTTCAAGAAAAAACATTGCAGAGAGCACCAACCATTTCTGA 
AAGAGCAAAAAAGGTTCGGAGAGTACCAGGTTCCAGTGGGCGTCTTCATAAGACAGAGGA 
TGGAGGCTGGGAGTGGAGTGATGATGAATTTGATGAAGAAAGTGAGGAAGGGAAAGCAGC 
AATTTCACAACTCAGGTCTCCCCGAGTGAAAGAATCAATATCAAATTCTGAGCTCTTTCC 
AACAACTGATCCTGTGGGTACTTTGCTCCAAGTTCCAGAACAGATCTCTGCTCATCTACC 
TCAGCCAGCTGGGCAGATTGCTACACAGCCAACTCAAGTCTCTCTCCCACCCACCGCAGA 
GCCAGCAAAAACAGCTCAGGCTTTGTCTTCAGGATCAGGTTCACAAGAAACCAAGATCCC 
AATCAGTCTAGTACTAAGATTAAGGAATTCCAAAAAAGAACTAAATGATATTCGATTTGA 
ATTTACTCCTGGGAGAGATACAGCAGAGGGTGTCTCTCAGGAACTCATTTCTGCTGGCCT 
GGTCGACGGAAGGGATTTAGTAATAGTGGCAGCTAATTTGCAGAAAATTGTGGAAGAACC 
TCAGTCAAATCGATCTGTCACTTTCAAACTGGCATCTGGTGTCGAAGGCTCAGATATTCC 
TGATGATGGTAAACTGATAGGATTTGCCCAGCTCAGCATCAGCTAAACCACAACCCTGGA 
AGAGGCGGCCTAAGGAGATTCCACACATGCGTATCTCTGTTGCTTCTATTGGCCTAAACC 
CACTACTGCCAAAGAACCCAGCAACAAACCTCCCGGCTAGGAGCTTTAGAAGTCTTTATG 
TTCTTCCTGCCATCATTCCTCCTTTTCCCACAGGGAAAGAAAAGTTGGATCACTAGTGGC 
CAGCATCCCCAGAGTTCCGTTAGTAAACTTACTTCATATGTCCCCTGTCTTCCTCCATCT 
GAGAAGTGGCCCATGTGCTTCAAGGCCCAGGAGGGAGATCTGTCAGCTCATTCTTGCCTT 
ACTCCAATGATGGCCCAGGTGGAAAAGTAGCAGCTGTATCGGGCTTCCTCATCCTGCCTG 
TTCCCCCACACCTGCCAGGATATGGACATCTTGGGATATCTCTTTACCACTGAAGTAGAA 
TTGATTGTTCAGCTGGAGCCCAGAGAATTTAATTTAATGTTTTTTCTTTGTACCTGATGT 
GAATTCTAGCAACCTTTGTTAGGAAAAAGCACAGCCTCAGATGGAGGCAGCCTAAACTGT 
GTTCTTGTTTTGTTCATGGTGTTTCTAAGCGTTTTGCTGAAGCTGCTCTCAGGCACCCCC 
TTCTTCATTGCTCTCTCCAGAAAGGGTTGCTAGCCTTAACTTCAGCTGGTGCAAAACATC 
TGACTGTAGCCGAACTTCAGCCATCAGATCCTTCAAAGTGGAACTTTGGATTGTTTTTAC 
AGACAACATCGAGTAATGGCTTGTAAATGTGAATTTTGCCAGAGGTGGTTTTTGAACAGG 
AAAATCATAATTCATATCATTGGAGAAGTATTTATTTTCAAATATCAAATTGAAGAAAAA 
CTCAATCCTCCCATGAAAATCAGTTCGCCTGGCCTCCAAGTCGTGAGGAAATGGGTATGC 
AAGGCTGAGATTTCTACAGCAATAAAGGAGACACACACTGGGCCAGAGAGGCCTGCCTTC 
TGCCTGCTCTCCTGCACTGACCCTTTGGAGGGGGTCTCTGTGTGCTGAAGCTAACTCAAG 
ATGGAAAGTGAAACCACATGTGCCGTGACCTTTAGGTTTTATGAGTAGACAGTGTTCATT 
TGATTTTCTACAGAAATAATATAAATTATTCTTTAGGTTTAAAAAAGAGCACTCATAATG 
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CAATATGTGAATAATCAGTGAGGTTGATTTTTCTTTTTTCCTACCGTTTCATAGTCTTTG 
TCTAACTGCTAGTAACCCTACCGAGTTTTATATATGAGTGGGATACTCAATCTGGCCTTA 
AAAAGATACACAAAGATGGGCTGTGGGTCCCTGGAAAGGGGGAGAGTTGCCCTTTACAGA 
ATCACTCGAGCCCTTTCCAGCACTGTTGGTCTGATGAACAAGGTTGTTTTACCTTATTTT 
CTCTTGGAACATATCTGAAAACCTTCCCCACAAATAACTTGTCACACCTTTTGTTTCATT 
CTGAGTCTTTAGTTTTAGTCATGGGCTTTCTTCACCTGCTCTAGGTGCAAAGGCATGTTG 
GGAAAGAGATGGATGTTGGGGAGGAAGAGAGGAGATGGATTTCAGTTGGGAGTTAGGAGG 
AGAGTAGGTGAGATGATCAGACACCGGAGTTCAACGTCCCAGCAGTCTTGGTAAAAGGAG 
GGAGCCTGCTGAGCCAGGAGGGAGAAAAGAAGATTGACCAGCTTGCTAGAAAAATACTTA 
GCTTTTCTTTTTCTTTTTTTGTGGAGGG6GGACGGAGAGGAACAAGGATGGGGAGGTAGG 
AATGAGGTATAGAAAAGAGATAGCATCTTCTTTGGCACAAGACTAGTGGCTTACCGCTTA 
CCTTAGAGTTTTGTTTTTTTTTTTTCAAACCCATCAAAATCTACTTATTTAT6AATCCAA 
GGGGTGGCAGCATCACTCTGTTCTAGCATTCTTTGTGGAGATGGTCTGGTGCCTAGCTGG 
GAGTGAGCAGCAGCCCATCCCCTGTTCACTTTCTCTAGCCCATCATTACCTGTGAACTGC 
AGTGGGGCAGTCATGGCAAATAGAATTGGGCTGGGGTTTCTCCTTCTTTTCAGTTCATTG 
TTTGCCCTGCTAGGAATTAGAAGACAGACACCATGTCCCAGGACAGTGTTACTTCTTCTG 
CATGATGTGTGGTAGACTCCCTTTGCTGGCTTGTGCAGTGATACTGAGAAAATACATGAA 
CAGAAACTGCCCAGGTGGAACAGCACGTAACCTAGTGAGTGACTGTACTCCTTTCTAGGA 
ATGCTGATTCAGAGTGCACCTCTTTGACTAGGTCCCAGGATCCCCTTGTCCCTGGAGTAG 
GGACTAACTATAGCACAAAGTAATATGTGCCAATGCTATTTGTGAAATGTTTGGTCTTTC 
TAAACGACTAAAGGATTTGTTGGGTTTTTGCTTAAGTTTTGAACCAAATCCTAGAGCCAG 
CTGATAATATTTAATAATCTGGAGGAGA6AATAATGATGTACCAATAAGTGGAGATTCCT 
CCTTATGATGTATGCTAGGTTATGGAAGATGTAAAATATTCAACTTTTTCCTCCTTTTTT 
TGGACTTTGTATTTTACTGCATGTTTTCTTCATTTTTAATCAATAAAGAGTAAATTGTCA 
AAAAAAAAAAAAAAAAA 

SEQ ID NO: 4 STLK5 HUMAN 

CTCATCTGTACACACTTCATGGATGGCATGAAT6AGCTGGCGATTGCTTACATCCTGCAG 
GGGGTGCTGAAGGCCCTCGACTACATCCACCACATGGGATATGTACACAGGAGTGTCAAA 
GCCAGCCACATCCTGATCTCTGTGGATGGGAAGGTCTACCTGTCTGGTTTGCGCAGCAAC 
CTCAGCATGATAAGCCATGGGCAGCGGCAGCGAGTGGTCCACGATTTTCCCAAGTACAGT 
GTCAAGGTTCTGCCGTGGCTCAGCCCCGAGGTCCTCCAGCAGAATCTCCAGGGTTATGAT 
GCCAAGTCTGACATCTACAGTGTGGGAATCACAGCCTGTGAACTGGCCAACGGCCATGTC 
CCCTTTAAGGATATGCCTGCCACCCAGATGCTGCTAGAGAAACTGAACGGCACAGTGCCC 
TGCCTGTTGGATACCAGCACCATCCCCGCTGAGGAGCTGACCATGAGCCCTTCGCGCTCA 
GTGGCCAACTCTGGCCTGAGTGACAGCCTGACCACCAGCACCCCCCGGCCCTCCAACGGT 
GACTCGCCCTCCCACCCCTACCACCGAACCTTCTCCCCCCACTTCCACCACTTTGTGGAG 
CAGTGCCTTCAGCGCAACCCGGATGCCAGGCCCAGTGCCAGCACCCTCCTGAACCACTCT 
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TTCTTCAAGCAGATCAAGCGACGTGCCTCAGAGGCTTTGCCCGAATTGCTTCGTCCTGTC 

ACCCCCATCACCAATTTTGAGGGCAGCCAGTCTCAGGACCACAGTGGAATCTTTGGCCTG 

GTAACAAACCTGGAAGAGCTGGAGGTGGACGATTGGGAGTTCTGAGCCTCTGCAAACTGT 

GCGCATTCTCCAGCCAGGGATGCAGAGGCCACCCAGAGGCCCTTCCTGAGGGCCGGCCAC 

ATTCCCGCCCTCCTGGGCAGATTGGGTAGAAAGGACATTCTTCCAGGAAAGTTGACTGCT 

GACTGATTGGGAAAGAAAATCCTGGAGAGATACTTCACTGCTCCAAGGCTTTTGAGACAC 

AAGGGAATCTCAACAACCAGGGATCAGGAGGGTCCAAAGCCGACATTCCCAGTCCTGTGA 

GCTCAGGTGACCTCCTCCGCAGAAGAGAGATGCTGCTCTGGCCCTGGGAGCTGAATTCCA 

AGCCCAGGGTTTGGCTCCTTAAACCCGAGGACCGCCACCTCTTCCCAGTGCTTGCGACCA 

GCCTCATTCTATTTAACTTTGCTCTCAGATGCCTCAGATGCTATAGGTCAGTGAAAGGGC 

AAGTAGTAAGCTGCCTGCCTCCCTTCCCTCAGACCTCTCCCTCATAATTCCAGAGAAG6G 

CATTTCTGTCTTTTTAAGCACAGACTAAGGCTGGAACAGTCCATCCTTATCCCTCTTCTG 

GCTTGGGCCCTGACACCTAAGTCTTTCCCACGGTTTATGTGTGTGCCTCATTCCTTTCCC 

ACCAAGAATCCATCTTAGCGCCTCCTGCCAGCTGCCCTGGTGCTTTCTCCAAGGGCCATC 

AGTGTCTTGCCTAGCTTGAGGGCTTAAGTCCTTATGCTGTGTTAGTTTCGTTGTCAGAAC 
AAATTAAAATTTTCAGAGACGCTG 

SEQ ID NO: 9 ZCl HUMAN 

GAGACCATGGCGAACGACTCTCCCGCGAAAAGTCTGGTGGACATCGACCTCTCCTCCCTG 
CGGGATCCTGCTGGGATTTTTGAGCTGGTGGAAGTGGTTGGAAATGGCACCTATGGACAA 
GTCTATAAGGGTCGACATGTTAAAACGGGTCAGTTGGCAGCCATCAAAGTTATGGATGTC 
ACTGAGGATGAAGAGGAAGAAATCAAACTGGAGATAAATATGCTAAAGAAATACTCTCAT 
CACAGAAACATTGCAACATATTATGGTGCTTTCATCAAAAAGAGCCCTCCAGGACATGAT 
GACCAACTCTGGCTTGTTATGGAGTTCTGTGGGGCTGGGTCCATTACAGACCTTGTGAAG 
AACACCAAAGGGAACACACTCAAAGAAGACTGGATCGCTTACATCTCCAGAGAAATCCTG 
AGGGGACTGGCACATCTTCACATTCATCATGTGATTCACCGGGATATCAAGGGCCAGAAT 
GTGTTGCTGACTGAGAATGCAGAGGTGAAACTTGTTGACTTTGGTGTGAGTGCTCAGCTG 
GACAGGACTGTGGGGCGGAGAAATACGTTCATAGGCACTCCCTACTGGATGGCTCCTGAG 
GTCATCGCCTGTGATGAGAACCCAGATGCCACCTATGATTACAGAAGTGATCTTTGGTCT 
TGTGGCATTACAGCCATTGAGATGGCAGAAGGTGCTCCCCCTCTCTGTGACATGCATCCA 
ATGAGAGCACTGTTTCTCATTCCCAGAAACCCTCCTCCCCGGCTGAAGTCAAAAAAATGG 
TCGAAGAAGTTTTTTAGTTTTATAGAAGGGTGCCTGGTGAAGAATTACATGCAGCGGCCC 
TCTACAGAGCAGCTTTTGAAACATCCTTTTATAAGGGATCAGCCAAATGAAAGGCAAGTT 
AGAATCCAGCTTAAGGATCATATAGATCGTACCAGGAAGAAGAGAGGCGAGAAAGATGAA 
ACTGAGTATGAGTACAGTGGGAGTGAGGAAGAAGAGGAGGAAGTGCCTGAACAGGAAGGA 
GAGCCAAGTTCCATTGTGAACGTGCCTGGTGAGTCTACTCTTCGCCGAGATTTCCTGAGA 
CTGCAGCAGGAGAACAAGGAACGTTCCGAGGCTCTTCGGAGACAACAGTTACTACAGGAG 
CAACAGCTCCGGGAGCAGGAAGAATATAAAAGGCAACTGCTGGCAGAGAGACAGAAGCGG 
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ATTGAGCAGCAGAAAGAACAGAGGCGACGGCTAGAAGAGCAACAAAGGAGAGAGCGGGAA 
GCTAGAAGGCAGCAGGAACGTGAACAGCGAAGGAGAGAACAAGAAGAAAAGAGGCGTCTA 
GAGGAGTTGGAGAGAAGGCGCAAAGAAGAAGAGGAGAGGAGACGGGCAGAAGAAGAAAAG 
AGGAGAGTTGAAAGAGAACAGGAGTATATCAGGCGACAGCTAGAAGAGGAGCAGCGGCAC 
TTGGAAGTCCTTCAGCAGCAGCTGCTCCAGGAGCAGGCCATGTTACTGGAGTGCCGATGG 
CGGGAGATGGAGGAGCACCGGCAGGCAGAGAGGCTCCAGAGGCAGTTGCAACAAGAACAA 
GCATATCTCCTGTCTCTACAGCATGACCATAGGAGGCCGCACCCGCAGCACTCGCAGCAG 
CCGCCACCACCGCAGCAGGAAAGGAGCAAGCCAAGCTTCCATGCTCCCGAGCCCAAAGCC 
CACTACGAGCCTGCTGACCGAGCGCGAGAGGTGGAAGATA6ATTTAGGAAAACTAACCAC 
AGCTCCCCTGAAGCCCAGTCTAAGCAGACAGGCAGAGTATTGGAGCCACCAGTGCCTTCC 
CGATCAGAGTCTTTTTCCAATGGCAACTCCGAGTCTGTGCATCCCGCCCTGCAGAGACCA 
GCGGAGCCACAGGTTCCTGTGAGAACAACATCTCGCTCCCCTGTTCTGTCCCGTCGAGAT 
TCCCCACTGCAGGGCAGTGGGCAGCAGAATAGCCAGGCAGGACAGAGAAACTCCACCAGT 
ATTGAGCCCAGGCTTCTGTGGGAGAGAGTGGAGAAGCTGGTGCCCAGACCTGGCAGTGGC 
AGCTCCTCAGGGTCCAGCAACTCAG6ATCCCAGCCCGGGTCTCACCCTGGGTCTCAGAGT 
GGCTCCGGGGAACGCTTCAGAGTGAGATCATCATCCAAGTCTGAAGGCTCTCCATCTCAG 
CGCCTGGAAAATGCAGTGAAAAAACCTGAAGATAAAAAGGAAGTTTTCAGACCCCTCAAG 
CCTGCTGATCTGACCGCACTGGCCAAAGAGCTTCGAGCAGTGGAAGATGTACGGCCACCT 
CACAAAGTAACGGACTACTCCTCATCCAGTGAGGAGTCGGGGACGACGGATGAGGAGGAC 
GACGATGTGGAGCAGGAAGGGGCTGACGAGTCCACCTCAGGACCAGAGGACACCAGAGCA 
GCGTCATCTCTGAATTTGAGCAATGGTGAAACGGAATCTGTGAAAACCATGATTGTCCAT 
GATGATGTAGAAAGTGAGCCGGCCATGACCCCATCCAAGGAGGGCACTCTAATCGTCCGC 
CGGACTCAGTCCGCTAGTAGCACACTCCAGAAACACAAATCTTCCTCCTCCTTTACAGCT 
TTTATAGACCCCAGATTACTACAGATTTCTCCATCTAGCGGAACAACAGTGACATCTGTG 
GTGGGATTTTCCTGTGATGGGATGAGACCAGAAGCCATAAGGCAAGATCCTACCCGGAAA 
GGCTCAGTGGTCAATGTGAATCCTACCAACACTAGGCCACAGAGTGACACCCCGGAGATT 
CGTAAATACAAGAAGAGGTTTAACTCTGAGATTCTGTGTGCTGCCTTATGGG6AGTGAAT 
TTGCTAGTGGGTACAGAGAGTGGCCTGATGCTGCTGGACAGAAGTGGCCAAGGGAAGGTC 
TATCCTCTTATCAACCGAAGACGATTTCAACAAATGGACGTACTTGAGGGCTTGAATGTC 
TTGGTGACAATATCTGGCAAAAAGGATAAGTTACGTGTCTACTATTTGTCCTGGTTAAGA 
AATAAA ATACTIC ACAATGATCCAGAAGTTGAGAAGAAGCAGGGATGGACAACCGTAGGG 
GATTTGGAAGGATGTGTACATTATAAAGTTGTAAAATATGAAAGAATCAAATTTCTGGTG 
ATTGCTTTGAAGAGTTCTGTGGAAGTCTATGCGTGGGCACCAAAGCCATATCACAAATTT 
ATGGCCTTTAAGTCATTTGGAGAATTGGTACATAAGCCATTACTGGTGGATCTCACTGTT 
GAGGAAGGCCAGAGGTTGAAAGTGATCTATGGATCCTGTGCTGGATTCCATGCTGTTGAT 
GTGGATTCAGGATCAGTCTATGACATTTATCTACCAACACATATCCAGTGTAGCATCAAA 
CCCCATGCAATCATCATCCTCCCCAATACAGATGGAATGGAGCTTCTGGTGTGCTATGAA 
GATGAGGGGGTTTATGTAAACACATATGGAAGGATCACCAAGGATGTAGTTCTACAGTGG 
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GGAGAGATGCCTACATCAGTAGCATATATTCGATCCAATCAGACAATGGGCTGGGGAGAG 

AAGGCCATAGAGATCCGATCTGTGGAAACTGGTCACTTGGATGGTGTGTTCATGCACAAA 

AGGGCTCAAAGACTAAAATTCTTGTGTGAACGCAATGACAAGGTGTTCTTTGCCTCTGTT 

CGGTCTGGTGGCAGCAGTCAGGTTTATTTCATGACCTTAGGCAGGACTTCTCTTCTGAGC 

TGGTAGAAGCAGTGTGATCCAGGGATTACTGGCCTCCAGAGTCTTCAAGATCCTGAGAAC 
TTGGAATTCCTTGTAACT 

SEQ ID NO: 10 ZC2 HUMAN 

GCTTTCGGGGAGGTCTATGAGGGTCGTCATGTCAAAACGGGCCAGCTTGCAGCCATCAAG 

GTTATGGATGTCACAGGGGATGAAGAGGAAGAAATCAAACAAGAAATTAACATGTTGAAG 

AAATATTCTCATCACCGGAATATTGCTACATACTATGGTGCTTTTATCAAAAAGAACCCA 

CCAGGCATGGATGACCAACTTTGGTTGGTGATGGAGTTTTGT6GTGCTGGCTCTGTCACC 

GACCTGATCAAGAACACAAAAGGTAACACGTTGAAAGAGGAGTGGATTGCATACATCTGC 

AGGGAAATCTTACGGGGGCTGAGTCACCTGCACCAGCATAAAGTGATTCATCGAGATATT 

AAAGGGCAAAATGTCTTGCTGACTGAAAATGCAGAAGTTAAACTAGTGGACTTTGGAGTC 

AGTGCTCAGCTTGATCGAACAGTGGGCAGGAGGAATACTTTCATTGGAACTCCCTACTGG 

ATGGCACCAGAAGTTATTGCCTGTGATGAAAACCCAGATGCCACATATGATTTCAAGAGT 

GACTTGTGGTCTTTGGGTATCACCGCCATTGAAATGGCAGAAGGTGCTCCCCCTCTCTGT 

GACATGCACCCCATGAGAGCTCTCTTCCTCATCCCCCGGAATCCAGCGCCTCGGCTGAAG 

TCTAAGAAGTGGTCAAAAAAATTCCAGTCATTTATTGAGAGCTGCTTGGTAAAGAATCAC 

AGCCAGCGACCAGCAACAGAACAATTGATGAAGCATCCATTTATACGAGACCAACCTAAT 

GAGCGACAGGTCCGCATTCAACTCAAGGACCATATTGATAGAACAAAGAAGAAGCGAGGA 

GAAAAAGATGAGACAGAGTATGAGTACAGTGGAAGTGAGGAAGAAGAGGAGGAGAATGAC 

TCAGGAGAGCCCAGCTCCATCCTGAATCTGCCAAGGGAGTCGACGCTGCGGAGGGACTTT 

CTGAGGCTGCAGCTGGCCAACAAGGAGCGTTCTGAGGCCCTACGGAGGCAGCAGCTGGAG 

CAGCAGCAGCGGGAGAATGAGGAGCACAAGCGGCAGCTGCTGGCCGAGCGTCAGAA6CGC 

ATCGAGGAGCAGAAAGAGCAGAGGCGGCGGCTGGAGGAGCAACAAAGGCGAGAGAAGGAG 

CTGCGGAAGCAGCAGGAGAGGGAGCAGCGCCGGCACTATGAGGAGCAGATGCGCCGG6AG 

GAGGAGAGGAGGCGTGCGGAGCATGAACAGGAATATAAGCGCAAACAATTGGAAGAACAG 

AGACAAGCAGAAAGACTGCAGAGGCAGCTAAAGCAAGAAAGAGACTACTTAGTTTCCCTT 

CAGCATCAGCGGCAGGAGCAGAGGCCTGTGGAGAAGAAGCCACTGTACCATTACAAAGAA 

GGAATGAGTCCTAGTGAGAAGCCAGCATGGGCCAAGGAGGTAGAAGAACGGTCAAGGCTC 

AACCGGCAAAGTTCCCCTGCCATGCCTCACAAGGTTGCCAACAGGATATCTGACCCCAAC 

CTGCCCCCAAGGTCGGAGTCCTTCAGCATTAGTGGAGTTCAGCCTGCTCGAACACCCCCC 

ATGCTCAGACCAGTCGATCCCCAGATCCCACATCTGGTAGCTGTAAAATCCCAGGGACCT 

GCCTTGACCGCCTCCCAGTCAGTGCACGAGCAGCCCACAAAGGGCCTCTCTGGGTTTCAG 

GAGGCTCTGAACGTGACCTCCCACCGCGTGGAGATGCCACGCCAGAACTCAGATCCCACC 

TCGGAAAATCCTCCTCTCCCCACTCGCATTGAAAAGTTTGACCGAAGCTCTTGGTTACGA 
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CAGGAAGAAGACATTCCACCAAAGGTGCCTCAAAGAACAACTTCTATATCCCCAGCATTA 
GCCAGAAAGAATTCTCCTGGGAATGGTAGTGCTCTGGGACCCAGACTAGGATCTCAACCC 
ATCAGAGCAAGCAACCCTGATCTCCGGAGAACTGAGCCCATCTTGGAGAGCCCCTTGCAG 
A6GACCAGCAGTGGCAGTTCCTCCAGCTCCAGCACCCCTAGCTCCCAGCCCAGCTCCCAA 
GGAGGCTCCCAGCCTGGATCACAAGCAGGATCCAGTGAACGCACCAGAGTTCGAGCCAAC 
AGTAAGTCAGAAGGATCACCTGTGCTCCCCCATGAGCCTGCCAAGGTGAAACCAGAAGAA 
TCCAGGGACATTACCCGGCCCAGTCGACCAGCTAGCTACAAAAAAGCTATAGATGAGGAT 
CTGACGGCATTAGCCAAAGAACTAAGAGAACTCCGGATTGAAGAAACAAACCGCCCAAT6 
AAGAAGGTGACTGATTACTCCTCCTCCAGTGAG6AGTCAGAAAGTAGCGAGGAAGAGGAG 
GAAGATGGAGAGAGCGAGACCCATGATGGGACAGTGGCTGTCAGCGACATACCCAGACTG 
ATACCAACAGGAGCTCCAGGCAGCAACGAGCAGTACAATGTGGGAATGGTGGGGACGCAT 
GGGCTGGAGACCTCTCATGCGGACAGTTTCAGCGGCAGTATTTCAAGAGAAGGAACCTTG 
ATGATTAGAGAGACGTCTGGAGAGAAGAAGCGATCTGGCCACAGTGACAGCAATGGCTTT 
GCTGGCCACATCAACCTCCCTGACCTGGTGCAGCAGAGCCATTCTCCAGCTGGAACCCCG 
ACTGAGGGACTGGGGCGCGTCTCAACCCATTCCCAGGAGATGGACTCTGGGACTGAATAT 
GGCATGGGGAGCAGCACCAAAGCCTCCTTCACCCCCTTTGTGGACCCCAGAGTATACCAG 
ACGTCTCCCACTGATGAAGATGAAGAGGATGAGGAATCATCAGCCGCAGCTCTGTTTACT 
GGCGAACTTCTTAGGCAAGAACAGGCCAAACTCAATGAAGCAAGAAAGATTTCGGTGGTA 
AATGTAAACCCAACCAACATTCGGCCTCATAGCGACACACCAGAAATCAGAAAATACAAG 
AAACGATTCAACTCAGAAATACTTTGTGCAGCTCTGTGGGGTGTAAACCTTCTGGTGGGG 
ACTGAAAATGGCCTGATGCTTTTGGACCGAAGTGGGCAAGGCAAAGTCTATAATCTGATC 
AACCGGAGGCGATTTCAGCAGATGGATGTGCTAGAGGGACTGAATGTCCTTGTGACAATT 
TCAGGAAAGAAGAATAAGCTACGAGTTTACTATCTTTCATGGTTAAGAAACAGAATACTA 
CATAATGACCCAGAAGTAGAAAAGAAACAAGGCTGGATCACTGTTGGGGACTTGGAAGGC 
TGTATACATTATAAAGTTGTTAAATATGAAAGGATCAAATTTTTGGTGATTGCCTTAAAG 
AATGCTGTGGAAATATATGCTTGGGCTCCTAAACCGTATCATAAATTCATGGCATTTAAG 
TCTTTTGCAGATCTCCAGCACAAGCCTCTGCTAGTTGATCTCACGGTAGAAGAAGGTCAA 
AGATTAAAGGTTATTTTTGGTTCACACACTGGTTTCCATGTAATTGATGTTGATTCAGGA 
AACTCTTATGATATCTACACACCATCTCATATTCAGGGCAATATCACTCCTCATGCTATT 
GTCATCTTGCCTAAAACAGATGGAATGGAAATGCTTGTTTGCTATGAGGATGAGGGGGTG 
TATGTAAACACCTATGGCCGGATAACTAAGGATGTGGTGCTCCAATGGGGAGAAATGCCC 
ACGTCTGTGGCCTACATTCATTCCAATCAGATAATGGGCTGGGGCGAGAAAGCTATTGAG 
ATCCGGTCAGTGGAAACAGGACATTTGGATGGAGTATTTATGCATAAGCGAGCTCAAAGG 
TTAAAGTTTCTATGTGAAAGAAATGATAAGGTATTTTTTGCATCCGTGCGATCTGGAGGA 
AGTAGCCAAGTGTTTTTCATGACCCTCAACAGAAATTCCATGATGAACTGGTAACAGAAG 
AGCACTTGGCACTTATCTTCATGGCGTTATTTCTAATTTAAAAGAACATAACTCATGTGG 
ACTTATGCCAGTCTAGAGGCAGAATCAGAAGGCTTGGTTGAACATATCGCTTTCCCTTTT 
TCCTCTCCCTCCGCCCCTCCCAGTACAGTCCATCT 

Fig. 9 I 

41/76 

SUBSTITUTE SHEET (RULE 26) 

BNSDC'CID: -^WO 9953036A2JA> 



wo 99/53036 PCT/L'S99/08150 

SEQ ID NO: 11 ZC3 HUMAN 

GCATTTGGGGAGGTGTATGAGGGTCGGCATGTCAAGACGGGGCAGCTGGCTGCCATCAAG 

6TCATGGATGTCACGGAGGACGAGGAGGAAGAGATCAAACAGGAGATCAACATGCTGAAA 

AAGTACTCTCACCACCGCAACATCGCCACCTACTACGGAGCCTTCATCAAGAAGAGCCCC 

CCGGGAAACGATGACCAGCTCTGGCTGGTGATGGAGTTCTGTGGTGCTGGTTCAGTGACT 

GACCTGGTAAAGAACACAAAAGGCAACGCCCTGAAGGAGGACTGTATCGCCTATATCTGC 

AGGGAGATCCTCAGGGGTCTGGCCCATCTCCATGCCCACAAGGTGATCCATCGAGACATC 

AAGGGGCAGAATGTGCTGCTGACAGAGAATGCTGAGGTCAAGCTAGTGGATTTTGGGGTG 

AGTGCTCAGCTGGACCGCACCGTGGGCAGACGGAACACTTTCATTGGGACTCCCTACTGG 

ATGGCTCCAGAGGTCATCGCCTGTGATGAGAACCCTGATGCCACCTATGATTACAGGAGT 

6ATATTTGGTCTCTAGGAATCACAGCCATCGAGATGGCAGAGGGAGCCCCCCCTCTGTGT 

GACATGCACCCCATGCGAGCCCTCTTCCTCATTCCTCGGAACCCTCCGCCCAGGCTCAAG 

TCCAAGAAGTGGTCTAAGAAGTTCATTGACTTCATTGACACATGTCTCATCAAGACTTAC 

CTGAGCCGCCCACCCACGGAGCAGCTACTGAAGTTTCCCTTCATCCGGGACCAGCCCACG 

GAGCGGCAGGTCCGCATCCAGCTTAAGGACCACATTGACCGATCCCGGAAGAAGCGGGGT 

GAGAAAGAGGAGACAGAATATGAGTACAGCGGCAGCGAGGAGGAAGATGACAGCCATGGA 

GAGGAAGGAGAGCCAAGCTCCATCATGAACGTGCCTGGAGAGTCGACTCTACGCCGGGAG 

TTTCTCCGGCTCCAGCAGGAAAATAAGAGCAACTCAGAGGCTTTAAAACAGCAGCAGCAG 

CTGCAGCAGCAGCAGCAGCGAGACCCCGAGGCACACATCAAACACCTGCTGCACCAGCGG 

CAGCGGCGCATAGAGGAGCAGAAGGAGGAGCGGCGCCGCGTGGAGGAGCAACAGCGGCGG 

GAGCGGGAGCAGCGGAAGCTGCAGGAGAAGGAGCAGCAGCGGCGGCTGGAGGACATGCAG 

GCTCTGCGGCGGGAGGAGGAGCGGCGGCAGGCGGAGCGCGAGCAGGAATATATTCGTCAC 

AGGCTAGAGGAGGAGCAGCGACAGCTCGAGATCCTTCAGCAACAGCTGCTCCAGGAACAG 

GCCCTGCTGCTGGAATACAAGCGGAAGCAGCTGGAGGAGCAGCGGCAGTCAGAACGTCTC 

CAGAGGCAGCTGCAGCAGGAGCATGCCTACCTCAAGTCCCTGCAGCAGCAGCAACAGCAG 

CAGCAGCTTCAGAAACAACAGCAGCAGCAGCTCCTGCCTGGGGACAGGAAGCCCCTGTAC 

CATTATGGTCGGGGCATGAATCCCGCTGACAAACCAGCCTGGGCCCGAGAGGTAGAAGAG 

AGAACAAGGATGAACAAGCAGCAGAACTCTCCCTTGGCCAAGAGCAAGCCAGGCAGCACG 

GGGCCTGAGCCCCCCATCCCCCAGGCCTCCCCAGGGCCCCCAGGACCCCTTTCCCAGACT 

CCTCCTATGCAGAGGCCGGTGGAGCCCCAGGAGGGACCGCACAAGAGCCTGGTGGCACAC 

CGGGTCCCACTGAAGCCATATGCAGCACCTGTACCCCGATCGCAGTCCCTGCAGGACCAG 

CCCACCCGAAACCTGGCTGCCTTCCCAGCCTCCCATGACCCCGACCCTGCCATCCCCGCA 

CCCACTGCCACGCCCAGTGCCCGAGGAGCTGTCATCCGCCAGAATTCAGACCCCACCTCT 

GAAGGACCTGGCCCCAGCCCGAATCCCCCAGCCTGGGTCCGCCCAGATAACGAGGCCCCA 

CCCAAGGTGCCTCAGAGGACCTCATCTATCGCCACTGCCCTTAACACCAGTGGGGCCGGA 

GGGTCCCGGCCAGCCCAGGCAGTCCGTGCCAGACCTCGCAGCAACTCCGCCTGGCAAATC 

TATCTGCAAAGGCGGGCAGAGCGGGGCACCCCAAAGCCTCCAGGGCCCCCTGCTCAGCCC 

CCTGGCCCGCCCAACGCCTCTAGTAACCCCGACCTCAGGAGGAGCGACCCTGGCTGGGAA 
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CGCTCGGACAGCGTCCTTCCAGCCTCTCACGGGCACCTCCCCCAGGCTGGCTCACTGGAG 
CGGAACCGCGTGGGAGTCTCCTCCAAACCGGACAGCTCCCCTGTGCTCTCCCCTGGGAAT 
AAAGCCAAGCCCGACGACCACCGCTCACGGCCAGGCCGGCCCGCAGACTTTGTGTTGCTG 
AAAGAGCGGACTCTGGACGAGGCCCCTCGGCCTCCCAAGAAGGCCATGGACTACTCGTCG 
TCCAGCGAGGAGGTGGAAAGCAGTGAGGACGACGAGGAGGAAGGCGAAGGCGGGCCAGCA 
GAGGGGAGCAGAGATACCCCTGGGGGCCGCGATGGGGATACAGACAGCGTCAGCACCATG 
GTGGTCCACGACGTCGAGGAGATCACCGGGACCCAGCCCCCATACGGGGGCGGCACCATG 
GTGGTCCAGCGCACCCCTGAAGAGGAGCGGAACCTGCTGCATGCTGACAGCAATGGGTAC 
ACAAACCTGCCTGACGTGGTCCAGCCCAGCCACTCACCCACC6AGAACAGCAAAGGCCAA 
AGCCCACCCTCGAAGGATGGGAGTGGTGACTACCAGTCTCGTGGGCTGGTAAAGGCCCCT 
GGCAAGAGCTCGTTCACGATGTTTGTGGATCTAGGGATCTACCAGCCTGGAGGCAGTGGG 
GACAGCATCCCCATCACAGCCCTAGTGGGTGGAGAGGGCACTCGGCTCGACCAGCTGCA6 
TACGACGTGAGGAAGGGTTCTGTGGTCAACGTGAATCCCACCAACACCCGGGCCCACAGT 
GAGACCCCTGAGATCCGGAAGTACAAGAAGCGATTCAACTCCGAGATCCTCTGTGCAGCC 
CTTTGGGGGGTCAACCTGCTGGTGGGCACGGAGAACGGGCTGATGTTGCTGGACCGAAGT 
GGGCAGGGCAAGGTGTATGGACTCATTGGGCGGCGACGCTTCCAGCAGATGGATGTGCTG 
GAGGGGCTCAACCTGCTCATCACCATCTCAGGGAAAAGGAACAAACTGCGGGTGTATTAC 
TTGTCCTGGCTCCGGAACAAGATTCTGCACAATGACCCAGAAGTGGAGAAGAAGCAGGGC 
TGGACCACCGTGGGGGACATGGAGGGCTGCGGGCACTACCGTGTTGTGAAATACGAGCGG 
ATTAAGTTCCTGGTCATCGCCCTCAAGAGCTCCGTGGAGGTGTATGCCTGGGCCCCCAAA 
CCCTACCACAAATTCATGGCCTTCAAGTCCTTTGCCGACCTCCCCCACCGCCCTCTGCTG 
GTCGACCTGACAGTAGAGGAGGGGCAGCGGCTCAAGGTCATCTATGGCTCCAGTGCTGGC 
TTCCATGCTGTGGATGTCGACTCGGGGAACAGCTATGACATCTACATCCCTGTGCACATC 
CAGAGCCAGATCACGCCCCATGCCATCATCTTCCTCCCCAACACCGACGGCATGGAGATG 
CTGCTGTGCTACGAGGACGAGGGTGTCTACGTCAACACGTACGGGCGCATCATTAAGGAT 
GTGGTGCTGCAGTGGGGGGAGATGCCTACTTCTGTGGCCTACATCTGCTCCAACCAGATA 
ATGGGCTGGGGTGAGAAAGCCATTGAGATCCGCTCTGTGGAGACGGGCCACCTCGACGGG 
GTCTTCATGCACAAACGAGCTCAGAGGCTCAAGTTCCTGTGTGAGCGGAATGACAAGGTG 
TTTTTTGCCTCAGTCCGCTCTGGGGGCAGCAGCCAAGTTTACTTCATGACTCTGAACCGT 
AACCGCATCATGAACTGGTGACGGGGCCCTGGGCTGGGGCTGTCCCACACTGGACCCAGC 
TCTCCCCCTGCAGCCAGGCTTCCCGGGCCGCCCCTCTTTCCCCTCCCTGGGCTTTTGCTT 
TTACTGGTTTGATTTCACTGGAGCCTGCTGGGAACGTGACCTCTGACCCCTGA 

SEQ ID NO: 12 ZC4 HUMAN 

CAATGTTAACCCACTCTATGTCTCTCCTGCATGTAAAAAACCACTAATCCACATGTATGA 
AAAGGAGTTCACTTCTGAGATCTGCTGTGGTTCTTTGTGGGGAGTCAATTTGCTGTTGGG 
AACCCGATCTAATCTATATCTGATGGACAGAAGTGGAAAGGCTGACATTACTAAACTTAT 
AAGGCGAAGACCATTCCGCCAGATTCAAGTCTTAGAGCCACTCAATTTGCTGATTACCAT 
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CTCAGGTCATAAGAACAGACTTCGGGTGTATCATCTGACCTGGTTGAGGAACAAGATTTT 

GAATAATGATCCAGAAAGTAAAAGAAGGCAAGAAGAAATGCTGAAGACAGAGGAAGCCTG 

CAAAGCTATTGATAAGTTAACAGGCTGTGAACACTTCAGTGTCCTCCAACATGAAGAAAC 

AACATATATTGCAATTGCTTTGAAATCATCAATTCACCTTTATGCATGGGCACCAAAGTC 

CTTTGATGAAAGCACTGCTATTAAAGTATTTCCAACACTTGATCATAAGCCAGTGACAGT 

TGACCTGGCTATTGGTTCTGAAAAAAGACTAAAGATTTTCTTCAGCTCAGCAGATGGATA 

TCACCTCATCGATGCAGAATCTGAGGTTATGTCTGATGTGACCCTGCCAAAGAATCCCCT 

GGAAATCATTATACCACAGAATATCATCATTTTACCTGATTGCTTGGGAATTGGCATGAT 

GCTCACCTTCAATGCTGAAGCCCTCTCTGTGGAAGCAAATGAACAACTCTTCAAGAAGAT 

CCTTGAAATGTGGAAAGACATACCATCTTCTATAGCTTTTGAATGTACACAGCGAACCAC 

AGGATGGGGCCAAAAGGCCATTGAAGTGCGCTCTTTGCAATCCAGGGTTCTGGAAAGTGA 

GCTGAAGCGCAGGTCAATTAAGAAGCTGAGATTCCTGTGCACCCGGGGTGACAAGCTGTT 

CTTTACCTCTACCCTGCGCAATCACCACAGCCG6GTTTACTTCATGACACTTGGAAAACT 

TGAAGAGCTCCAAAGCAATTATGATGTCTAAAAGTTTCCAGTGATTTATTACCACATTAT 

AAACATCATGTATAGGCAGTCTGCATCTTCAGATTTCAGAGATTAAATGAGTATTCAGTT 

TTATTTTTAGTAAAGATTAAATCCAAAACTTTACTTTTAATGTAGCACAGAATAGTTTTA 

ATGAGAAATGCAGCTTTATGTATAAAATTAACTATAGCAAGCTCTAGGTACTCCAATGGT 

GTACAATGTCTTTTGCACAAACTTTGTAACTTTTGTTACTGTGAATTCAAACATTACTCT 

TTGGACAGTTTGGACAGTATCTGTATTCAGATTTTACAACATGGAGTAAAGAAACCTGTT 

ATGAATTAGATTACAAGCAGCCTTCAAAAGAATTGGCACT6GGATAAGATTTTTCAGAAA 
AGAAAAACATCGGCAAACT 

SEQ ID NO: 17 KHS2 HUMAN 

CCGCCATGAACCCCGGCTTCGATTTGTCCCGCCGGAACCCGCAGGAGGACTTCGAGCTGA 
TTCAGCGCATCGGCAGCGGCACCTACGGCGACGTCTACAAGGCACGGAATGTTAACACTG 
GTGAATTAGCAGCAATTAAAGTAATAAAATTGGAACCAGGAGAAGACTTTGCAGTTGTGC 
AGCAAGAAATTATTATGATGAAAGACTGTAAACACCCAAATATTGTTGCTTATTTTGGAA 
GCTATCTCAGGCGAGATAAGCTTTGGATTTGCATGGAGTTTTGTGGAGGTGGTTCTTTAC 
AGGATATTTATCACGTAACTGGACCTCTGTCAGAACTGCAAATTGCATATGTTAGCAGAG 
AAACACTGCAGGGATTATATTATCTTCACAGTAAAGGAAAAATGCACAGAGATATAAAGG 
GAGCTAACATTCTATTAACGGATAATGGTCATGTGAAATTGGCTGATTTTGGAGTATCTG 
CACAGATAACAGCTACAATTGCCAAACGGAAGTCTTTCATTGGCACACCATATTGGATGG 
CTCCAGAAGTTGCAGCTGTTGAGAGGAAGGGGGGTTACAATCAACTCTGTGATCTCTGGG 
CAGTGGGAATCACTGCCATAGAACTTGCAGAGCTTCAGCCTCCTATGTTTGACTTACACC 
CAATGAGAGCATTATTTCTAATGACAAAAAGCAATTTTCAGCCTCCTAAACTAAAGGATA 
AAATGAAATGGTCAAATAGTTTTCATCACTTTGTGAAAATGGCACTTACCAAAAATCCGA 
AAAAAAGACCTACTGCTGAAAAATTATTACAGCATCCTTTTGTAACACAACATTTGACAC 
GGTCTTTGGCAATCGAGCTGTTGGATAAAGTAAATAATCCAGATCATTCCACTTACCATG 

Fig. 9L 

44/76 

SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: ^WO ?953036A2_(A> 



wo 99/53036 PCT/US99/08150 

ATTTCGATGATGATGATCCTGAGCCTCTTGTTGCTGTACCACATAGAATTCACTCAACAA 

GTAGAAACGTGAGAGAAGAAAAAACACGCTCAGAGATAACCTTTGGCCAAGTGAAATTTG 

ATCCACCCTTAAGAAAGGAGACAGAACCACATCATGAACTTCCCGACAGTGATGGTTTTT 

TGGACAGTTCAGAAGAAATATACTACACTGCAAGATCTAATCTGGATCTGCAACTGGAAT 

ATGGACAAGGACACCAAGGTGGTTACTTTTTAGGTGCAAACAAGAGTCTTCTCAAGTCTG 

TTGAAGAAGAATTGCATCAGCGAGGACACGTCGCACATTTAGAAGATGATGAAGGAGATG 

ATGATGAATCTAAACACTCAACTCTGAAAGCAAAAATTCCACCTCCTTTGCCACCAAAGC 

CTAAGTCTATCTTCATACCACAGGAAATGCATTCTACTGAGGATGAAAATCAAGGAACAA 

TCAAGAGATGTCCCATGTCAGGGAGCCCAGGAAAGCCATCCCAAGTTCCACCTAGACCAC 

CACCTCCCAGATTACCCCCACACAAACCTGTTGCCTTAGGAAATGGAATGAGCTCCTTCC 

AGTTAAATGGTGAACGAGATGGCTCATTATGTCAACAACAGAATGAACATAGAGGCACAA 

ACCTTTCAAGAAAAGAAAAGAAAGATGTACCAAAGCCTATTAGTAATGGTCTTCCTCCAA 

CACCTAAAGTGCATATGGGTGCATGTTTTTCAAAAGTTTTTAATGGGTGTCCCTTGAAAA 

TTCACTGTGCATCATCATGGATAAACCCAGATACAAGAGATCAGTACTTGATATTTGGTG 

CCGAAGAAGGGATTTATACCCTCAATCTTAATGAACTTCATGAAACATCAATGGAACAGC 

TATTCCCTCGAAGGTGTACATGGTTGTATGTAATGAACAATTGCTTGCTATCAATATCTG 

GTAAAGCTTCTCAGCTTTATTCCCATAATTTACCAGGGCTTTTTGATTATGCAAGACAAA 

TGCAAAAGTTACCTGTTGCTATTCCAGCACACAAACTCCCTGACAGAATACTGCCAAGGA 

AATTTTCTGTATCAGCAAAAATCCCTGAAACCAAATGGTGCCAGAAGTGTTGTGTTGTAA 

GAAATCCTTACACGGGCCATAAATACCTATGTGGAGCACTTCAGACTAGCATTGTTCTAT 

TAGAATGGGTTGAACCAATGCA6AAATTTATGTTAATTAAGCACATAGATTTTCCTATAC 

CATGTCCACTTAGAATGTTTGAAATGCTGGTAGTTCCTGAACAGGAGTACCCTTTAGTTT 

GTGTTGGTGTCAGTAGAGGTAGAGACTTCAACCAAGTGGTTCGATTTGAGACGGTCAATC 

CAAATTCTACCTCTTCATGGTTTACAGAATCAGATACCCCACAGACAAATGTTACTCATG 

TAACCCAACTGGAGAGAGATACCATCCTTGTATGCTTGGACTGTTGTATAAAAATAGTAA 

ATCTCCAAGGAAGATTAAAATCTAGCAGGAAATTGTCATCAGAACTCACCTTTGATTTCC 

AGATTGAATCAATAGTGTGCCTACAAGACAGTGTGCTAGCTTTCTGGAAACATGGAATGC 

AAGGTAGAAGTTTTAGATCTAATGAGGTAACACAAGAAATTTCAGATAGCACAAGAATTT 

TCAGGCTGCTTGGATCTGACAGGGTCGTGGTTTTGGAAAGTAGGCCAACTGATAACCCCA 

CAGCAAATAGCAATTTGTACATCCTGGCGGGTCATGAAAACAGTTACTGAGAATTGTTGT 

GCTTTGACAGTTAACTCTAGAAAGAAAGAACACTACCACTGCAACATTAATGGATGCTTG 

AAGCTGTACAAAAGCTGCAGTAACCTGTCTTCAGTTACTTTGTAATTTATTGTGGCATGA 

GATAAGATGGGGAAAATTTTGTTTTAAGTGGTATGGATATATTTAGCATATTGAACCACA 

CAAGTGCTTAATTCATTGTTATGTAATCTTTGTACATATAGGCAGTATTTTTTCTGTGAA 

ACTTCATATTGCTGAAGACATACACTAAGAATTTATGTAGATAATGTACTTTTATGAGAT 

GTACAAGTAAGTGTCTTATCTGTACAGATGTAAATGTTGATGAAAATGCAATTGGGGTTA 

ATATTTTAAGAATTCTTTAGTATATTCTTGGGTGTGGCTATATTACAAAATGGGATGCTG 

GCAATGAAACAATACATTTAACACTATTGTATTTTTATTATATGTAATTTAGTAATATGA 
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ATATAAATCTTGTAACTTTTAAAATTGTAATGGAGGCTGTAATCATTTTATAATCTTTTT 

AATTTTAATGCAAGTACACTGGTGTTTATATTTGCACAAAGTATTGATATGTGATGTATT 

AAGTCACAAAAGTAAGCTGTGACATTGTCTATAAGCATTTGGCTCCACAAATGTATTTGG 

ATTGTTTTCTATGTGAAGCAAACCAATTATAATTAACCACATGTTGTAGTAACTGGTCTT 

TTTATATTTAAGCAGAATCCTGTAAGATTGCTTGTCTTTGCTTAAAAACAATACCTTTGA 

ACATTTTTGAATCACAGAATAGCGGTACCATGATAGAATACTGCAATTGTGGTCAGAATT 

ACAGTATGCACAAAGAATTAATTAGCATTATTAAAGAGTCCTCACTAAACATTTCATATG 

ATCACACTGAAGAACTGTAACATTCCATAGAGTGAAGTGGTTCAAATTTCTCTTGGAATT 

TTTACTTTTGTTGGCCTTATTTTATGATCCTTTTCATATTTCTTTTGACTTA6AGTATTA 

ATACATGGCCAAAATAATTTAGTTACTACCTCATACAAACAATATAATGGTTACTACACA 

TCACAGGAACTTAGTTTTGGTTTAAGTCATTTTTGATTGCTTTTTTCCAATGGAATATGT 

ATATACCAGGTTTTAGCAAAATGCACACTTTTGGCTCTTTTTG6TATATGTTCTTTATAT 

TTTAATGTGAGTATATACACTAAGAACAAACTAAATTGTGATTTATGATCTTCATTTATT 

TTAATGATAATGGTTTTAAAATATGTTCCTGATTGTACATATTGTAAAATAAACATGTTT 
TTT 

SEQ ID NO: 19 SULUl HUMAN 

GGGAGGGTCCTTGTGGCGCCGGGCGGCGGGGTCCTGCGTGGAGAGTGGGACGCAACGCCG 

AGACCGCGAGCAGAGGCTGCGCACAGCCGGATCCGGCACTCAGCGACCGGACCCAAGGAT 

CCGCCGGGGAACAAGCCACAGGAGAGCGACTCAGGAACAAGTGTGGGAGAGGAAGCGGCG 

GCGGCGGCGCCGGGCCCGGGGGTGGTGACAGCAGGTCTGAGGTTGCATCATAAATACAAA 

GGACTGAAGTTATAAAAGAGAAAAGAGAAGTTTGCTGCTAAAATGAATCTGAGCAATATG 

GAATATTTTGTGCCACACACAAAAAGGTACTGAAGATTTACCCCCCAAAAAAAATTGTCA 

ATGAGAAATAAAGCTAACTGATATCAAAAAGCAGAGCCTGCTCTACTGGCCATCATGCGT 

AAAGGGGTGCTGAAGGACCCAGAGATTGACGATCTATTCTACAAAGATGATCCTGAGGAA 

CTTTTTATTGGTTTGCATGAAATTGGACATGGAAGTTTTGGAGCAGTTTATTTTGCTACA 

AATGCTCACACCAATGAGGTGGTGGCAATTAAGAAGATGTCCTATAGTGGGAAGCAGACC 

CATGAGAAATGGCAAGATATTCTTAAGGAAGTTAAATTTTTACGACAATTGAAGCATCCT 

AATACTATTGAGTACAAAGGCTGTTACTTGAAAGAACACACTGCTTGGTTGGTGATGGAA 

TATTGCTTAGGCTCAGCCTCTGATTTATTAGAAGTTCATAAAAAACCACTTCAGGAAGTG 

GAGATCGCTGCCATTACTCATGGAGCCTTGCATGGACTAGCCTACCTACATTCTCATGCA 

TTGATTCATAGGGATATTAAAGCAGGAAATATTCTTCTAACAGAGCCAGGTCAGGTAAAA 

CTAGCTGATTTTGGATCTGCTTCAATGGCTTCTCCTGCCAACTCCTTCGTGGGCACACCT 

TACTGGATGGCTCCAGAGGTGATCTTAGCTATGGATGAAGGACAGTATGATGGGAAAGTT 

GATATTTGGTCACTTGGCATCACTTGTATTGAATTGGCGGAACGGAAGCCGCCCCTTTTC 

AACATGAATGCAATGAGTGCCTTATATCACATTGCCCAGAATGACTCCCCAACGTTACAG 

TCTAATGAATGGACAGACTCCTTTAGGAGATTTGTTGATTACTGCTTGCAGAAAATACCT 

CAGGAAAGGCCAACATCAGCAGAACTATTAAGGCATGACTTTGTTCGACGAGACCGGCCA 
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CTACGTGTCCTCATTGACCTCATACAGAGGACAAAAGATGCAGTTCGTGAGCTAGATAAC 
CTACAGTACCGAAAAATGAAAAAAATACTTTTCCAAGAGACACGGAATGGACCCTTGAAT 
GAGTCACAGGAGGATGAGGAAGACAGTGAACATGGAACCAGCCTGAACAGGGAAATGGAC 
AGCCTGGGCAGCAACCATTCCATTCCAAGCATGTCCGTGAGCACAGGCAGCCAGAGCAGC 
AGTGTGAACAGCATGCAGGAAGTCATGGACGAGAGCAGTTCCGAACTTGTCATGATGCAC 
GATGACGAAAGCACAATCAATTCCAGCTCCTCCGTCGTGCATAAGAAAGATCATGTATTC 
ACAAGGGATGAGGCGGGCCACGGCGATCCCAGGCCTGAGCCGCGGCCTACCCAGTCAGTT 
CAGAGCCAGGCCCTCCACTACCGGAACAGAGAGCGCTTTGCCACGATCAAATCAGCATCT 
TTGGTTACACGACAGATCCATGAGCATGAGCAGGAGAACGAGTTGCGGGAACAGATGTCA 
GGTTATAAGCGGATGCGGCGCCAGCACCAGAAGCAGCTGATCGCCCTGGAGAACAAGCTG 
AAGGCTGAGATGGACGAGCACCGCCTCAAGCTACAGAAGGAGGTGGAGACGCATGCCAAC 
AACTCGTCCATCGAGCTGGAGAAGCTGGCCAAGAAGCAAGTGGCTATCATAGAAAAGGAG 
GCAAAGGTAGCTGCAGCAGATGAGAAGAAGTTCCAGCAACA6ATCTTGGCCCAGCAGAAG 
AAAGATTTGACAACTTTCTTAGAAAGTCAGAAGAAGCAGTATAAGATTTGTAAGGAAAAA 
ATAAAAGAGGAAATGAATGAGGACCATAGCACACCCAAGAAAGAGAAGCAAGAGCGGATC 
TCCAAACATAAAGAGAACTTGCAGCACACACAGGCTGAAGAGGAAGCCCACCTTCTCACT 
CAACAGAGACTGTACTACGACAAAAATTGTCGTTTCTTCAAGCGGAAAATAATGATCAAG 
CGGCACGAGGTGGAGCAGCAGAACATTCGGGAGGAACTAAATAAAAAGAGGACCCAGAAG 
6AGATGGAGCATGCCATGCTAATCCGGCACGACGAGTCCACCCGAGAGCTAGAGTACAGG 
CAGCTGCACACGTTACAGAAGCTACGCATGGATCTGATCCGTTTACAGCACCAGACGGAA 
CTGGAAAACCAGCTGGAGTACAATAAGAGGCGAGAAAGAGAACTGCACAGAAAGCATGTC 
ATGGGACTTCGGCAACAGCCAAAAAACTTAAAGGCCATGGAAATGCAAATTAAAAAACAG 
TTTCAGGACACTTGCAAAGTACAGACCAAACAGTATAAAGCACTCAAGAATCACCAGTTG 
GAAGTTACTCCAAAGAATGAGCACAAAACAATCTTAAAGACACTGAAAGATGAGCAGACA 
AGAAAACTTGCCATTTTGGCAGAGCAGTATGAACAGAGTATAAATGAAATGATGGCCTCT 
CAAGCGTTACGGCTAGATGAGGCTCAAGAAGCAGAATGCCAGGCCTTGAGGCTACAGCTC 
CAGCAGGAAATGGAGCTGCTCAACGCCTACCAGAGCAAAATCAAGATGCAAACAGAGGCA 
CAACATGAACGTGAGCTCCAGAAGCTAGAGCAGAGAGTGTCTCTGCGCAGAGCACACCTT 
GAGCAGAAGATTGAAGAGGAGCTGGCTGCCCTTCAGAAGGAACGCAGCGAGAGAATAAAG 
AACCTATTGGAAAGGCAAGAGCGAGAGATTGAAACTTTTGACATGGAGAGCCTCAGAATG 
GGATTTGGGAATTTGGTTACATTAGATTTTCCTAAGGAGGACTACAGATGAGATTAAATT 
TTTTGCCATTTACAAAAAAAAAAAAAAAAAAGAAAACAGAAAAAAATTCAGACCCTGCAA 
AACCACATTCCCCATTTTAACGGGCGTTGCTCTCACTCTCTCTCTCTCTTACTCTTACTG 
ACATCGTGTCGGACTAGTGCCTGTTTATTCTTACTCCATCAGGGGCCCCGTTCCTCCCCC 
CGTGTCAACTTTCAGTGCTGGCCAAAACCTGGCCGTCTCTTCTATTCACAGTACACGTCA 
CAGTATTGATGTGATTCAAAATGTTTCAGTGAAAACTTTGGAGACAGTTTTAACAAAACC 
AATAAACCAACAACAAAAAAAGTGGATGTATATTGCTTTAAGCAATCACTCATTACCACC 
AATCTGTGAAAGTAAAGCAAAAAATAATAATAATAAATGCCAAGGGGGAGAGAGACACAA 
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TATCCGCAGCCTTACACCTTAACTAGCTGCTGCATTATTTTATTTTATTTTATTTTTTTG 
GTATTTATTCATCAGGAATAAAAAAAACAAAGTTTTATTAAAGATTGAAAATTTGATACA 
TTTTACAGAAACTAATTGTGATGTACATATCAGTGGTGACATATTATTACTTTTTTGGGG 
ACGGGGGGTGGGTGGGGTGAAGAGATCTTGTGATTTTTAAGAACCTGCTGGCAAGAGTT 
AACTTGTCTTCAGCATATTCTGATTGTATCATAATCATTTTCTGCTGTTGCAGAGGATGT 
GAATACACTTAAGGAGCTCACAGAATCCCA6TAGCACAAATTGGGCTTTGGCAAATCGTG 
TATTTTGTGTATAGAAGGAATTTAAGGAGAGGTATTACTTATTTTCATATTGTATTTTAA 
CTGTTTCTCTGATCAAATTTTTTTACTTCCTCCTCCTGTTCCTCCCCACCTCCCTCCTTT 
TCCAGTTCAGTATTTGGAGTTCAACACTGTCTCTCAATCAGATCATCTTGATCTTTTTCT 
TTATCTCCCTTCCCCTTCCTAAGTCCCATTTCTTG6TCATAAATATTGCATTATTCACAC 
TTTCAAACTGTGTATTTTCTTACAATAAAAAATGATGAAAAAAAAAAAAAAAAAAA 

SEQ ID NO: 20 SULU3 HUMAN 

TATTGAATTGGCGGAACGGAAGCCTCCTTTATTTAATATGAATGCAATGAGTGCCTTATA 

TCACATAGCCCAAAATGAATCCCCTACACTACAGTCTAATGAATGGTCTGATTATTTTCG 

CAACTTTGTAGATTCTTGCCTCCAGAAAATCCCTCAAGATCGACCTACATCAGAGGAACT 

TTTAAAGCACATATTTGTTCTTCGGGAGCGCCCTGAAACCGTGTTAATAGATCTCATTCA 

GAGGACAAAGGATGCAGTAAGAGAGCTGGACAATCTGCAGTATCGAAAGATGAAGAAACT 

CCTTTTCCAGGAGGCACATAATGGACCAGCAGTAGAAGCACAGGAAGAAGAAGAGGAACA 

AGATCATGGTGTTGGCCGGACAGGAACAGTTAATAGTGTTGGAAGTAATCAATCCATTCC 

CAGCATGTCCATCAGTGCCAGCAGCCAAAGCAGTAGTGTTAACAGTCTTCCA6ATGTCTC 

AGATGACAAGAGT6AGCTAGACATGATGGAGGGAGACCACACAGTGATGTCTAACAGTTC 

TGTTATCCATTTAAAACCAGAGGAAGAAAATTACAGAGAAGAGGGAGATCCTAGAACAAG 

AGCATCAGATCCACAATCTCCACCCCAAGTATCTCGTCACAAATCACACTATCGTAATCG 

AGAACACTTTGCTACTATACGGACAGCATCACTGGTTACGAGGCAAATGCAAGAACATGA 

GCAGGACTCTGAGCTTA6AGAACAAATGTCTGGCTATAAGCGAATGAGGCGACAACATCA 

AAAGCAACTGATGACTCTGGAAAACAAGCTAAAGGCTGAGATGGATGAACATCGCCTCAG 

ATTAGACAAAGATCTTGAAACTCAGCGTAACAATTTTGCTGCAGAAATGGAGAAACTTAT 

CAAGAAACACCAGGCTGCCATGGAGAAAGAGGCTAAAGTGATGTCCAATGAAGAGAAAAA 

ATTTCAGCAACATATTCAGGCCCAACAGAAGAAAGAACTGAATAGTTTTCTCGAGTCCCA 

GAAAAGAGAGTATAAACTTCGAAAAGAGCAGCTTAAAGAGGAGCTAAATGAAAACCAGAG 

TACCCCCAAAAAAGAAAAACAGGAGTGGCTTTCAAAGCAGAAGGAGAATATACAGCATTT 

CCAAGCAGAAGAAGAAGCTAACCTTCTTCGACGTCAAAGACAATACCTAGAGCTGGAATG 

CCGTCGCTTCAAGAGAAGAATGTTACTTGGGCGTCATAACTTAGAGCAGGACCTTGTCAG 

GGAGGAGTTAAACAAAAGACAGACTCAGAAGGACTTAGAGCATGCCATGCTACTCCGACA 

GCATGAATCTATGCAAGAACTGGAGTTCCGCCACCTCAACACAATTCAGAAGATGCGCTG 

TGAGTTGATCAGATTACAGCATCAAACTGAGCTCACTAACCAGCTGGAATATAATAAGCG 

AAGAGAACGAGAACTAAGACGAAAGCATGTCATGGAAGTTCGACAACAGCCTAAGAGTTT 
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wo 99/53036 PCT/US99/08150 

gaagtctaaagaactccaaataaaaaagcagtttcaggataccjgcaAaatccaaaccag 
acagtacaaagcattaagaaatcacctgctggagactacaccaaagagtgagcacaaagc 
tgttctgaaacggctcaaggaggaacagacccggaaattagctatcttggctgagcagta 

TGATCACAGCATTAATGAAATGCTCTCCACACAAGCCCTGCGTTTGGATGAAGCACAGGA 

AGCAGAGTGCCAGGTTTTGAAGATGCAGCTGCAGCAGGAACTGGAGCTGTTGAATGCGTA 

TCAGAGCAAAATCAAGATGCAAGCTGAGGCACAACATGATCGAGAGCTTCGCGAGCTTGA 

ACAGAGGGTCTCCCTCCGGAGGGCACTCTTAGAACAAAAGATTGAAGAAGAGATGTTGGC 

TTTGCAGAATGAGCGCACAGAACGAATACGAAGCCTGTTGGAACGTCAAGCCAGAGAGAT 

TGAAGCTTTTGACTCTGAAAGCATGAGACTAGGTTTTAGTAATATGGTCCTTTCTAATCT 

CTCCCCTGAGGCATTCAGCCACAGCTACCCGGGAGCTTCTGGTTGGTCACACAACCCTAC 

TGGGGGTCCAGGACCTCACTGGGGTCATCCCATGGGTGGCCCACCACAAGCTTGGGGCCA 

TCCAATGCAAGGTGGACCCCAGCCATGGGGTCACCCTTCAGGGCCAATGCAAGGGGTACC 

TCGAGGTAGCAGTATGGGAGTCCGCAATAGCCCCCAGGCTCTGAGGCGGACAGCTTCTGG 

GGGACGGACGGAGCAGGGCATGAGCAGAAGCACGAGTGTCACTTCACAAATATCCAATGG 

GTCACACATGTCTTATACATAACTTAATAATTGAGAGTGGCAATTCCGCTGGAGCTGTCT 

GCCAAAAGAAACTGCCTACAGACATCATCACAGCAGCCTCCTCACTTGGGTACTACAGTG 

TGGAAGCTGAGTGCATATGGTATATTTTATTCATTTTTGTAAAGCGTTCTGTTTTGTGTT 

TACTAATTGGGATGTCATAGTACTTGGCTGCCGGGTTTGTTTGTTTTTGGGGAAATTTTG 

AAAAGTGGAGTTGATATTAAAAATAAATGTGTATGTGTGTACATATATATACACACACAT 

ACACATATATTATGCATGTGGTGAAAAGAATTGGCTAGATAGGGGATTTTTCTGAACACT 

GCAAAAATAGAACGTAGCAAAATGGCTTCAGTTATCACTTTTGGGTGTCTGTATCCTAAG 

AAGTTTCTGAAAAGATCTAAAGCCTTTTTATCCCATATCCCAAATTCTTATGAGCCACTC 

ACAGCAGGCAGCATATGTTGAAATAAGTTATTACTGGTACACACCTGCATTGCCTCACCA 

GTGTATTTATTTGTTATTAAATTGATCTGACTTCTCAGCCTCATTTGGACTAAAAAAAGA 

AAGGAGAAATCCATGAACACATTGCTTCTCGGCCTTTTGGCTAAGATCAAGTGTAGAAAT 

CGATGAACACTAAAGGACTTCATTGATTTTTTCAGAGAGTAGAAAACAACTTAGTTTTTC 

TTTTTTCCTGAATGCGTCATAGGCTTGTGAGTGATTTTTGTCCATTCAATTGTGCCTTCT 

TTGTATTATGATAAGATGGGGGTACTTAAGGAGATCACAAGTTGTGTGAGGATTGCATTA 

ACAAACCTATGAGCCTTCAATGGGGAAGACCAGAAGGGTGA6AGGGGCCCTGAAAGTTCA 

TATGGTGGGTATGTCCCGCAGCAGAGTGAGGAGATGAAGCTTACGTGTCCTGACGTTTTG 

TTGCTTATACTGTGATATCTCATCCTAGCTAAGCTCTATAATGCCCAAGACCCCAAACAG 

TACTTTTACTTTGTTTGTACAAAAACAAAGACATATAGCCAATACAAATCAAATGCCGGA 

GGTGTTTGATGCCATATTTGCAAATTGCCATCTATTGAAATTCTCGTCACACTACATAGA 

CATAATTGTTATCTCCTTTTGGCTTATGTGATTTTCTGTTTACAAGTAGAATAGCCAATT 

ATTTAAATGTTTAGTTGCCACAGTGAACCAGGAGTCACTGAGCCAATGACTTTACCAGCT 

GCTGACTAATCTTCATCACCACTGTAGATTTTGCTGCATGTGCAGGTCCTCTATTTTTAA 

TTGCTGTTTTCGTTGCTGCAGTACTTTACAAACTTCTAGTTCGTTGAGACTTAGTGACCA 

TTTGGCATCAAGTTAACATCACACAATAGGAAACACCACTTCCACAAGTCTCAAGCCTCA 
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GTGCTAAAGTACTACTGAAAAGGAACTAGGAAGTTTGGCCAATT 
SEQ ID NO: 21 SULU3 MURINE 

GCAGGATGCCATCAACTAACAGAGCAGGCAGTCTAAAGGACCCTGAAATTGCAGAGCTCT 

TCTTCAAAGAAGATCCGGAAAAGCTCTTCACAGATCTCAGAGAAATCGGCCATGGGAGCT 

TTGGAGCAGTATATTTTGCACGAGATGTGCGTACTAATGAAGTGGTGGCCATCAAGAAAA 

TGTCTTATAGTGGAAAGCAGTCTACTGAGAAATGGCAGGATATTATTAAGGAAGTCAAGT 

TTCTACAAAGAATAAAACATCCCAACAGTATAGAATACAAAGGCTGCTATTTACGTGAAC 

ACACAGCATGGCTTGTAATGGAATATTGTTTAGGATCTGCTTCAGATTTATTAGAAGTTC 

ATAAAAAGCCATTACAAGAAGTGGAAATAGCAGCAATTACACATGGTGCTCTCCAGGGAC 

TAGCTTATTTACATTCTCATACCATGATCCATAGAGATATCAAAGCAGGAAATATCCTTC 

TGACAGAACCAGGCCAAGTGAAACTTGCTGACTTTGGATCTGCTTCCATGGCTTCCCCTG 

CCAATTCTTTTGTGGGAACACCATATTGGATGGCCCCAGAAGTAATTTTAGCCATGGATG 

AAGGACAGTATGATGGCAAAGTTGATGTATGGTCTCTTGGAATAACGTGTATTGAATTAG 

CCGA<JAGGAAGCGTCCTTTATTTAATATGAATGCAATGAGTGCCTTATATCACATAGCCC 

AAAATGAATCCCCTACACTACAATCTAATATGAATGATTCTTGCCTCCAGAAAATCCCTC 

AAGATCGCCCTACATCAGAGGAACTTTTAAAGCACATGTTTGTTCTTCGAGAGCGCCCTG 

AAACAGTGTTAATAGATCTTATTCAAAGGACAAAGGATGCAGTAAGAGAGCTGGACAATC 

TGCAGTATCGAAAGATGAAGAAACTCCTTTTCCAG6AGGCACATAATGGGCCAGCGGTAG 

AAGCACAGGAAGAAGAGGAGGAGCAAGATCATGGTGTTGGCCGAACAGGAACAGTGAATA 

GTGTTGGAAGCAATCAGTCTATCCCTAGTATGTCTATCAGTGCCAGCAGTCAAAGCAGCA 

GTGTTAATAGTCTTCCAGATGCATCAGATGACAAGAGTGAGCTAGACATGATGGAGG6AG 

ACCATACAGTGATGTCTAACAGTTCTGTCATCCACTTAAAACCTGAGGAGGAAAATTACC 

AGGAAGAAGGAGATCCTAGAACAAGAGCATCAGACCCACAGTCTCCCCCTCAGGTGTCTC 

GTCACAAGTCACATTATCGTAATAGAGAACACTTTGCAACCATACGAACAGCATCACTGG 

TTACAAGACAGATGCAAGAACATGAGCAGGACTCTGAACTTAGAGAACA6ATGTCTGGTT 

ATAAGCGGATGAGGCGACAGCATCAAAAGCAGCTGATGACGCTGGAAAATAAACTGAAGG 

CAGAGATGGACGAACATCGGCTCAGATTAGACAAAGATCTTGAAACTCAGCGTAACAATT 

TCGCTGCAGAAATGGAGAAACTTATTAAGAAACACCAAGCTGCTATGGAAAAAGAGGCTA 

AAGTGATGGCCAATGAGGAGAAAAAATTCCAGCAACACATTCAGGCTCAACAGAAAAAAG 

AACTGAATAGCTTTTTGGAGTCTCAAAAAAGAGAATATAAACTTCGCAAAGAGCAGCTTA 

AGGAGGAGCTGAATGAAAACCAGAGCACACCTAAAAAAGAAAAGCAGGAATGGCTTTCAA 

AGCAGAAGGAGAATATACAGCATTTTCAGGCAGAAGAAGAAGCTAATCTTCTTCGACGTC 

AAAGGCAGTATCTAGAGCTAGAATGTCGTCGCTTCAAAAGAAGAATGTTACTTGGGCGAC 

ATAACTTGGAACAGGACCTTGTCAGGGAGGAGTTAAACAAAAGGCAGACTCAAAAGGACT 

TGGAACATGCAATGCTATTGCGACAGCATGAATCAATGCAAGAACTGGAGTTTCGCCATC 

TCAACACTATTCAGAAGATGCGCTGTGAGTTGATCAGACTGCAGCATCAAACTGAGCTCA 

CTAACCAGCTAGAGTACAATAAGAGAAGGGAACGGGAACTGAGGCGAAAACATGTCATGG 
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AAGTTCGACAACAACCTAAGAGTCTGAAGTCTAAAGAACTCCAAATAAAAAAGCAGTTTC 
AGGATACCTGCAAAATTCAAACCAGACAGTACAAAGCATTAAGGAATCACCTACTGGAGA 
CTACACCAAAGAATGAGCACAAAGCAATC 

SEQ ID NO: 25 GEK2 HUMAN 

CGAAGCCACAGCCCGAGCCCGAGCCCGAGCCCGAGCCGGCGCCACCGCGCCCCCGGCCAT 
GGCTTTTGCCAATTTCCGCCGCATCCTGCGCCTGTCTACCTTCGAGAAGAGAAAGTCCCG 
CGAATATGAGCACGTCCGCCGCGACCTGGACCCCAACGAGGTGTGGGAGATCGTGGGCGA 
GCTGGGCGACGGCGCCTTCGGCAAGGTTTACAAGGCCAAGAATAAGGAGACGGGTGCTTT 
GGCTGCGGCCAAAGTCATTGAAACCAAGAGTGAGGAGGAGCTGGAGGACTACATCGTGGA 
GATTGAGATCCTGGCCACCTGCGACCACCCCTACATTGTGAAGCTCCTGGGAGCCTACTA 
TCACGACGGGAAGCTGTGGATCATGATTGAGTTCTGTCCAGGGGGAGCCGTGGACGCCAT 
CATGCTGGAGCTGGACAGAGGCCTCACGGAGCCCCAGATACAGGTGGTTTGCCGCCAGAT 
GCTAGAAGCCCTCAACTTCCTGCACAGCAAGAGGATCATCCACCGAGATCTGAAAGCTGG 
CAACGTGCTGATGACCCTCGAGGGAGACATCAGGCTGGCTGACTTTGGTGT6TCTGCCAA 
GAATCTGAAGACTCTACAGAAACGAGATTCCTTCATCGGCACGCCTTACTGGATGGCCCC 
CGAGGTGGTCATGTGTGAGACCATGAAAGACAC6CCCTACGACTACAAAGCCGACATCTG 
GTCCCTGGGCATCACGCTGATTGAGATGGCCCAGATC6AGCCGCCACACCACGAGCTCAA 
CCCCATGCGGGTCCTGCTAAAGATCGCCAAGTCGGACCCTCCCACGCTGCTCACGCCCTC 
CAAGTGGTCTGTAGAGTTCCGTGACTTCCTGAAGATAGCCCTGGATAAGAACCCAGAAAC 
CCGACCCAGTGCCGCGCAGCTGCTGGAGCATCCCTTCGTCAGCAGCATCACCAGTAACAA 
GGCTCTGCGGGAGCTGGTGGCTGAGGCCAAGGCCGAGGTGATGGAAGAGATCGAAGACGG 
CCGGGATGAGGGGGAAGAGGAGGACGCCGTGGATGCCGCCTCCACCCTGGAGAACCATAC 
TCAGAACTCCTCTGAGGTGAGTCCGCCAAGCCTCAATGCTGACAAGCCTCTCGAGGAGTC 
ACCTTCCACCCCGCTGGCACCCAGCCAGTCTCAGGACAGTGTGAATGAGCCCTGCAGCCA 
GCCCTCTGGGGACAGATCCCTCCAAACCACCAGTCCCCCAGTCGTGGCCCCTGGAAATGA 
GAACGGCCTGGCAGTGCCTGTGCCCCTGCGGAAGTCCCGACCCGTGTCAATGGATGCCAG 
AATTCAGGTAGCCCAGGAGAAGCAAGTTGCTGAGCAGGGTGGGGACCTCAGCCCAGCAGC 
CAACAGATCTCAAAAGGCCAGCCAGAGCCGGCCCAACAGCAGCGCCCTGGAGACCTTGGG 
TGGGGAGAAGCTGGCCAATGGCAGCCTGGAGCCACCTGCCCAGGCAGCTCCAGGGCCTTC 
CAAGAGGGACTCGGACTGCAGCAGCCTCTGCACCTCTGAGAGCATGGACTATGGTACCAA 
TCTCTCCACTGACCTGTCGCTGAACAAAGAGATGGGCTCTCTGTCCATCAAGGACCCGAA 
ACTGTACAAAAAAACCCTCAAGCGGACACGCAAATTTGTGGTGGATGGTGTGGAGGTGAG 
CATCACCACCTCCAAGATCATCAGCGAAGATGAGAAGAAGGATGAGGAGATGAGATTTCT 
CAGGCGCCAGGAACTCCGAGAGCTTCGGCTGCTCCAGAAAGAAGAGCATCGGAACCAGAC 
CCAGCTGAGTAACAAGCATGAGCTGCAGCTGGAGCAAATGCATAAACGTTTTGAACAGGA 
AATCAACGCCAAGAAGAAGTTCTTTGACACGGAATTAGAGAACCTGGAGCGTCAGCAAAA 
GCAGCAAGTGGAGAAGATGGAGCAA6ACCATGCCGTGCGCCGCCGGGAGGAGGCCAGGCG 
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GATCCGCCTGGAGCAGGATCGGGACTACACCAGGTTCCAAGAGCAGCTCAAACTGATGAA 

GAAAGAGGTGAAGAACGAGGTGGAGAAGCTCCCCCGACAGCAGCGGAAGGAAAGCATGAA 

GCAGAAGATGGAGGAGCACACGCAGAAAAAGCAGCTTCTTGACCGGGACTTTGTAGCCAA 

GCAGAAGGAGGACCTGGAGCTGGCCATGAAGAGGCTCACCACCGACAACAGGCGGGAGAT 

CTGTGACAAGGAGCGCGAGTGCCTCATGAAGAAGCAGGAGCTCCTTCGAGACCGGGAAGC 

AGCCCTGTGGGAGATGGAAGAGCACCAGCTGCAGGAGAGGCACCAGCTGGTGAAGCAGCA 

GCTCAAAGACCAGTACTTCCTCCAGCGGCACGAGCTGCTGCGCAAGCATGAGAAGGAGCG 

GGAGCAGATGCAGCGCTACAACCAGCGCATGATAGAGCAGCTGAAGGTGCGGCAGCAACA 

GGAAAAGGCGCGGCTGCCCAAGATCCAGAGGAGTGAGGGCAAGACGCGCATGGCCATGTA 

CAAGAAGAGCCTCCACATCAACGGCGGGGGCAGCGCAGCTGAGCAGCGTGAGAAGATCAA 

GCAGTTCTCCCAGCAGGAGGAGAAGA6GCAGAAGTCGGAGCGGCTGCAGCAACAGCA6AA 

ACACGAGAACCAGATGCGGGACATGCTGGCGCAGTGCGAGAGCAACATGAGCGAGCTGCA 

GCAGCTGCAGAATGAAAAGTGCCACCTCCTGGTAGAGCACGAAACCCAGAAACTGAAGGC 
CCTGGATGAGAGCCATAACCA6AACCTGAAGGAAT 

SEQ ID NO: 27 PAK4 HUMAN 

CGTTCCTGGGCTTCCCGCTCCGCAGGCCTGCGGAGGACTGGCCCAGCAAGGTCCCAGGTC 

TTCCCTCTCCTTAGCGCCTAAGAGAGAGGCCCAGTGCGGGTGAGGAGTCGCGAGGAAGAG 

6CGGAAGGCGCCGGAAGGCACCATGTTCCGCAAGAAAAAGAAGAAACGCCCTGAGATCTC 

AGCGCCACAGAACTTCCAGCACCGTGTCCACACCTCCTTCGACCCCAAAGAAGGCAA6TT 

TGTGGGCCTCCCCCCACAATGGCAGAACATCCTGGACACACTGCGGCGCCCCAAGCCCGT 

GGTGGACCCTTCGCGAATCACACGGGTGCA6CTCCAGCCCATGAAGACAGTGGTGCGGGG 

CAGCGCGATGCCTGTGGATGGCTACATCTCGGGGCTGCTCAACGACATCCAGAAGTTGTC 

AGTCATCAGCTCCAACACCCTGCGTGGCCGCAGCCCCACCAGCCGGCGGCGGGCACAGTC 

CCTGGGGCTGCTGGGGGATGAGCACTGGGCCACCGACCCAGACATGTACCTCCAGAGCCC 

CCAGTCTGAGCGCACTGACCCCCACGGCCTCTACCTCAGCTGCAACGGGGGCACACCAGC 

AGGCCACAAGCAGATGCCGTGGCCCGAGCCACAGAGCCCACGGGTCCTGCCCAATGGGCT 

GGCTGCAAAGGCACAGTCCCTGGGCCCCGCCGAGTTTCAGGGTGCCTCGCAGCGCTGTCT 

GCAGCTGGGTGCCTGCCTGCAGAGCTCCCCACCAGGAGCCTCGCCCCCCACGGGCACCAA 

TAGGCATGGAATGAAGGCTGCCAAGCATGGCTCTGAGGAGGCCCGGCCACAGTCCTGCCT 

GGTGGGCTCAGCCACAGGCAGGCCAGGTGGGGAAGGCAGCCCTAGCCCTAAGACCCGGGA 

6AGCAGCCTGAAGCGCAGGCTATTCCGAAGCATGTTCCTGTCCACTGCTGCCACAGCCCC 

TCCAAGCAGCAGCAAGCCAGGCCCTCCACCACAGAGCAAGCCCAACTCCTCTTTCCGACC 

GCCGCAGAAAGACAACCCCCCAAGCCTGGTGGCCAAGGCCCAGTCCTTGCCCTCGGACCA 

GCCGGTGGGGACCTTCAGCCCTCTGACCACTTCGGATACCAGCAGCCCCCAGAAGTCCCT 

CCGCACAGCCCCGGCCACAGGCCAGCTTCCAGGCCGGTCTTCCCCAGCGGGATCCCCCCG 

CACCTGGCACGCCCAGATCAGCACCAGCAACCTGTACCTGCCCCAGGACCCCACGGTTGC 

CAAGGGTGCCCTGGCTGGTGAGGACACAGGTGTTGTGACACATGAGCAGTTCAAGGCTGC 
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GCTCAGGATGGTGGTGGACCAGGGTGACCCCCGGCTGCTGCTGGACAGCTACGTGAAGAT 

TGGCGAGGGCTCCACCGGCATCGTCTGCTTGGCCCGGGAGAAGCACTCGGGCCGCCAGGT 

GGCCGTCAAGAT6ATGGACCTCAGGAAGCAGCAGCGCAGGGAGCTGCTCTTCAACGAGGT 

GGTGATCATGCGGGACTACCAGCACTTCAACGTGGTGGAGATGTACAAGAGCTACCT6GT 

GGGCGAGGAGCTGTGGGTGCTCATGGAGTTCCTGCAGGGAGGAGCCCTCACAGACATCGT 

CTCCCAAGTCAGGCTGAATGAGGAGCAGATTGCCACTGTGTGTGAGGCTGTGCTGCAGGC 

CCTGGCCTACCTGCATGCTCAGGGTGTCATCCACCGGGACATCAAGAGTGACTCCATCCT 

GCTGACCCTCGATGGCAGGGTGAAGCTCTCGGACTTCGGATTCTGTGCTCAGATCAGCAA 

AGACGTCCCTAAGAGGAAGTCCCTGGTGGGAACCCCCTACTGGATGGCTCCTGAAGTGAT 

CTCCAGGTCTTTGTATGCCACTGAGGTGGATATCTGGTCTCTGGGCATCATGGTGATTGA 

6ATGGTAGATGGGGAGCCACCGTACTTCAGTGACTCCCCAGTGCAAGCCATGAAGAGGCT 

CCGGGACAGCCCCCCACCCAAGCTGAAAAACTCTCACAAGGTCTCCCCAGTGCTGCGAGA 

CTTCCTGGAGCGGATGCTGGTGCGGGACCCCCAAGAGAGAGCCACAGCCCAGGAGCTCCT 

AGACCACCCCTTCCTGCTGCAGACAGGGCTACCTGAGTGCCTGGTGCCCCTGATCCAGCT 

CTACCGAAAGCAGACCTCCACCTGCTGAGCCCACCCCAAGTATGCCTGCCACCTACGCCC 

ACAGGCAGGGCACACTGGGCAGCCAGCCTGCCGGCAGGACTTGCCTGCCTCCTCCTCTCA 

6TATTCTCTCCAAAGATTGAAATGTGAAGCCCCAGCCCCACCCTCTGCCCTTCAGCCTAC 

TGGGCCAGGCCGGACCTGCCCCCTCAGTGTCTCTCCCTCCCGAGTCCCCAGATGGAGACC 

CCTTTCTACAGGATGACCCCTTGATATTTGCACAGGGATATTTCTAAGAAACGCAGAG6C 

CAGCGTTCCTGGCCTCTGCAGCCAACACAGTAGAAAAGGCTGCTGTGGTTTTTTAAAGGC 

AGTTGTCCACTAGTGTCCTAGGCCACTGCAGAGGGCAGACTGCTGGTCTCCACAGATACC 

TGCTGTTCTCAGCTCCA6CTTCAAACCTC6AGTCTCGAGAGGGCCACGGGGTGGTTTTTA 

TGACCGGAATCCCGCTTCCTCCCTCACGTCTGATGTCCTGAAGGTGCAGTCCCACCTGTA 

CAGCCCCTCCCCGCCAAGAACTGTGAATGGCCTGCTCCAGGCCATGGCTGGGGGCAGGGA 

GTGAGGGGACAATTTCTGAGTGAAAGAGAAAGAATGGGGTCGGTGGTGAAGGTGCTCTCA 

CTTTACAGAATGGAGAGAACATCGTGTGTGTGTGTGTGT6TGTGTGTGTGTGTGTGTGTG 

TGTGTGTGTGTGTGTGTGTGTGTGTGTAAGGGGAGGAAAGCCACCTTGACAGCCCAGGTC 

CCTCCAGGTCACCCACAGCCAGTTTCAGGAAGGCTGCCCCTCTCTCCCACTAAGTTCTGG 

CCTGAAGGGACCTGCTTTCTTGGCCTGGCTTCCACCTCTCCACTCCTGTGTCTACCTGGC 

CAGTGGAGTGGTCCATGCTAAGTCTAACACTCCTGGGAGCTCAGGAGGCTTCTGAGCTTC 

TCCTGTACTGTGCATCGTGAGGGCCAGAGACAGGAATGTAAGGATTGGCAACTGTGTTAC 

CTTTCAAGTTTATCTCAATAACCAGGTCATCAGGGACCCATTGTTCTCTTCAGAACCCTA 

TCTGGGAGAGAAGGCGAACCACCTCCGGGTTTCCATCATGTCAAGGTCACAGGCATCCAT 

GTGTGCAAACCATCTGCCCCAGCTGCCTCCACAGACTGCTGTCTCCTTGTCCTCCTCGGC 

CCTGCCCCACTTCAGGGCTGCTGTGAGATGGAATTCCAGGAAAGAACTTCAGGTGTCTGG 

ACCCTTTCTATCTAGATAATATTTTTAGATTCTTCTGCTCCCTAGTGACCTACCTGGGGG 

CAAAGAAATTGCAAGGACTTTTTTTTAAGGGTCA6AGTTTTCAAAACAAAAGCATCTTCC 

CTAGAAATTTTTGTGAATTGTTTGCACTTGTGCCTGTTTTAAATTAAATTGAGTGTTCAA 
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AGCC 

SEQ ID NO: 28 PAK5 HUMAN 

GGCCAGTGGGGCGAAACTGGCAGCTGGCCGGCCCTTTAACACCTACCCGAGGGCTGACAC 
GGACCACCCATCCCGGGGTGCCCAGGGGGAGCCTCATGACGTGGCCCCTAACGGGCCATC 
AGCGGGGGGCCTGGCCATCCCCCAGTCCTCCTCCTCCTCCTCCCGGCCTCCCACCCGAGC 
CCGAGGTGCCCCCAGCCCTGGAGTGCTGGGACCCCACGCCTCAGAGCCCCAGCTGGCCCC 
TCCAGCCTGCACCCCCGCCGCCCCTGCTGTTCCTGGGCCCCCTGGCCCCCGCTCACCACA 
GCGGGAGCCACAGCGAGTATCCCATGAGCAGTTCCGGGCTGCCCTGCAGCTGGTGGTGGA 
CCCAGGCGACCCCCGCTCCTACCTGGACAACTTCATCAAGATTGGCGAG6GCTCCACGGG 
CATCGTGTGCATCGCCACCGTGCGCAGCTCGGGCAAGCTGGTGGCCGTCAAGAAGATGGA 
CCTGCGCAAGCAGCA6AGGCGCGAGCTGCTCTTCAACGAGGTGGTAATCATGAGGGACTA 
CCAGCACGAGAATGTGGTGGAGATGTACAACAGCTACCTGGTGGGGGACGAGCTCTGGGT 
GGTCATGGAGTTCCTGGAAGGAGGCGCCCTCACCGACATCGTCACCCACACCAGGATGAA 
CGAGGAGCAGATCGCGGCCGTGTGCCTTGCAGTGCTGCAGGCCCTGTCGGTGCTCCACGC 
CCAGGGCGTCATCCACCGGGACATCAAGAGCGACTCGATCCTGCTGACCCATGATGGCAG 
GGTGAAGCTGTCAGACTTTGGGTTCTGCGCCCAGGTGAGCAAGGAAGTGCCCCGAAGGAA 
GTCGCTGGTCGGCAC6CCCTACTGGATGGCCCCAGAGCTCATCTCCCGCCTTCCCTACGG 
GCCAGAGGTAGACATCTGGTCGCTGGGGATAATGGTGATTGAGATGGTGGACGGAGAGCC 
CCCCTACTTCAACGAGCCACCCCTCAAAGCCATGAAGATGATTCGGGACAACCTGCCACC 
CCGACTGAAGAACCTGCACAAGGTGTCGCCATCCCTGAAGGGCTTCCTGGACCGCCTGCT 
GGTGCGAGACCCTGCCCAGCGGGCCACGGCAGCCGAGCTGCTGAAGCACCCATTCCTGGC 
CAAGGCAGGGCCGCCTGCCAGCATCGTGCCCCTCATGCGCCA6AACCGCACCAGATGAGG 
CCCAGCGCCCTTCCCCTCAACCAAAGAGCCCCCCCGGGTCACCCCCGCCCCACTGAGGCC 
AGTAGGGGGCCAGGCCTCCCACTCCTCCCAGCCCGGGAGATGCTCCGCGTGGCACCACCC 
TCCTTGCTGGGGGTAGATGAGACCCTACTACTGAACTCCAGTTTTGATCTCGTGACTTTT 
AGAAAAACACAGGGACTCGTGGGAGCAAGCGAGGCTCCCAGGACCCCCACCCTCTGGGAC 
AGGCCCTCCCCCATGTTCTTCTGTCTCCAGGAAGGGCAGCGGCCCTCCCATCACTGGAAG 
TCTGCAGTGGGGGTCGCTGGGGGT6GAGAGAACACTAAGAGGTGAACATGTATGAGTGTG 
TGCACGCGTGTGAGTGTGCATGTGTGTGTGTGTGCAAAGGTCCAGCCACCCCGTCCTCCA 
GCCCGCAAGGGGTGTCTGGCGCCTTGCCTGACACCCAGCCCCCTCTCCCCCTGAGCCATT 
GTGGGGGTCGATCATGAATGTCCGAAGAGTGGCCTTTTCCCGTAGCCCTGCGCCCCCTTT 
CTGTGGCTGGATGGGGAGACAGGTCAGGGCCCCCCACCCTCTCCAGCCCCTGCAGCAAAT 
GACTACTGCACCTGGACAGCCTCCTCTTTTCTAGAAGTCTATTTATATTGTCATTTTATA 
ACACTCTAGCCCCTGCCCTTATTGGGGGACAGATGGTCCCTGTCCTGCGGGGTGGCCCTG 
GCAGAACCACTGCCTGAAGAACCAGGTTCCTGCCCGGTCAGCGCAGCCCCAGCCCGCCCA 
CCCCTGCCTCGAGTTAGTTTTACAATTAAAACATTGTCTTGTTTTGTGAAAAAAAAAAAA 
AAAAAAAAAA 
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SEQUENCE LISTING 

<110> SUGEN, INC. 

<12Q> STE20-RELATED PROTEIN KINASES 

<130> 240/300-PCT 

<140> TO BE ASSIGNED 
<141> Herewith 

<150> US 60/081,784 
<151> 1998-04-14 

<160> 147 



<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 3268 
<212> DNA 

<213> Mammalian (Human) STLK2 
<400> 1 

taacagccca cctcctagcc ccgggctacg cgccgccagc ccagtaaccc cacttttgtg 60 

tgtcctccca ggccccgatc gaaaagcctg ggagggccgc cgaactaccc ccggagggag 120 

gagccagtcc gaacccaagg cgccaccgcc gcagaagcgg agcgaggcag ■ cat rcgcctc 180 

catggcccac ccgccggrgg crgtccaagt gcctgggatg cagaataaca tagctgarcc 240 

agaagaactg trcacaaaat ragagcgcat tgggaaaggc tcatttgggg aagttttcaa 300 

aggaattgat aaccgtaccc agcaagtcgt tgctattaaa atcatagacc ttgaggaagc 360 

cgaagatgaa atagaagaca rtcagcaaga aataactgtc ttgagtcaat gtgacagctc 420 

atatgtaaca aaaractarg ggtcatattt aaaggggtct aaaecatgga taataatgga 480 

atacctgggc ggrggttcag cactggatct tctrcgagct ggtccatttg atgagttcca 540 

gattgctacc atgctaaagg aaattttaaa aggtctggac tatctgcatt cagaaaagaa 600 

aattcaccga gacataaaag ctgccaatgt cttgctctca gaacaaggag atgttaaact 660 

tgcrgatttt ggagttgctg gtcagctgac agatacacag attaaaagaa atacctttgt 720 

gggaactcca ttttggatgg ctcctgaagt tattcaacag tcagcttatg actcaaaagc 780 

tgacartrgg tcattgggaa ttactgctat tgaactagcc aagggagagc cacctaactc 840 

cgatangcat ccaatgagag ttctgtttct tattcccaaa aacaatcc'c caactcttgt 900 

tggagacttt actaagtctt ttaaggagtt tattgatgct tgcctgaaca aagatccatc 960 

atttcgtcct acagcaaaag aacrtctgaa acacaaattc attgtaaaaa attcaaagaa 1020 

gacttcttat crgactgaac tgatagatcg ttttaagaga tggaaggcag aaggacacag 1080 

tgatgatgaa tctgattccg agggctictga ttcggaarct accagcaggg aaaacaatac 1140 

tcatcctgaa rggagcttta ccaccgtacg aaagaagcct gatccaaaga aagtacagaa 1200 

tggggcagag caagatcttg tgcaaaccct gagttgcttg tctatgataa tcacacctgc 1260 

atttgctgaa cttaaacagc aggacgagaa taacgctagc aggaatcagg cgattgaaga 1320 

actcgagaaa agtattgctg cggctgaagc cgcctgtccc ggcatcacag ataaaatggt 1380 

gaagaaacta attgaaaaat tccaaaagtg rtcagcagac qaazcacccz aagaaactta 1440 
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ttatrggctt ctgtttcata tggacccaga gagccccacc aaacctacgt caagattaac 1500 

aatgctraac ccacgagctc catgtgcctt ttggatcttt gcaacactga agartrggaa 1560 

gaagctatta aactattttg tgatggcgtt tatcatttta tattttgaaa ggattatttt 1620 

gtaaggaaca acttttaata ctatagtttc acctgtattc tagtaaatgt: rgagacaccg 1680 

trttgctrtr aagtatccct atrtcttaag ttacgaggat gaataccrtt cacattttga 1740 

tctttagttg acrcracagt catgaaacat acaggtcrtt caaagtcatt ctcaatartc 1800 

agcctrtgta aattatcaag cttcaaaaag cttttrttta aaaaaaaaaa catgcatatt 1860 
ctaaaaatga ctattggtgg ggaggtgtaa ataagtcata ccttcttaaa acagaaaatt . 1920 

taagtaaagt cttttaaatg aaacctgtaa aagtattgac tcttctacca agttggtatg 1980 

atattccagg cagctcaatg attatcacat ttgagaccct gtgtttgaag catttacagg 2040 

caatgtacag caacagaggt acctcttggr gcatagtatt tacattctct tttaggtaga 2100 

agaggcaatt ttacccttat ttcacarggt tagaaattta aagcaagatc atttacccaa 2160 

ggataggtgt ttggraatgt tgaaggagtt agtctggctt catgttttac atcttcaact 2220 

aaaatcccat actatctgct tggatttgga gagccaaaaa acaaagctga ttgtcatgtg 2280 

attaaatatc tgatcaacag gtatgaatat aacttaaatc agcatatrtt tgccatggta 2340 

ataaattgtc cracaaacta tttatatatt tttgttcttc ataattatca ctaataagca 2400 

tcagtttgtt gttttraaaa ggatatttaa grgagcattr rctagttcat atgaaaataa 2460 

ccatagtaca ggatgatttc tgtccacaca aaggttaaat tagattgcac agttaatttt 2520 

cacttatatt ta.tggtacta ttatg^gggt. gacgcctttt tcttttaagc ccagtacata 2580 

eattatgcct gcccaagttc tgaactgggg ctgtatttca gtagttgtag aatrattgat 2640 

atrtagtttt gatagctaat grttaattgt ttggatctgc acagttrggt ttttgcacaa 2700 

aagtcattta aaaaaatctg agtaattgtc aaatattaaa agaaagatac tcttcctgta 2760 

aggaatacag tttttagtca aagtggccat tacatcctct ttttaattta cataatacag 2820 

atacttgaga aagttgttgt ggtgttgtat gccaagaaaa ttctttttat tggtgcctat 2880 

attgtaacaa ttatttttaa tgcattgtat tttgaagtaa cggttcagtt aaatttttca 2940 

cctgctgtgc aacrgaaaca caatcacagt ttataatcat ctgtagaagt ctggagataa 3000 

tnttgcaact catgttatgg gttaaatgaa tatttttgta aaagtaaaag caacaaattt 3060 

ataaattgat tatttgaaac tttacaacac aattgcatcc caaatacaaa ttgtattgct 3120 

tattcatrat agctatrcgt cctgtaatct gtttctaggt gaagcatact ccagtgtttt 3180 

agggg-ttttg aaaataaata tttaaatttc acagtcaaaa aaaaaaaaaa aaaaaaaaaa 3240 

aaaaaaaaaa aaaaaaaaaa aaaaaaaa 3268 



<210> 2 
<211> 3030 
<212> DNA 

<213> Mammalian (Human) STLK3 
<400> 2 



gacagcagcg 
ggcggcacag 
cggcagtgga 
agcaataaaa 
aattcaagcc 
ggrcaaagat 
cataaaatac 
agcaacaatt 
ccacagggat 
agattttggg 
aaaaacatrc 
ctatgactcc 
agcagcgcct 
tccacccact 
ctt tagaaaa 



ccggccccgg 
gctgtcggct 
gctactgctg 
cggarcaact 
atgagtcagt 
gaactttggc 
attgtcaacc 
ct taaagagg 
ttgaaagcrg 
gcaagtgcgt 
gttggcaccc 
aaggcrgaca 
tatcacaaat 
ttggaaacag 
t: tact ttcac 



cagctcccgc 
ggcccatctg 
tggttcaggc 
tggaaaaatg 
gcagccatcc 
cggtcatgaa 
gaggagaaca 
ttttggaagg 
gtaatattct 
tcctagcaac 
catgttggat 
tgcggagttt 
atcctcccat 
gggtagagga 
tgtgtcttca 



ggccccggcc 
cagggacgcg 
agccctatgc 
ccagaccagt 
caacgtagtg 
at tactaagt 
caagaatgga 
ct tagactat 
tctgggtgag 
agggggtgar 
ggctcctgaa 
tggaataact 
gaaagtgt ta 
taaaqaaat g 
gaaagatcct 



ccggccccgg 
tacgagctgc 
aaacccaggc 
atggatgaac 
acctattaca 
ggaggttcaa 
gttctggaag 
ctacacagaa 
gatcgttcag 
gt tacccgaa 
gtcatggaac 
gccac tgaat 
atgt ngactt 
a tgaaaaagu 
tccaaaaggc 



ccccggcccc 
aggaggttat 
aagaacgtgt 
tattaaaaga 
cctcttttgt 
tgtrggatat 
aggcaataat 
acggtcagat 
tacaaatagc 
ataaagtaag 
aggtgagagg 
tagcaacagg 
tgcaaaatga 
acggcaagtc 
ccacagcagc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
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agaacttrta 
gctgcttaca 
gtcaagtggt 
ggatgagaag 
agaagaaaat 
ctctgtgcac 
ttgtgccgtg 
atrrgagrrt 
tggcrtggtg 
tgatcccaaa 
gattccngat 
•cctcgatgtc 
tctttctgcc 
tgaagaggcc 
ccgagttctg 
cacgactgt t 
ccct taagtg 
aatct rctat 
aac tgtacat 
aaattaagat 
tt tcatatat 
cc tcTitgcag 
gtaacttcta 
t t tgccraaa 
ctttcacar t 
ttcctctata 
gtaaaatgtt 
ttaagrgata 
1 1 act tgaaa 
actaaaaggt 
tgct tgactg 
t ttcaaggag 
ggcagctctt 
acaattgcct 
attrcattcc 
gaaaataaaa 



aaatgcaaat 

agaacaccag 

caccctcata 

agcgaagaag 

ccagagattg 

gactcrcagg 

aaccucgt 1 1 

act ccaggaa 

gatggtcacg 

get ttaaaaa 

gaagtgaagc 

accctgat ct 

catttcctcc 

ggcttcaaca 

ctttattctc 

gctcattttt 

aaaagaatct 

ccagaagccr 

ctgacaaT:ac 

tta ttagcca 

ttargcatta 

ttaatctcgt 

aattttgaaa 

aaccaattta 

aagaatgtgc 

taaggtggca 

gccattctgt 

ggatttgtta 

tt ttgcagaa 

aaaacgctgt 

tatattatgg 

gtgggctccc 

taar gcactt 

tgtcataatt 

attctattgt 

g-igtattgca 



tcttccagaa 

acatagccca 

aaaccgaaga 

ggaaagcagc 

cagtgagtgc 

gcccacccaa 

tgagat taag 

gagatacagc 

argtagttat 

catcgacatt 

tgattgggtt 

gtcatgcccc 

caccccctca 

aaatgggata 

tagcaatcca 

ccaaaagcta 

gaaatacacr 

atttctcctt 

tgcctctttt 

aact tgaat t 

cacaccttag 

cat traaaca 

tgactgaatt 

caat ctgact 

ctgaggctgc 

tatgr taagt 

tagattgcta 

ttctaaaggt 

tacccaggtg 

taaggr tcaa 

aactagtaaa 

aggatggtac 

caaaaagcaa 

gtraaggtgt 

t tacacaacg 

caatgaaaaa 



agccaagaac 

aagagccaaa 

cggggactgg 

tttttctcag 

cagcaccatc 

tgctaatgaa 

aaactccaga 

agatggtgra 

agtggctgct 

taagttggct 

tgctcagttg 

accgccaccc 

ctcccatttc 

aaaaaataat 

cagtacaaga 

tttaatattc 

caggtggtct 

tcattgttgt 

atgttgtatt 

cragttttaa 

ctataagaaa 

aaaagttttg 

aggaatttgg 

gtctcttggg 

tttactctgg 

tttgcttcat 

actatatacc 

gataaacttg 

gagtgaaaat 

gaatcaatac 

ccttaggatt 

cattgctctt 

atatatatta 

tctagagcca 

att actcgaa 



agagagtacc 

aaggtaagaa 

gagtggagtg 

gaaaagtcac 

cccgaacaaa 

gactacagag 

aaggaactta 

tctcaggagc 

aatttacaga 

tctggctgtg 

agtgtcagct 

ctactccctt 

ctagcaaaat 

tttttaaaac 

acaagcaaat 

ttagcaatca 

tatttattgg 

tatttctgtt 

tagaaattaa 

aactgactgt 

aaaagggttt 

ggtctatctt 

atgcttattc 

agagggaggt 

aatagtctca 

tggaccgtt t 

catctctgat 

aaaatatcag 

tggaagggtt 

tttcaaccca 

ttgaaaattg 

tcctagctaa 

ctaaggaaaa 

t t tgcat aca 

gat gactgca 



tgattgagaa 

gagttcctgg 

acgacgagat 

gaagagtaaa 

tacagtccct 

aagcttcttc 

atgacatacg 

tcttctctgc 

agattgtaga 

atgggtcgga 

gatgtatgtc 

caaccctccc 

cagaagattg 

ttacaacact 

gccacagctg 

atttggatat 

caacaaaagg 

ataaracttt 

tatacctata 

gaattttatt 

tgattatatg 

tggagtattt 

ttttagtctg 

gccttgcaaa 

gat ctaaaat 

agaatgctat 

t tggctctcc 

aatctgagtt 

ttgtgcaatg 

agtagccctc 

gagtctaatc 

ccctagatat 

aaagttattt 

atttaatgta 

aaggtaaaag 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1600 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3030 



<210> 3 
<211> 3857 
<212> DNA 

<213> Mammalian (Human) STLK4 
<400> 3 

caaaagtgga gtcctagatg antctaccat tgcracgata ctccgagaag tactggaagg 60 

gctggaatat ctgcataaaa ntggacagat ccacagagat gtgaaagctg gaaacattct 120 

tnttggagaa gatggctcag tacagatttc agacutuggg gtcagtgctt ttttagcaac 180 

tggtggtgat attacccgaa ataaagtgag aaagaccttt gttggcaccc cttgttggat 240 

ggcaccrgaa gttatggaac aggtccgtgg ttatgatttc aaagccgata tttggagttt 300 

tiggaattaca gcaattgaat tggctacagg ggcggctcct tatcataaat atccaccaat 360 

gaaggrttta atgctgacac tgcagaacga tccrccttct ttggaaactg grgttcaaga 420 

taaagaaatg ctgaaaaaat atggaaaatc atrtagaaaa atgarttcat tgtgccttca 480 

aaaagatcca gaaaaaagac caacagcagc agaactatta aggcacaaat ttttccagaa 540 

agcaaagaat aaagaatttc ttcaagaaaa aacattgcag agagcaccaa ccatttctga 600 



BNSDOCID <WO 9953036A2_IA> 



wo 99/53036 



PCT/US99/08150 



4 

aagagcaaaa aaggttcgga gagtaccagg ttccagtggg cgtcrtcata agacagagga 660 

cggaggcrgg gagtggagtg argatgaatt cgatgaagaa agtgaggaag ggaaagcagc 720 

aatttcacaa ctcaggtctc cccgagtgaa agaatcaata tcaaattccg agctctttcc 780 

aacaactgat cctgtgggta ctttgctcca agttccagaa cagatctctg ctcatctacc 840 

ucagccagct gggcagactg ctacacagcc 'aactcaagtc tctctcccac ccaccgcaga 900 

gccagcaaaa acagctcagg ctttgtcttc aggatcaggt tcacaagaaa ccaagatccc 960 

aatcagtcca gtactaagat taaggaattc caaaaaagaa ctaaatgata ttcgatttga 102C 

arttactcct gggagagata cagcagaggg tgtcrctcag gaacrcattt ccgctggcct 1080 

ggtcgacgga agggattcag taatagtggc agcraattug cagaaaattg tggaagaacc 1140 

tcagtcaaat cgatctgtca ctttcaaact ggcatctggt gtcgaaggct cagatattcc 1200 

tgacgatggt aaactgatag gatttgccca gctcagcatc agcraaacca caaccctgga 1260 

agaggcggcc taaggagatt ccacacatgc gtatcrctgt tgcttctatt ggcctaaacc 1320 

cactacrgcc aaagaaccca gcaacaaacc tcccggctag gagctttaga agtctttatg 1380 

ttcttcctgc catcattcct ccttttccca cagggaaaga aaagttggat cactagtggc 1440 

cagcatcccc agagttccgt tagtaaactt acttcatatg tcccctgtct tcctccatct 1500 

gagaagtggc ccatgtgctt caaggcccag gagggagatc tgtcagctca ttcttgcctt 1560 

actccaarga cggcccaggt ggaaaagtag cagctgtatc gggcttcctc atcctgcctg 1620 

ttcccccaca ccrgccagga tatggacatc ttgggatatc tctttaccac tgaagtagaa 1680 

tugattgttc agccggagcc cagagaattt aatttaatgt tttttctttg tacctgatgt 1740 

gaattctagc aacctttgtt aggaaaaagc acagcctcag atggaggcag cctaaactgt 1800 

gttcrtgttt tgttcarggt gtttctaagc gttttgctga agctgctctc aggcaccccc 1860 

ttcttcattg ctctctccag aaagggttgc tagcctcaac trcagctggt gcaaaacatc 1920 

tgactgtagc cgaacttcag ccatcagatc cttcaaagtg gaactttgga ttgtttttac 1980 

agacaacatc gagtaatggc ttgtaaatgt gaattttgcc agaggtggtt tttgaacagg 2040 

aaaatcataa ttcatatcat tggagaagra tttattrtca aatatcaaat tgaagaaaaa 2100 

crcaatcctc ccatgaaaat cagttcgcct ggcctccaag tcgrgaggaa atgggtatgc 2160 

aaggctgaga tittctacagc aacaaaggag acacacactg ggccagagag gcctgccttc 2220 

tgcctgctct cctgcactga ccctttggag ggggtctctg tgtgctgaag ctaactcaag 2280 

atggaaagtg aaaccacatg tgccgtgacc tttaggtttt atgagtagac agtgttcatt 2340 

tgattttcra cagaaataat ataaattatt ctttaggttt aaaaaagagc actcataatg 2400 

caatatgtga ataarcagtg aggttgattt ttcttttttc ccaccgtttc atagtctttg 2460 

tctaactgct agcaacccta ccgagtttta tatatgagtg ggatactcaa tctggcctta 2520 

aaaagataca caaagatggg crgrgggtcc ctggaaaggg ggagagttgc cctttacaga 2580 

atcacrcgag ccctttccag cactgttggt ctgatgaaca aggttgtttt accttatttt 2640 

ctcrtggaac atatctgaaa accttcccca caaataactt gtcacacctt ttgtttcatt 2700 

ctgagtcttt agttttagtc atgggctttc ttcacctgct ctaggtgcaa aggcatgttg 2760 

ggaaagagat ggatgttggg gaggaagaga ggagatggat ttcagttggg agttaggagg 2820 

agagtaggtg agatgatcag acaccggagt tcaacgtccc agcagtcttg gtaaaaggag 2880 

ggagcctgct gagccaggag ggagaaaaga agattgacca gcttgctaga aaaatactta 2940 

gcttttcttt ttcttttttc gcggaggggg gacggagagg aacaaggatg gggaggtagg 3000 

aatgaggtat agaaaagaga tagcatcttc tttggcacaa gactagtggc ttaccgctta 3060 

ccttagagtt ttgttttttt tttttcaaac ccatcaaaat ctacttattt atgaatccaa 3120 

ggggtggcag catcactctg ttctagcatt ctttgtggag atggtctggt gcctagctgg 3180 

gagtgagcag cagcccatcc cctgttcact ttctctagcc catcattacc tgtgaactgc 3240 

agtggggcag tcacggcaaa ragaattggg ctggggttrc tccttctttt cagttcattg 3300 

tttgccctgc taggaattag aagacagaca ccatgtccca ggacagtgtt acttcttctg 3360 

cacgatgcgt ggtagactcc ctctgccggc ttgtgcagtg atactgagaa aatacatgaa 3420 

cagaaactgc ccaggrggaa cagcacgtaa cctagcgagt gactgtacrc ctttctagga 3480 

atgctgattc agagtgcacc tctttgacca ggtcccagga tccccttgtc cccggagtag 3540 

ggactaacza tagcacaaag taacatgtgc caatgctatt tgcgaaatgt rtggtctttc 3600 

taaacgacta aaggatttgt tgggttcttg cttaagtttt gaaccaaatc ctagagccag 3660 

ctgataatat ttaataauct ggaggagaga ataatgatgt accaataagt ggagattcct 3720 

ccttatgacg tatgctaggt tatgqaagat: gcaaaatatt caactttttc ctcctttttt 3780 

tggacrttgt artttactgc atgttttctt catttttaat caacaaagag taaattgtca 3840 

aaaaaaaaaa aaaaaaa 3857 
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<210> 4 
<211> 1584 
<212> DNA 

<213> Maminaiian (Human) STLK5 
<400> 4 

ctcarctgra cacacttcat ggatggcatg aatgagctgg cgattgcrta catcctgcag 60 

ggggtgctga aggccctcga cracatccac cacatgggat atgtacacag gagtgiicaaa 120 

gccagccaca tcctgatctc tgtggatggg aaggtctacc tgtctggttt gcgcagcaac 180 

ctcagcarga taagccatgg gcagcggcag cgagtggtcc acgattttcc caagracagt 240 

gtcaaggctc tgccgtggct cagccccgag gtcctccagc agaatctcca gggttatgat 300 

gccaagtctg acatctacag tgtgggaarc acagcctgtg aacrggccaa cggccatgtc 360 

ccctt::aagg atacgcctgc cacccagatg ctgctagaga aactgaacgg cacagtgccc 420 

cgcctgttgg araccagcac catccccgct gaggagctga ccatgagccc ttcgcgcrca 480 

gtggccaact ccggcccgag tgacagcctg accaccagca ccccccggcc ctccaacggt 540 

gactcgccct cccaccccta ccaccgaacc ttctcccccc acttccacca ctttgtggag 600 

cagtgcctrc agcgcaaccc ggatgccagg cccagtgcca gcaccctcct gaaccactct 660 

tccttcaagc agatcaagcg acgtgcctca gaggctttgc ccgaattgct tcgtcctgtc 720 

acccccatca ccaattttga gggcagccag tctcaggacc acagtggaat ctttggcctg 780 

gtaacaaacc tggaagagct ggaggtggac gattgggagt tctgagcctc tgcaaactgt , 840 

gcgcattctc cagccaggga tgcagaggcc acccagaggc ccttcctgag ggccggccac 900 

artcccgccc ccctgggcag attgggtaga aaggacattc ttccaggaaa gttgactgct 960 

gactgattgg gaaagaaaat cctggagaga tacttcactg ctccaaggct tttgagacac 1020 

aagggaatct caacaaccag ggarcaggag ggtccaaagc cgacattccc agtcctgtga 1080 

gctcaggtga cctcctccgc agaagagaga tgctgctctg gccctgggag ctgaatucca 1140 

agcccagggr ttggctcctt aaacccgagg accgccacct cttcccagtg cttgcgacca 1200 

gcctcattct azttaacttt gctcrcagat gcctcagatg crataggtca gtgaaagggc 1260 

aagtagtaag ctgcctgccr cccttccctc agacctcncc ctcataattc cagagaaggg 1320 

catttctgtc tttttaagca cagactaagg crggaacagt ccatccttat ccctcrtctg 1380 

gctrgggccc tgacacctaa grctttccca cggtrtatgt gtgtgcctca ttcctttccc 1440 

accaagaarc catcttagcg cctcctgcca gctgccctgg tgctttctcc aagggccatc 1500 

agtgtcttgc ctagcttgag ggcttaagtc cttatgcrgt gtcagrttcg ttgtcagaac 1560 

aaatraaaat tttcagagac gctg 1584 



<210> 5 

<211> 416 

<212> PRT 

<213> Mammalian (Human) STLK2 

<400> 5 

Met Ala His Ser Pro Val Ala Vai Gin Vai Pro Giy Met Gin Asn Asn 
^5 10 15 

lie Ala Asp Pro Glu Glu Leu Phe ?hr Lys Leu Giu Arg lie Gly Lys 
20 25 30 

Giy Ser Phe Gly Giu Val Phe Lys Giy lie Asp Asn Arg Thr Gin Gin 
3 5 4 0 4 5 
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Val Val Ala lie Lys lie lie Asp Leu Glu Glu Ala Glu Asp Glu lie 
50 55 60 

Glu Asp lie Gin Gin Glu lie Thr Val Leu Ser Gin Cys Asp Ser Ser 
65 70 75 80 

Tyr Val Thr Lys Tyr Tyr Gly Ser Tyr Leu Lys Gly Ser Lys Leu'Tro 
85 90 95 

lie lie Met Glu Tyr Leu Gly Gly Gly Ser Ala Leu Asp Leu Leu Arg 
100 105 110 

Ala Gly Pro Phe Asp Glu Phe Gin lie Ala Thr Met Leu Lys Glu lie 
115 120 125 

Leu Lys Gly Leu Asp Tyr Leu His Ser Glu Lys Lys lie His Arg Asp 
130 135 140 

lie Lys Ala Ala Asn Val Leu Leu Ser Glu .Gin Gly Asp Val Lys Leu 
1^5 150 155 160 

Ala Asp Phe Gly Val Ala Gly Gin Leu Thr Asp Thr Gin lie Lys Arg 
165 170 ■ 175 

Asn Thr Phe Val Gly Thr Pro Phe Trp Met Ala Pro Glu Val lie Gin 
180 185 190 

Gin Ser Ala Tyr Asp Ser Lys Ala Asp lie Trp Ser Leu Gly lie Thr 
195 200 205 

Ala lie Glu Leu Ala Lys Gly Glu Pro Pro Asn Ser Asp Met His Pro 
210 215 220 

Met Arg Val Leu Phe Leu lie Pro Lys Asn Asn Pro Pro Thr Leu Val 
225 230 235 240 

Gly Asp Phe Thr Lys Ser Phe Lys Glu Phe lie Asp Ala Cys Leu Asn 
245 250 255 

Lys Asp Pro Ser Phe Arg Pro Thr Ala Lys Glu Leu Leu Lys His Lys 
260 265 270 

Phe lie Val Lys Asn Ser Lys Lys Thr Ser Tyr Leu Thr Glu Leu He 
275 280 285 

Asp Arg Phe Lys Arg Trp Lys Ala Glu Gly His Ser Asp Asp Glu Ser 
290 295 300 

Asp Ser Glu Gly Ser Asp Ser Glu Ser Thr Ser Arg Glu Asn Asn Thr 

310 315 320 

■His Pro Glu Trp Ser Phe Thr Thr Val Arg Lys Lys Pro Asp Pro Lys 
325 330 335 

Lys Val Gin A.sn Gly Ala Glu Gin Asp Leu Val Gin Thr Leu Ser Cys 
340 345 ■ 350 
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Leu Ser Met lie. lie Thr Pro Ala 
355 360 

Glu Asn Asn Ala Ser Arg Asn Gin 
370 375 

lie Ala Val Ala Glu Ala Ala Cys 
385 390 

Lys Lys Leu He Glu Lys Phe Gin 
405 



Phe Ala Glu Leu Lys Gin Gin Asp 
365 

Ala He Glu Glu Leu Glu Lys Ser 
380 

Pro Gly He Thr Asp Lys Met Val 
395 400 

Lys Cys Ser Ala Asp Glu Ser Pro 
410 415 



<210> 6 

<211> 5i6 

<212> .P.RT 

<213> Mammalian (Human) STLK3 

<400> 6 

Thr Ala Ala Pro Ala Pro Ala Ala Pro Ala Ala Pro Ala Pro Ala Pro 
1 5 10 15 



Ala Pro Ala Pro 
20 

Ala Tyr Glu Leu 
35 

Gin Ala Ala Leu 
50 

lie Asn Leu Glu 
65 

He Gin Ala Met 



Thr Ser Phe Val 
100 

Ser Gly Gly Ser 
115 

Glu His Lys Asn 
130 

Lys Glu Val Leu 
145 

His Arg Asp Leu 



Ala Ala Gin Ala 



Gin Glu Val He 
40 

Cys Lys Pro Arg 
55 

Lys Cys Gin Thr 
70 

Ser Gin Cys Ser 
85 

Val Lys Asp Glu 



Met Leu Asp He 
120 

Gly Val Leu Glu 
135 

Glu Gly Leu Asp 
150 

Lys Ala Gly Asn 
165 



Val Gly Trp Pro 
25 

Gly. Ser Gly Ala 



Gin . Glu Arg Val 
60 

Ser Met Asp Glu 
75 

His Pro Asn Val 
90 

Leu Trp Leu Val 
105 

He Lys Tyr He 



Glu Ala He He 
140 

Tyr Leu His Arg 
153 

He Leu Leu Gly 
170 



He Cys Arg Asp 
30 

Thr Ala Val Val 
45 

Ala He Lys Arg 



Leu Leu Lys Glu 
80 

Val Thr Tyr Tyr 
95 

Met Lys Leu Leu ' 
110' 

Val Asn Arg Gly 
125 

Ala Thr He Leu 



Asn Gly Gin He 
160 

Glu Asp Gly Ser 
175 
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Vai Gin lie Ala Asp Phe Gly Val Ser Ala Phe Leu Ala Thr Giy Gly 
180 185 190 

Asp Val Thr Arg Asn Lys Val Arg Lys Thr Phe Val Giy Thr Pre Cys 
195 200 205 

Trp Met Ala Pro Glu Val Met Glu Gin Val Arg Gly Tyr Asp Phe Lys 
210 215 220 

Ala Asp Met Trp Ser Phe Gly lie Thr Ala He Glu Leu Ala Thr Gly 
225 230 235 240 

Ala Ala Pro Tyr His Lys Tyr Pro Pro Met Lys Val Leu Met Leu Thr 
245 250 255 

Leu Gin Asn Asp Pro Pro Thr Leu Glu Thr Gly Val Glu Asp Lys Glu 
260 265 270 

Met Met Lys Lys Tyr Giy Lys Ser Phe Arg Lys Leu Leu Ser Leu Cys 
275 280 285 

Leu Gin Lys Asp Pro Ser Lys Arg Pro Thr Ala Ala Glu Leu Leu Lys 
290 295 300 

Cys Lys Phe Phe Gin Lys Ala Lys Asn Arg Glu Tyr Leu lie Glu Lys 
305 - 310 315 320 

Leu Leu Thr Arg Thr Pro Asp He Ala Gin Arg Ala Lys Lys Val Arg 
325 330 335 

Arg Vai Pro Gly Ser Ser Gly His Leu His Lys Thr Glu Asp Gly Asp 
340 345 350 

Trp Glu Trp Ser Asp Asp Glu Met Asp Glu Lys Ser Glu Glu Gly Lys 
355 360 365 

Ala Ala Phe Ser Gin Glu Lys Ser Arg Arg Val Lys Glu Glu Asn Pro 
370 375 380 

Glu He Ala Val Ser Ala Ser Thr He Pro Glu Gin He Gin Ser Leu 
385 390 395 400 

Ser Val His Asp Ser Gin Gly Pro Pro Asn Ala Asn Glu Asp Tyr Arg 
405 410 415 

Glu Ala Ser Ser Cys Ala Val Asn Leu Val Leu Arg Leu Arg Asn Ser 
420 425 " 430 

Arg Lys Glu Leu Asn Asp He Arg Phe Glu Phe Thr Pro Giy Arg Asp 
435 440 445 

Thr Ala Asp Gly Vai Ser Gin Glu Leu Phe Ser Ala Giy Leu Val Asp 
450 455 460 

Gly His Asp Val Val He Val Ala Ala Asn Leu Gin Lys He Val Asp 
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470 



475 



480 



Asp Pro Lys Aia Leu Lys Thr Leu Thr Phe Lys Leu Ala Ser Gly Cys 
485 490 495 

Asp Gly Ser Glu lie Pro Asp Glu Val Lys Leu lie Gly Phe Ala Gin 
500 505 510 



Leu Ser Val Ser 
515 



<210> 7 . 
<211> 414 
<212> PRT 

<213> Mammalian (Human) STLK4 

<220>.... 

<223> "Xaa" srands for any amino acid. 
<400> 7 

Lys Ser Gly Val Leu Asp Xaa Ser Thr lie Aia Thr lie Leu Arg Glu 
15 10 15 

Val Leu Glu Gly Leu Glu Tyr Leu His Lys Xaa Gly Gin lie His Arg 
20 25 30 

Asp Val Lys Aia Gly Asn lie Leu Xaa Gly Glu Asp Gly Ser Val Gin 
35 40 45 

lie Aia Asp Phe Gly Val Ser Aia Phe Leu Aia Thr Gly Giy Asp lie 
50 55 60 

Thr Arg Asn Lys Val Arg Lys Thr Phe Val Gly Thr Pro Cys Trp Met 
65 70 75 80 

Ala Pro Glu Val Met Giu Gin Val Arg Giy Tyr Asp Phe Lys Ala Asp- 
85 90 95 

lie Trp Ser Phe Gly lie Thr Ala lie Giu Leu Aia Thr Giy Ala Ala 
100 105 110 

Pro Tyr His Lys Tyr Pro Pro Met Lys Val Leu Met Leu Thr Leu Gin 
115 120 125 

Asn Asp Pro Pro Ser Leu Glu Thr Gly Val Gin Asp Lys Giu Met Leu 
130 135 140 

Lys Lys Tyr Gly Lys Ser Phe Arg Lys Met lie Ser Leu Cys Leu Gin 

150 155 160 

Lys Asp Pro Glu Lys Arg Pro Thr Ala Aia Glu Leu Leu Arg His Lys 
165 170 175 
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Phe .Phe Gin Lys Ala Lys Asn Lys Glu Phe Leu Gin Giu Lys Thr Leu 
180 185 190 

Gin Arg Aia Pro Thr lie Ser Glu Arg Ala Lys Lys Val Arg Arg Vai 
195 200 205 

Pro Gly Ser Ser Gly Arg Leu His Lys Thr Glu Asp Gly Giy Trp Glu 
210 215 220 

Trp Ser Asp Asp Giu Phe Asp Glu Giu Ser Giu Glu Gly Lys Ala Aia 
225 230 235 240 

lie Ser Gin Leu Arg Ser Pro Arg Val Lys Giu Ser lie Ser Asn Ser 
245 250 255 

Glu Leu Phe Pro Thr Thr Asp Pro Val Giy Thr Leu Leu Gin Vai Pro 
260 265 270 

Glu Gin lie .Ser Aia His Leu Pro Gin Pro Ala Gly Gin lie Aia Thr 
275 280 285 

Gin Pro Thr Gin Val Ser Leu Pro Pro Thr Ala Glu Pro Ala Lys Thr 
290 295 300 

Ala Gin Aia Leu Ser Ser Gly Ser Giy Ser Gin Glu Thr Lys lie Pro 
^05 310 315 320 

lie Ser Leu Vai Leu Arg Leu Arg Asn Ser Lys Lys Glu Leu Asn Asp 
325 330 335 

He Arg Phe Glu Phe Thr Pro Giy Arg Asp Thr Ala Glu Giy Val Ser 
340 345 350 

Gin Giu Leu He Ser Ala Gly Leu Vai Asp Gly Arg Asp Leu Val He 
355 360 365 

Val Ala Ala Asn Leu Gin Lys He Val Giu Giu Pro Gin Ser Asn Arg 
370 375 380 

Ser Val Thr Phe Lys Leu Aia Ser Gly Val Glu Giy Ser Asp He Pro 
385 390 395 400 

Asp Asp Gly Lys Leu He Giy Phe Ala Gin Leu Ser He Ser 
405 410 



<210> 8 
<211> 274 
<212> PRT 

<213> Mammalian (Human) STLK5 
<400> 8 

Leu He Cys Thr His Phe Met Asp Giy Met Asn Glu Leu Ala He Ala 
^5 10 15 
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Tyr lie Leu Gin Gly Val Leu Lys Ala Leu Asp Tyr lie His His Met 
20 25 ^ 30 

Gly Tyr Val His Arg Ser Val Lys Ala Ser His lie Leu lie Ser Val 
35 40 45 

Asp Gly Lys Val Tyr Leu Ser Gly Leu Arg Ser Asn Leu Ser Met lie 
50 55 60 

Ser His Gly Gin Arg Gin Arg Val Val His Asp Phe Pro Lys Tyr Ser 
65 70 75 80 

Val Lys Val Leu Pro Trp Leu Ser Pro Glu Val Leu Gin Gin Asn Leu 
85 90 95 

Gin Giy Tyr Asp Ala Lys Ser Asp lie Tyr Ser Val Gly lie Thr Ala 
100 105 110 

Cys Giu Leu Ala Asn Gly His Val Pro Phe Lys Asp Met Pro Ala Thr 
115 120 125 

Gin Met Leu Leu Glu Lys Leu Asn Giy Thr Val Pro Cys Leu Leu Asp 
130 135 140 

Thr Ser Thr lie Pro Ala Glu Glu Leu Thr Met Ser Pro Ser Arg Ser 
145 150 155 160 

Val Ala Asn Ser Gly Leu Ser Asp Ser Leu Thr Thr Ser Thr Pro Arg 
165 170 175 

Pro Ser Asn Gly Asp Ser Pro Ser His Pro Tyr His Arg Thr Phe Ser 
180 185 190 

Pro His Phe His His Phe Val Glu Gin Cys Leu Gin Arg Asn Pro Asp 
195 200 205 

Ala Arg Pro Ser Ala Ser Thr Leu Leu Asn His Ser Phe Phe Lys Gin 
210 215 220 

lie Lys Arg Arg Ala Ser Giu Ala Leu Pro Glu Leu Leu Arg Pro Val 
225 230 235 240 

Thr Pro lie Thr Asn Phe Glu Giy Ser Gin Ser Gin Asp His Ser Gly 
245 250 255 

lie Phe Giy Leu Val Thr Asn Leu Glu Glu Leu Giu Val Asp Asp Trp 
260 265 270 

Glu Phe 



<210> 9 
<211> 3798 
<212> DNA 
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<213> Mammalian (Human) ZCi 
<4Q0> 9 

gagaccatgg cgaacgacrc ticccgcgaaa agtctggtgg acatcgacct ctcctccctg 60 

cgggatcccg ctgggatttt tgagctggtg gaagtggttg gaaatggcac ctatggacaa 120 

gtctataagg gtcgacatgt taaaacgggt cagttggcag ccatcaaagt tatggatgtc 130 

actgaggatg aagaggaaga aatcaaactg gagataaata tgctaaagaa atactctcat 240 

cacagaaaca ttgcaacata ttatggtgct ttcarcaaaa agagccctcc aggacatgat 300 

gaccaactct ggcttgttat ggagttctgt ggggctgggt ccattacaga ccrtgtgaag 360 

aacaccaaag ggaacacact caaagaagac tggatcgctt acatctccag agaaatcctg 420 

aggggactgg cacatcttca cattcatcat gtgattcacc gggatatcaa gggccagaat 480 

gtgttgctga ctgagaatgc agaggtgaaa cttgttgact ttggtgtgag tgctcagctg 540 

gacaggactg tggggcggag aaatacgttc ataggcactc cctactggat ggctcctgag 600 

gtcatcgcct gtgatgagaa cccagatgcc acctatgatt acagaagtga tcttcggtct 660 

tgtggcatta cagccattga gatggcagaa ggtgctcccc ctctctgtga catgcatcca 720 

atgagagcac tgtttctcat tcccagaaac cctcctcccc ggctgaagtc aaaaaaatgg 780 

tcgaagaagt tttttagttt tatagaaggg tgcctggtga agaattacat gcagcggccc 840 

tctacagagc agcttttgaa acatcctttt ataagggatc agccaaatga aaggcaagtt 900 

agaatccagc ttaaggatca tatagatcgt accaggaaga agagaggcga gaaagatgaa 960 

actgagtatg agtacagtgg gagtgaggaa gaagaggagg aagtgcctga acaggaagga 1020 

gagccaagtr ccattgtgaa cgtgcctggt gagtctactc ttcgccgaga trtcctgaga 1080 

ctgcagcagg agaacaagga acgttccgag gctcttcgga gacaacagtt actacaggag 1140 

caacagctcc gggagcagga agaatataaa aggcaactgc tggcagagag acagaagcgg 1200 

attgagcagc agaaagaaca gaggcgacgg ctagaagagc aacaaaggag agagcgggaa 1260 

gctagaaggc agcaggaacg tgaacagcga aggagagaac aagaagaaaa gaggcgtcta 1320 

gaggagttgg agagaaggcg caaagaagaa gaggagagga gacgggcaga agaagaaaag 1380 

aggagagttg aaagagaaca ggagtatatc aggcgacagc tagaagagga gcagcggcac 1440 

ttggaagrcc ttcagcagca gctgctccag gagcaggcca tgttactgga gcgccgatgg 1500 

cgggagatgg aggagcaccg gcaggcagag aggctccaga ggcagttgca acaagaacaa 1560 

gcatatctcc tgtctctaca gcatgaccat aggaggccgc acccgcagca ctcgcagcag 1620 

ccgccaccac cgcagcagga aaggagcaag ccaagcrtcc atgctcccga gcccaaagcc 1680 

cactacgagc ctgctgaccg agcgcgagag gtggaagata gatttaggaa aactaaccac 1740 

agctcccctg" aagcccagtc taagcagaca ggcagagtat tggagccacc agtgccttcc 1800 

cgatcagagt ctttttccaa cggcaactcc gagtctgtgc atcccgccct gcagagacca 1860 

gcggagccac aggttcctgt gagaacaaca tctcgctccc ctgttctgtc ccgtcgagat 1920 

tccccactgc agggcagrgg gcagcagaat agccaggcag gacagagaaa ctccaccagt 1980 

attgagccca ggcttctgtg ggagagagtg gagaagctgg tgcccagacc tggcagtggc 2040 

agctcctcag ggrccagcaa ctcaggatcc cagcccgggt ctcaccctgg gtctcagagt 2100 

^ggctccgggg aacgcttcag agtgaganca tcatccaagt ctgaaggctc tccatctcag 2160 

cgcctggaaa atgcagtgaa aaaacctgaa gataaaaagg aagttttcag acccctcaag 2220 

cctgctgatc tgaccgcact ggccaaagag cttcgagcag tggaagatgt acggccacct 2280 

cacaaagtaa cggactactc crcatccagt gaggagtcgg ggacgacgga tgaggaggac 2340 

gacgatgtgg agcaggaagg ggctgacgag tccacctcag gaccagagga caccagagca 2400 

gcgtcatctc tgaatttgag caatggtgaa acggaatctg tgaaaaccat gattgtccat 2460 

gatgatgtag aaagcgagcc ggccatgacc ccatccaagg agggcactcr aatcgtccgc 2520 

cggactcagt ccgctagtag cacactccag aaacacaaat crtcctcctc ctttacaccr 2580 

tttatagacc ccagattact acagatttct ccatctagcg gaacaacagi: gacatctgtg 2640 

gtgggatctt cctgtgatgg gatgagacca gaagccataa ggcaagatcc tacccggaaa 2700 

ggctcagcgg tcaatgtgaa tcctaccaac actaggccac agagtgacac cccggagatt 2760 

cgtaaataca . agaagaggtt taactctgag attctgtgtg ctgccttatg gggagtgaat 2820 

ttgctagtgg gtacagagag tggcctgatg ctgccggaca gaagtggcca agggaaggtc 2880 

tatccTictra ccaaccgaag acgacttcaa caaarggacg tacttgaggg cttgaatgtc 2940 

ttggtgacaa tatctggcaa aaaggataag rracgtgtct acta::ttgtc ctggttaaga 3000 

aataaaatac ttcacaatga rccagaagtt gagaagaagc agggatggac aaccgtaggg 3060 

gattrggaag gatgtgtaca utacaaagtt gtaaaatatg aaagaatcaa a'trctggrg 3120 
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artgcrrtga 
atggccttta 
gaggaaggcc 
gtggattcag 
ccccatgcaa 
gatgaggggg 
ggagagatgc 
aaggccarag 
agggctcaaa 
cggtctggtg 
tggtagaagc 
ttggaat tec 



agagttctgt 
agtcatttgg 
agaggttgaa 
gatcagtcta 
tcatcatcct 
tttatgcaaa 
ctacatcagt 
agatccgatc 
gactaaaatt 
gcagcagtca 
agtgtgatcc 
t tgtaact 



ggaagtctar 
agaartggta 
agtgatctat 
tgacatttat 
ccccaataca 
cacaratgga 
agcatatatt 
tgtggaaact 
cttgtgtgaa 
ggtttatttc 
agggattact 



gcgtgggcac 
cataagccat 
ggatcctgtg 
ctaccaacac 
gatggaatgg 
aggatcacca 
cgatccaatc 
ggtcacttgg 
cgcaatgaca 
atgaccttag 
ggcctccaga 



caaagccata 
tactggtgga 
ctggattcca 
atatccagtg 
agcttctggt 
aggatgtagt 
agacaatggg 
atggtgtgtt 
aggtgttctt 
gcaggacttc 
gtctt caaga 



t cacaaatt t 
t ctcactgtt 
tgctgttgat 
tagcatcaaa 
gtgctatgaa 
t ctacagtgg 
ctggggagag 
catgcacaaa 
tgccrctgtt 
tcttctgagc 
tcctgagaac 



3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3798 



<210> 10 
<211> 4055 
<212> DMA 

<213> .Mammalian (Human) ZC2 
<400> 10 



gctttcgggg 

gttatggatg 

aaatattctc 

ccaggcatgg 

gacccgatca 

agggaaatct 

aaagggcaaa 

agtgctcagc 

atggcaccag 

gacttgcggt 

gacatgcacc 

tctaagaagt 

agccagcgac 

gagcgacagg 

gaaaaagatg 

tcaggagagc 

ctgaggctgc 

cagcagcagc 

atcgaggagc 

ctgcggaagc 

gaggagagga 

agacaagcag 

cagcatcagc 

ggaatgagtc 

aaccggcaaa 

crgcccccaa 

atgctcagac 

gcctt gaccg 

gaggctctga 

tcggaaaatc 

caggaagaag 

gccagaaaga 

arcagagcaa 

aggaccagca 



aggtctatga 

tcacagggga 

atcaccggaa 

atgaccaact 

agaacacaaa 

tacgggggct 

atgtcLtgct 

rtgatcgaac 

aagttartgc 

ct- tgggr at 

ccatgagagc 

ggtcaaaaaa 

cagcaacaga 

tccgcat tea 

agacagagta 

ccagctccat 

agetggccaa 

gggagaarga 

agaaagagea 

agcaggagag 

ggcgtgcgga 

aaagact gca 

ggcaggagca 

ctagrgagaa 

gt tccectgc 

ggtcggagtc 

cagtcgarcc 

cctcccagtc 

acgtgacctc 

ctcctctccc 

acattccacc 

at retcetgg 

gcaaccctga 

gtggcagttc 



gggtcgtcat 

tgaagaggaa 

tattgctaca 

ttggttggtg 

aggtaacacg 

gagtcacctg 

gactgaaaat 

agtgggcagg 

etgtgatgaa 

eaccgccatt 

t etcttcctc 

attccagr ca 

acaattgatg 

aet caaggac 

tgagtacagt 

cctgaatctg 

caaggagcgt 

ggagcacaag 

gaggcggcgg 

ggagcagcgc 

gcatgaacag 

gaggcagcta 

gaggcctgtg 

gccagcatgg 

catgcctcac 

cttcagcat t 

ceagatecea 

agtgcacgag 

ccaccgcgtg 

cactcgcatt 

aaaggr gcct 

gaatggtagt 

rctceggaga 

cr ccaact cc 



gtcaaaacgg 

gaaatcaaac 

tactatggtg 

atggagtt t t 

ttgaaagagg 

caecagcata 

gcagaagtta 

aggaatactt 

aacccagatg 

gaaatggcag 

atcceccgga 

tttattgaga 

aagcat eeat 

catattgata 

ggaagtgagg 

ccaagggagt 

tctgaggccc 

cggcagetgc 

ctggaggagc 

cggcactatg 

gaatataagc 

aagcaagaaa 

gagaagaagc 

gccaaggagg 

aaggttgcca 

agtggagttc 

catctggtag 

cagcccacaa 

gagatgccac 

gaaaagtttg 

caaagaaeaa 

gctctgggae 

actgagceca 

agcaccecca 



gccagcttgc 

aagaaattaa 

et tttatcaa 

gtggtgctgg 

agtggartgc 

aagtgattca 

aactagtgga 

tcattggaac 

ccacatatga 

aaggtgctcc 

atccagcgcc 

gctgcr tggt 

tt atacgaga 

gaacaaagaa 

aagaagagga 

cgacgctgcg 

tacggaggca 

tggccgagcg 

aacaaaggcg 

aggagcagat 

gcaaacaatt 

gagactactt 

cactgtacca 

tagaagaaeg 

aeaggatacc 

agcctgctcg 

ctgtaaaate 

agggcctctc 

gccagaacrc 

accgaagctc 

cttctatatc 

ccagactagg 

tct tggagag 

get cccagcc 



agccatcaag 

catgttgaag 

aaagaaccca 

ctctgtcacc 

atacatctgc 

tcgagatatt 

ctttggagtc 

tccctactgg 

tttcaagagt 

ccctctctgt 

tcggctgaag 

aaagaatcac 

ccaacctaat 

gaagcgagga 

ggagaatgac 

gagggacttt 

gcagctggag 

r cagaagcgc 

agagaaggag 

gcgccgggag 

ggaagaacag 

agtttccctt 

t tacaaagaa 

gtcaaggctc 

t gaccccaac 

aacacccccc 

ccagggaect 

tgggtttcag 

agatcccacc 

ttggttacga 

cccagcatta 

atctcaaecc 

ccccttgcag 

cagcteccaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
198C - 
2040 
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ggaggctccc agcctggarc acaagcagga tccagtgaac gcaccagagt tcgagccaac 2100 

agtaagtcag aaggatcacc tgrgctcccc cargagcctg ccaaggcgaa accagaagaa 2160 

cccagggaca ttacccggcc cagrcgacca gccagctaca aaaaagctar agatgaggat 2220 

ctgacggcat tagccaaaga actaagagaa ctccggattg aagaaacaaa ccgcccaatg 2280 

aagaaggtga ctgatcactc ctccrccagt gaggagtcag aaagtagcga ggaagaggag 2340 

gaagatggag agagcgagac ccatgarggg acagtggctg tcagcgacat acccagacrg 2400 

ataccaacag gagctccagg cagcaacgag cagtacaarg tgggaarggr ggggacgcat 2460 

gggctggaga cctctcatgc ggacagttrc agcggcagta tttcaagaga aggaaccttg 2520 

atgattagag agacgtctgg agagaagaag cgatctggcc acagtgacag caatggcttt 2580 

gctggccaca tcaacctccc tgacctggtg cagcagagcc attctccagc tggaaccccg 2640 

actgagggac rggggcgcgt ctcaacccat tcccaggaga tggactctgg gactgaatac 2700 

ggcatgggga gcagcaccaa agccrccttc accccctttg tggaccccag agtataccag 2760 

acgtctccca ctgatgaaga tgaagaggat gaggaatcat cagccgcagc tctgtttact 2820 

ggcgaacttc ttaggcaaga acaggccaaa ctcaatgaag caagaaagat ttcggtggta 2880 

aatgtaaacc caaccaacat tcggcctcat agcgacacac cagaaatcag aaaatacaag 2940 
aaacgattca acrcagaaat actttgtgca gctctgtggg gtgtaaacct tctggtgggg ' 3000 

actgaaaatg gcctgatgct tttggaccga agtgggcaag gcaaagrcca taatctgatc 3060 

aaccggaggc gatttcagca gatggatgtg ctagagggac tgaacgtcct tgtgacaatt 3120 

tcaggaaaga agaa^aagct acgagtttac tatctttcat ggttaagaaa cagaatacta 3180 

cataatgacc cagaagtaga aaagaaacaa ggctggacca ctgttgggga cttggaaggc 3240 

tgtatacatt ataaagttgt raaatatgaa aggatcaaat ttttggrgat tgccttaaag 3300 

aatgctgtgg aaatatatgc rtgggctcct aaaccgtatc ataaattcat ggcatrtaag 3360 

tctttrgcag atctccagca caagcctctg ccagttgatc tcacggtaga agaaggtcaa 3420 

agattaaagg ttattttrgg ttcacacact ggtttccatg taattgatgt tgattcagga 3480 

aactcttatg atatctacac accatctcat attcagggca atatcactcc tcatgctatt 3540 

gtcatcttgc ctaaaacaga tggaatggaa atgcttgttt gctatgagga tgagggggtg 3600 

tatgraaaca cctatggccg gataactaag gatgtggtgc tccaatgggg agaaatgccc 3660 

acgtcrgtgg cctacattca trccaatcag ataatgggct ggggcgagaa agctattgag 3720 

atccggtcag tggaaacagg acatttggat ggagtattta tgcataagcg agctcaaagg 3780 

ctaaagtttc tatgrgaaag aaatgataag gtattttttg catccgtgcg atctggagga 3840 

agnagccaag tgtttttcat gaccctcaac agaaattcca tgatgaactg gtaacagaag 3900 

agcacttggc acrratcttc atggcgttat ttctaattta aaagaacata actcatgtgg 3960 

acttatgcca gtctagaggc agaatcagaa ggcttggttg aacatatcgc tttccctttt 4020 

t.C'::t-z'^cccz ccgaccazcc cagcacagtc catct 4055 



<210> 11 
<211> 4133 
<212> DNA 

<213> Mammalian (Human) ZC3 
<400> 11 



gcatt tgggg 
gt catggatg 
aagractctc 
ccgggaaacg 
gacctggtaa 
agggagatcc 
aaggggcaga 
agtgctcagc 
atggctccag 
gatatttggt 
gaca- gcacc 
tccaagaagt 



aggtgtatga 
tcacggagga 
accaccgcaa 
atgaccagct 
agaacacaaa 
tcaggggtcr 
at gngctgct 
rggaccgcac 
aggtcat cgc 
ct ctaggaat 
ccat gcgagc 
ggtccaagaa 



gggtcggcat 
cgaggaggaa 
cat cgccacc 
ctggctggtg 
aggcaacgcc 
ggcccatctc 
gacagagaat 
cgtgggcaga 
ctgtgatgag 
cacagccat c 
cct crtcctc 
gt ccat tgac 



gtcaagacgg 
gagatcaaac 
tactacggag 
atggagt tct 
ctgaaggagg 
catgcccaca 
gctgaggt ca 
cggaacactt 
aaccctga tg 
gaganggcag 
attcct cgga 
t ccattgaca 



ggcagctggc 
aggagatcaa 
ccttcatcaa 
gtggtgcugg 
actgtatcgc 
aggtgatcca 
agctagtgga 
tcatrgggac 
ccacctatga 
agggagcccc 
accct ccgcc 
catgtctcat 



tgccatcaag 
catgccgaaa 
gaagagcccc 
ttcagtgact 
ctatatctgc 
tcgagacatc 
t tttggggtg 
tccctactgg 
ttacaggagt 
ccctctgtgt 
caggct caag 
caagacttac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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ctgagccgcc cacccacgga gcagctacrg aagtttccct tcatccggga ccagcccacg 780 

gagcggcagg tccgcatcca gcttaaggac cacartgacc gatcccggaa gaagcggggt 840 

gagaaagagg agacagaata tgagtacagc ggcagcgagg aggaagatga cagccatgga 900 

gaggaaggag agccaagctc catcatgaac gtgcctggag agrcgactct acgccgggag 960 

tttcrccggc tccagcagga aaataagagc aactcagagg ctttaaaaca gcagcagcag 1020 

ctgcagcagc agcagcagcg agaccccgag gcacacatca aacacctgct gcaccagcgg 1080 

cagcggcgca tagaggagca gaaggaggag cggcgccgcg tggaggagca acagcggcgg 1140 

gagcgggagc agcggaagct gcaggagaag gagcagcagc ggcggctgga ggacatgcag 1200 

gctctgcggc gggaggagga gcggcggcag gcggagcgcg agcaggaata tatccgccac 1260 

aggctagagg aggagcagcg acagctcgag atccttcagc aacagctgct ccaggaacag 1320 

gccctgctgc tggaatacaa gcggaagcag ctggaggagc agcggcagtc agaacgtctc 1380 

cagaggcagc tgcagcagga gcatgcctac ctcaagtccc tgcagcagca gcaacagcag 1440 

cagcagcttc agaaacaaca gcagcagcag ctcctgcctg gggacaggaa gcccctgtac 1500 

cattatggtc ggggcatgaa tcccgctgac aaaccagcct gggcccgaga ggtagaagag 1560 

agaacaagga tgaacaagca gcagaactct cccttggcca agagcaagcc aggcagcacg 1620 

gggcctgagc cccccatccc ccaggcctcc ccagggcccc caggacccct ttcccagact 1680 

cctcctatgc agaggccggt ggagccccag gagggaccgc acaagagcct ggtggcacac 1740 

cgggtcccac cgaagccata tgcagcacct gtaccccgat cccagtccct gcaggaccag 1800 

cccacccgaa acctggctgc ctrcccagcc tcccatgacc ccgaccctgc carccccgca 1860 

cccactgcca cgcccagtgc ccgaggagct gtcatccgcc agaattcaga ccccacctct 1920 

gaaggacctg gccccagccc gaatccccca gcctgggtcc gcccagataa cgaggcccca 1980 

cccaaggtgc ctcagaggac ctcatctatc gccactgccc ttaacaccag tggggccgga 2040 

gggtcccggc cagcccaggc agtccgtgcc agacctcgca gcaactccgc ctggcaaatc 2100 

tatctgcaaa ggcgggcaga gcggggcacc ccaaagcctc cagggccccc tgctcagccc 2160 

ccrggcccgc ccaacgcctc tagtaacccc gacctcagga ggagcgaccc tggctgggaa 2220 

cgctcggaca gcgtccttcc agcctctcac gggcacctcc cccaggctgg ctcactggag 2280 

cggaaccgcg tgggagtctc ctccaaaccg gacagctccc ctgtgctctc ccctgggaat 2340 

aaagccaagc ccgacgacca ccgctcacgg ccaggccggc ccgcagactt tgtgttgctg 2400 

aaagagcgga ctctggacga ggcccctcgg ccrcccaaga aggccatgga ctactcgtcg 24 60 

tccagcgagg aggtggaaag cagtgaggac gacgaggagg aaggcgaagg cgggccagca 2520 

gaggggagca gagatacccc tgggggccgc gatggggata cagacagcgt cagcaccatg 2580 

gtggtccacg acgtcgagga gatcaccggg acccagcccc catacggggg cggcaccatg 2640 

grggtccagc gcacccctga agaggagcgg aacctgctgc atgctgacag caatgggtac 2700 

acaaacctgc ctgacgtggt ccagcccagc cacrcaccca ccgagaacag caaaggccaa 2760 

agcccaccct cgaaggatgg gagrggrgac taccagtctc gtgggctggt aaaggcccct 2820 

ggcaagagct cgttcacgat gtttgtggat ctagggatct accagcctgg aggcagtggg 2880 

gacagcatcc ccatcacagc cctagtgggt ggagagggca ctcggctcga ccagctgcag 2940 

tacgacgtga ggaagggttc tgtggtcaac gtgaatccca ccaacacccg ggcccacagt 3000 

gagacccctg agatccggaa gtacaagaag cgattcaact ccgagatcct ctgtgcagcc 3060 

ctttgggggg tcaacctgct ggtgggcacg gagaacgggc tgatgttgct ggaccgaagt 3120 

gggcagggca aggtgtatgg actcattggg cggcgacgct tccagcagat ggatgtgctg 3180 

gaggggctca acctgctcat caccatcrca gggaaaagga acaaactgcg ggtgtattac 3240 

ttgtcctggc tccggaacaa gattctgcac aatgacccag aagtggagaa gaagcagggc 3300 

cggaccaccg tgggggacat ggagggctgc gggcactacc gtgttgtgaa atacgagcgg 3360 

attaagttcc tggtcatcgc cctcaagagc tccgtggagg tgtatgcctg ggcccccaaa 3420 

ccctaccaca aattcatggc cttcaagtcc tttgccgacc tcccccaccg ccctctgctg 3480 

gtcgaccrga cagtagagga ggggcagcgg ctcaaggtca tccauggctc cagtgctggc 3540 

ttccatgctg tggatgtcga ctcggggaac agctatgaca tctacatccc tgtgcacatc 3600 

cagagccaga tcacgcccca tgccatcatc ctccrcccca acaccgacgg catggagatg 3660 

ctgctgtgct acgaggacga gggtgtctac gtcaacacgt acgggcgcat cattaaggat 3720 

gtggtgctgc agtgggggga gatgcctact tctgtggcct acatctgctc caaccagata 3780 

atgggctggg gtgagaaagc cattgagatc cgctctgtgg agacgggcca cctcgacggg 3840 

gtcttcatgc acaaacgagc tcagaggctc aagrrcctgt gtgagcggaa tgacaaggrg 3900 

tttttrgcct cagtccgctc tgggggcagc agccaagttt acttcatgac tccgaaccgt 3960 

aaccgcatca tgaactggtg acggggccct gggctggggc -gtcccacac tggacccagc 4020 

tccccccctg cagccaggcr tcccgggccg cccctctttc ccctcccugg qcttttgctt 4080 
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ttactggt-:t gatttcactg gagcctgctg ggaacgtgac ctctgacccc tga 



4133 



<210> 12 
<211> 1459 
<212> DNA 

<213> Mammalian (Human) 2C4 
<400> 12 



caatgttaac 

aaaggagttc 

aacccgatct 

aaggcgaaga 

ctcaggtcar 

gaacaa tgat 

caaagctatt 

aacatatatt 

ctttgargaa 

tgacctggct 

tcacctcatc 

ggaaatcatt 

gctcaccttc 

ccttgaaatg 

aggatggggc 

gctgaagcgc 

ctttacctct 

tgaagagctc 

aaacatcatg 

ttattttcag 

atgagaaatg 

gracaatgt c 

ttggacagt t 

atgaattaga 

agaaaaacat 



ccactctarg 

actr crgaga 

aatctatatc 

ccattccgcc. 

aagaacagac 

ccagaaagta 

gataagttaa 

gcaattgctt 

agcactgcta 

attggntctg 

gatgcagaat 

ataccacaga 

aatgctgaag 

tggaaagaca 

caaaaggcca 

aggtcaatta 

accctgcgca 

caaagcaat t 

cataggcagt 

taaagat taa 

cagctttatg 

t t tt gcacaa 

tggacagtat 

ttacaagcag 

cggcaaact 



tctctcctgc 

uctgctgtgg 

t gatggacag 

agattcaagt 

ttcgggrgta 

aaagaaggca 

caggctg-ga 

tgaaatcatc 

r taaagtatt 

aaaaaagact 

crgaggttat 

atatcatcat 

ccctctctgt 

taccatct tc 

ttgaagtgcg 

agaagctgag 

atcaccacag 

atgatgtcta 

ctgcatcttc 

atccaaaact 

tataaaatta 

actttgtaac 

ctgtattcag 

ccttcaaaag 



atgtaaaaaa 

ttctttgtgg 

aagtggaaag 

cttagagcca 

tcatctgacc 

agaagaaatg 

acacttcagt 

aattcacctt 

tccaacactt 

aaagattttc 

gtctgatgtg 

tttacctgat 

ggaagcaaat 

tatagctttt 

ctctttgcaa 

attcctgtgc 

ccgggtttac 

aaagtttcca 

agatt tcaga 

ttactttraa 

actatagcaa 

ttttgttact 

at tttacaac 

aattggcact 



ccacraatcc 

ggagtcaatt 

gcrgacatta 

ctcaatt tgc 

tggttgagga 

ctgaagacag 

gtcctccaac 

tatgcatggg 

gatcataagc 

ttcagctcag 

accccgccaa 

tgcttgggaa 

gaacaactct 

gaatgtacac 

tccagggttc 

acccggggtg 

ttcatgacac 

gtgatttatt 

gattaaatga 

tgragcacag 

gctctaggta 

gtgaattcaa 

atggagtaaa 

gggataagat 



acatgtatga 

tgctgttggg 

ctaaacttat 

tgattaccat 

acaagatttt 

aggaagcctg 

atgaagaaac 

caccaaagtc 

cagLgacagt 

cagatggata 

agaatcccct 

ttggcatgat 

t caagaagat 

agcgaaccac 

tggaaagtga 

acaagctgtt 

ttggaaaact 

accacattat 

gtatrcagtt 

aaragtttta 

ctccaatggt 

acattactct 

gaaacctgtt 

ttttcagaaa 



60 
120 
180 
240 
300 
360 
420 
430 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1459 



<210> 13 
<211> 1239 
<212> PRT 

<213> Mammalian (Human) ZCl 
<400> 13 

Met Ala Asn Asp Ser Pro Ala Lys Ser Leu Val Asp He Asp Leu Ser 
^5 10 15 

Ser Leu Arg Asp Pro Ala Gly lie Phe Glu Leu Val Glu Val Val Gly 
20 25 30 

Asn Gly Thr Tyr Gly Gin Val Tyr Lys Gly Arc His Val Lys Thr Gly 
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35 40 45 

Gin Leu Ala Ala lie Lys Val Met Asp Val Thr Glu Asp Glu Glu Glu 
50 55 - 60 

Glu lie Lys Leu Glu lie Asn Met Leu Lys Lys Tyr Ser His His Arg 
65 70 75 80 

Asn lie Ala Thr Tyr Tyr Giy Ala Phe lie Lys Lys Ser Pro Pro Giy 
85 90 95 

His Asp Asp Gin Leu Trp Leu Val Met Glu Phe Cys Giy Ala Giy Ser 
100 105 110 

lie Thr Asp Leu Val Lys Asn Thr Lys Giy Asn Thr Leu Lys Glu Asp 
115 120 125 

Trp He Ala Tyr He Ser Arg Glu He Leu Arg Giy Leu Ala His Leu 

. . 130 135 140 

His He His His Val He His Arg Asp He Lys Giy Gin Asn Val Leu 

145 150 155 160 

Leu Thr Glu Asn Ala Glu Val Lys Leu Val Asp Phe Giy Val Ser Ala 
165 170 175 

Gin Leu Asp Arg Thr Val Giy Arg Arg Asn Thr Phe He Giy Thr Pro 
180 185 190 

Tyr Trp Met Ala Pro Glu Val He Ala Cys Asp Glu Asn Pro Asp Ala 
195 ■ 200 205 

Thr Tyr Asp Tyr Arg Ser Asp Leu Trp Ser Cys Giy He Thr Ala He 
210 215 220 

Glu Met Ala Glu Giy Ala Pro Pro Leu Cys Asp Met His Pro Met Arg 
225 230 235 240 

Ala Leu Phe Leu He Pro Arg Asn Pro Pro Pro Arg Leu Lys Ser Lys 
245 250 255 

Lys Trp Ser Lys .Lys Phe Phe Ser Phe He Glu Giy Cys Leu Val Lys 
260 265 270 

Asn Tyr Met Gin Arg Pro Ser Thr Giu Gin Leu Leu Lys His Pro Phe 
275 280 285 

He Arg Asp Gin Pro Asn Glu Arg Gin Val Arg lie Gin Leu Lvs Asp 
290 295 300 

His He Asp Arg Thr Arg Lys Lys Arg Giy Glu Lys Asp Glu Thr Glu 
^05 310 315 320 

Tyr Glu Tyr Ser Giy Ser Glu Glu Glu Giu Glu Glu Val Pro Glu Gin 
325 330 335 
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Giu Gly Giu Pro Ser Ser lie Val Asn Val Pro Giy Glu Ser Thr Leu 
340 345 350 



Arg Arg Asp Phe 
355 

Ala Leu Arg Arg 
370 

Glu Glu Tyr Lys 
385 

Gin Gin Lys Glu 



Arg Glu Ala Arg 
420 

Glu Glu Lys .Arg 
435 

Glu Glu Arg Arg 
450 

Gin Glu Tyr lie 
465 

Val Leu Gin Gin 



Arg Trp Arg Glu 
500 



Leu Arg Leu Gin 
360 

Gin Gin Leu Leu 
375 

Arg Gin Leu Leu 
390 

Gin Arg Arg Arg 
405 

Arg Gin Gin Glu 



Arg Leu Glu Glu 
440 

Arg Ala Glu Glu 
455 

Arg Arg Gin Leu 
470 

Gin Leu Leu Gin 
485 

Met Glu Glu His 



Gin Glu Asn Lys 



Gin Glu Gin Gin 
380 

Ala Glu Arg Gin 
395 

Leu Glu Glu Gin 
410 

Arg Glu Gin Arg 
425 

Leu Glu Arg Arg 



Glu Lys Arg Arg 
4 60 

Glu Glu Glu Gin 
475 

Glu Gin Ala Met 
490 

Arg Gin Ala Glu 
505 



Glu Arg Ser Glu 
365 

Leu Arg Glu Gin 



Lys Arg lie Glu 
400 

Gin Arg Arg Glu 
415 

Arg Arg Glu Gin 
430 

Arg Lys Glu Glu 
445 

Val Giu Arg Glu 



Arg His Leu Glu 
480 

Leu Leu Glu Cys 
495 

Arg Leu Gin Arg 
510 



Gin Leu Gin Gin Glu Gin Ala Tyr 
515 520 

Arg Arg Pro His Pro Gin His Ser 
530 535* 

Glu Arg Ser Lys Pro Ser Phe His 
545 550 

Glu Pro Ala Asp Arg Ala Arg Glu 
565 

Asn His Ser Ser Pro Glu Ala Gin 
580 

Glu Pro Pro Val Pro Ser Arg Ser 

595 600 

Glu Ser Val His Pro Ala Leu Gin 
610 615 

Val Arg Thr Thr Ser Arg Ser Pro 



Leu Leu Ser Leu Gin His Asp His 
525 

Gin Gin Pro Pro Pro Pro- Gin Gin 
540 

Ala Pro Glu Pro Lys Ala His Tyr 
555 560 

Val Giu Asp Arg Phe Arg Lys Thr 
570 575 

Ser Lys Gin Thr Gly Arg Val Leu 
585 590 

Glu Ser Phe Ser Asn Gly Asn Ser 
605 

Arg Pro Ala Giu Pro Gin Val Pro 
620 

Val Leu Ser Arg Arg Asp Ser Pro 
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625 630 635 640 

Leu Gin Gly Ser Gly Gin Gin Asn Ser Gin Ala Gly Gin Arg Asn Ser 
645 650 655 

Thr Ser lie Glu Pro Arg Leu Leu Trp Giu Arg Val Glu Lys Leu Vai 
660 665 670 

Pro Arg Pro Gly Ser Gly Ser Ser Ser Gly Ser Ser Asn Ser Gly Ser 
675 680 685 

Gin Pro Gly Ser His Pro Gly Ser Gin Ser Gly Ser Gly Glu Arg Phe 
690 695 700 

Arg Val Arg Ser Ser Ser Lys Ser Glu Gly Ser Pro Ser Gin Arg Leu 
705 710 715 720 

Giu Asn Ala Val Lys Lys Pro Glu Asp Lys Lys Glu Val Phe Arg Pro 
725 730 735 

Leu Lys Pro Ala Asp Leu Thr Ala Leu Ala Lys Glu Leu Arg Ala Val 
740 745 750 

Glu Asp Vai Arg Pro Pro His Lys Val Thr Asp Tyr Ser Ser Ser Ser 
, 755 760 ' 765 

Glu Glu Ser Gly Thr Thr Asp Glu Glu Asp Asp Aso Val Glu Gin Glu 
■ 770 775 . . 780 



Gly Ala Asp Glu Ser Thr Ser Gly Pro Glu Asp Thr Arg Ala Ala Ser 
^85 790 795 800 

Ser Leu Asn Leu Ser Asn Gly Glu Thr Glu Ser Val Lys Thr Met lie 
805 810 815 

Val His Asp Asp Vai Glu Ser Glu Pro Ala Met Thr Pro Ser Lys Glu 
820 825 830 

Gly Thr Leu lie Val Arg Arg Thr Gin Ser Ala Ser Ser Thr Leu Gin 
835 840 845 

Lys His Lys Ser Ser Ser Ser Phe Thr Pro Phe He Asp Pro Arg Leu 
850 855 860 

Leu Gin lie Ser Pro Ser Ser Giv Thr Thr Val Thr Ser Val Vai Gly 

870 875 880 

Phe Ser Cys Asp Gly Met Arg Pro Glu Ala He Arg Gin Asp Pro Thr 
885 890 ^ 895 

Arg Lys Gly Ser Val Val Asn Vai Asn Pro Thr Asn Thr Arg Pro Gin 
900 905 910 

Ser Asp Thr Pro Glu He Arg Lys Tyr Lvs Lys Arg Phe Asn Ser Glu 
915 920 925 
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lie Leu Cys Ala Ala Leu Trp Gly Val Asn Leu Leu Val Gly Thr Glu 
930 935 

Ser Gly Leu Met Leu Leu Asp Arg Ser Gly Gin Gly Lys Val Tyr Pro 

950 955 960 

Leu He Asn Arg Arg Arg Phe Gin Gin Met Asp Val Leu Glu Gly Leu 
965 970 975 

Asn Val Leu Val Thr He Ser Gly Lys Lys Asp Lys Leu Arg Val Tyr 
980 985 990 

Tyr Leu Ser Trp Leu Arg Asn Lys He Leu His Asn Asp Pro Glu Val 
995 1000 1005 

Glu Lys Lys Gin Gly Trp Thr Thr Val Gly Asp Leu Glu Gly Cys Val 
1010 1015 1020 

His Tyr Lys Val Val Lys Tyr Glu Arg He Lys Phe Leu Val He Ala 
1025 1030 1035 1040 

Leu Lys Ser Ser Val Glu Val Tyr Ala Trp Ala Pro Lys Pro Tyr His 
1045 1050 1055 

Lys Phe Met Ala Phe Lys Ser Phe Gly Glu Leu Val His Lys Pro Leu 
1060 1065 1070 

Leu Val Asp Leu Thr Val Glu Glu Gly Gin Arg Leu Lys Val He Tyr 
1075 1080 1085 

Gly Ser Cys Ala Gly Phe His Ala Val Asp Val Asp Ser Gly Ser Val 
1090 1095 1100 

Tyr Asp He Tyr Leu Pro Thr His He Gin Cys Ser He Lys Pro His 
110^ 1110 1115 1120 

Ala He He He Leu Pro Asn Thr Asp Gly Met Glu Leu Leu Val Cys 
1125 1130 1135 

Tyr Glu Asp Glu Gly Val Tyr Val Asn Thr Tyr Gly Arg He Thr Lys 
1140 1145 1150 

Asp Val Val Leu Gin Trp Gly Glu Met Pro Thr Ser Val Ala Tyr He 
1155 1160 1165 

Arg Ser Asn Gin Thr Met Gly Trp Gly Giu Lys Ala He Glu He Arg 
11*70 1175 1180 

Ser Val Glu Thr Gly His Leu Asp Gly Val Phe Met His Lvs Arg Ala 
119^ 1190 ■ 1195 ^ 1200 

Gin Arg Leu Lys Phe Leu Cys Glu Arg Asn Asp Lys Val Phe Phe Ala 
1205 1210 ' 1215 
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Ser Val Arg Ser Gly Gly Ser Ser Gin Val Tyr Phe Met Thr Leu Gly 
1220 1225 1230 

Arg Thr Ser Leu Leu Ser Trp 
1235 



<210> 14 
<211> 1297 
<212> PRT 

<213> Mammalian (Human) ZC2 
<400> 14 

Ala Phe Gly Glu Val Tyr Glu Gly Arg His Val Lys Thr Gly Gin Leu 
1 5 10 15 

Ala Ala He Lys Val Met Asp Val Thr Gly Asp Glu Glu Glu Glu He 
20 25 ' 30 

Lys Gin Glu He Asn Met Leu Lys Lys Tyr Ser His His Arg Asn He 
35 40 45 

Ala Thr Tyr Tyr Gly Ala Phe He Lys Lys Asn Pro Pro Gly Met Asp 
50 55 50 

Asp Gin Leu Trp, Leu Val Met Giu Phe Cys Gly Ala Gly Ser Val Thr 
^5 70 75 80 

Asp Leu He Lys Asn Thr Lys Gly Asn Thr Leu Lys Glu Glu Trp He 
85 90 95 

Ala Tyr He Cys Arg Glu He Leu Arg Gly Leu Ser His Leu His Gin 
100 105 110 

His Lys Val He His Arg Asp He Lys Gly Gin Asn Val Leu Leu Thr 
115 120 125 

Giu Asn Ala Glu Val Lys Leu Val Asp Phe Gly Val Ser Ala Gin Leu 
130 135 140 

Asp Arg Thr Val Gly Arg Arg Asn Thr Phe He Gly Thr Pro Tyr Trp 

150 155 160 

Met Ala Pro Giu Val He Ala Cys Asp Glu Asn Pro Asp Ala Thr Tyr 
165 170 175 

Asp Phe Lys Ser Asp Leu Trp Ser Leu Gly He Thr Ala He Giu Met 
180 185 190 

Ala Glu Gly Ala Pro Pro Leu Cys Asp Met His Pro Met Arg Ala Leu 
195 200 205 

Phe Leu He Pro Arg Asn Pro Ala Pro Arg Leu Lys Ser Lys Lys Trp 
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210 215 220 

Ser Lys Lys Phe Gin Ser Phe He Glu Ser Cys Leu Vai Lys Asn His 
225 230 235 240 

Ser Gin Arg Pro Ala Thr Glu Gin Leu Met Lys His Pro Phe He Arg 
245 250 255 

Asp Gin Pro Asn Glu Arg Gin Val Arg lie Gin Leu Lys Asp His He 
260 265 270 

Asp Arg Thr Lys Lys Lys Arg Gly Glu Lys Asp Glu Thr Glu Tyr Glu 
275 280 285 

Tyr Ser Gly Ser Glu Glu Glu Glu Glu Glu Asn Asp Ser Gly Glu Pro 
290 295 300 



Ser Ser He Leu Asn Leu Pro Arg Glu Ser Thr Leu Arg Arg Asp Ph^ 
305 310 315 320 

Leu Arg Leu Gin Leu Ala Asn Lys Glu Arg Ser Glu Ala Leu Arg Arg 
325 330 335 

Gin Gin Leu Glu Gin Gin Gin Arg Glu Asn Glu Glu His Lys Arg Gin 
340. 345 350 

Leu Leu Ala Glu Arg Gin Lys Arg He Glu Glu Gin Lys Glu Gin Arg 
355 360 365 

Arg Arg Leu Glu Glu Gin Gin Arg Arg Glu Lys Glu Leu Arg Lys Gin 
370 375 380 

Gin Glu Arg Glu Gin Arg Arg His Tyr Glu Giu Gin Met Arg A-g Glu 

390 395 400 

Giu Giu Arg Arg Arg Ala Glu His Giu Gin Glu Tyr Lys Arg Lys Gin 
405 410 415 

Leu Glu Giu Gin Arg Gin Ala Giu Arg Leu Gin Arg Gin Leu Lys Gin 
^20 425 430 

Glu Arg Asp Tyr Leu Val Ser Leu Gin His Gin Arg Gin Giu Gin Arg 
435 440 445 

Pro Vai Giu Lys Lys Pro Leu Tyr His Tyr Lys Glu Gly Met Ser Pro 
450 455 460 

Ser Glu Lys Pro Ala Trp Ala Lys Glu Vai Glu Glu Arg Ser Arg Leu 
465 470 475 480 

Asn Arg Gin Ser Ser Pro Ala Met Pro His Lys Val Ala Asn Arg He 
485 490 495 

Ser Asp Pro Asn Leu Pro Pro Arg Ser Giu Ser Phe Ser lie Ser Gly 
500 505 510 
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Val Gin Pro Ala Arg Thr Pro Pro Met Leu Arg Pro Vai Asp Pro Gin 
515 520 525 

lie Pro His Leu Val Ala Val Lys Ser Gin Gly Pro Ala Leu Thr Ala 
530 535. 540 

Ser Gin Ser Val His Glu Gin Pro Thr Lys Gly Leu Ser Gly Phe Gin 
545 550 555 560 

Glu Ala Leu Asn Val Thr Ser His Arg Val Glu Met Pro Arg Gin Asn 
565 570 575 



Ser Asp Pro Thr Ser Glu Asn Pro Pro Leu Pro Thr Arg lie Glu Lys 
580 585 " 590 

Phe Asp Arg Ser Ser Trp Leu Arg Gin Glu Glu Asp lie Pro Pro Lys 
595-. 600. 605 

Val Pro Gin Arg Thr Thr Ser lie Ser Pro Ala Leu Ala Arg Lys Asn 
610 615 620 

Ser Pro Gly Asn Gly Ser Ala Leu Gly Pro Arg Leu Gly Ser Gin Pro 
625 630 635 640 

lie Arg Ala Ser Asn Pro Asp Leu Arg Arg Thr Glu Pro lie Leu Glu 
645 650 655 

Ser Pro Leu Gin Arg Thr Ser Ser Gly Ser Ser Ser Ser Ser Ser Thr 
660 665 670 

Pro Ser Ser Gin Pro Ser Ser Gin Gly Gly Ser Gin Pro Gly Ser Gin 
675 680 685 

Ala Gly Ser Ser Glu Arg Thr Arg Val Arg Ala Asn Ser Lys Ser Glu 
690 695 700 

Gly Ser Pr.-- Val Leu Pro His Glu Pro Ala Lys Val Lys Pro Glu Glu 
705 710 715 720. 

Ser Arg Asp lie Thr Arg Pro Ser Arg Pro Ala Ser Tyr Lys Lys Ala 
725 730 735 

lie Asp Glu Asp Leu Thr Ala Leu Ala Lys Glu Leu Arg Glu Leu Arg 
740 745 750 

lie Glu Glu Thr Asn Arg Pro Met Lys Lys Vai Thr Asp Tyr Ser Ser 
755 760 765 

Ser Ser Glu Glu Ser Glu Ser Ser Glu Glu Glu Glu Glu Asp Gly Glu 
770 775 780 

Ser Glu Thr His Asp Gly Thr Val Ala Val Ser Asp lie Pro Arg Leu 
785 790 795 ~ 800 
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805 

Vai Giy Thr His Giy Leu Giu Thr Ser 
820 825 

Ser lie Ser Arg Giu Giy Thr Leu Met 
835 840 



PCTAiS99/0815a 
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Giu Gin Tyr Asn Vai Giy Met 
810 815 

His Ala Asp Ser Phe Ser Giy 
830 

lie Arg Giu Thr Ser Giy Giu 
845 



Lys Lys Arg Ser Giy His Ser Asp Ser Asn Giy Phe Aia Giy His lie 
850 855 860 

Asn Leu Pro Asp Leu Vai Gin Gin Ser His Ser Pro Aia Giy Thr Pro 

870 875 880 

Thr Giu Giy Leu Giy Arg Vai Ser Thr His Ser Gin Giu Met Asd Ser 
885 890 895 

Giy Thr Giu Tyr Giy Met Giy Ser Ser Thr Lys Aia Ser Phe Thr Pro 
900 905 910 

Phe Vai Asp Pro Arg Vai Tyr Gin Thr Ser Pro Thr Asp Giu Asp Giu 
915 920 925 

Giu Asp Giu Giu Ser Ser Aia Ala Aia Leu Phe Thr Giy^ Giu Leu Leu 
930 935 

Arg Gin Giu Gin Aia Lys Leu Asn Giu Ala Arg Lys lie Ser Vai Vai 

950 955 960 

Asn Vai Asn Pro Thr Asn lie Arg Pro His Ser Asp Thr Pro Giu lie 
965 970 975 

Arg Lys Tyr Lys Lys Arg Phe Asn Ser Giu lie Leu Cys Ala Aia Leu 
980 985 990 

Trp Giy Vai Asn Leu Leu Vai Giy Thr Giu Asn Giy Leu Met Leu Leu 
995 1000 1005 

Asp Arg Ser Giy Gin Giy Lys Vai Tyr Asn Leu lie Asn Arg Arg Arg 
1010 1015 1020 

Phe Gin Gin Met Asp Vai Leu Giu Giy Leu Asn Vai Leu Vai Thr lie 
^025 1030 1035 ^Q4Q 

Ser Giy Lys Lys Asn Lys Leu Arg Vai Tyr Tyr Leu Ser Trp Leu Arg 
1045 1050 1055 

Asn Arg lie Leu His Asn Asp Pro Giu Vai Giu Lys Lys Gin Giy Trp 
1060 1065 1070 

lie Thr Vai Giy Asp Leu Giu Giy Cys lie His Tyr Lvs Vai Vai Lys 
1075 1080 1085 



Tyr Giu Arg lie Lys Phe Leu Vai lie Aia Leu Lvs Asn 



Ala Vai Giu 
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1090 1095 1100 

He Tyr Ala Trp Ala Pro Lys Pro Tyr His Lys Phe Met Ala Phe Lys 

1105 1110 1115 1120 



Ser Phe Ala Asp Leu Gin His Lys Pro Leu Leu Val Asp Leu Thr Val 
1125 1130 1135 

Glu Glu Gly Gin Arg Leu Lys Val He Phe Gly Ser His Thr Gly Phe 
1140 1145 1150 

His Val He Asp Val Asp Ser Gly Asn Ser Tyr Asp He Tyr Thr Pro 
1155 1160 1165 



Thr Pro His Ala He Val He Leu Pro 
1180 



Ser His He Gin Gly Asn He 
1170 1175 

Lys Thr Asp Gly Met Glu Met Leu 
1185 1190 

Tyr Val Asn Thr Tyr Gly Arg He 
1205 

Gly Glu Met Pro Thr Ser Val Ala 
1220 



Val Cys Tyr Glu Asp Glu Gly Val 
1195 1200 

Thr Lys Asp Val Val Leu Gin Trp 
1210 1215 

Tyr He His Ser Asn Gin He Met 
1225 1230 



Gly Trp Gly Glu Lys Ala He Glu He Arg Ser Val Glu Thr Gly His 
1235 1240 1245 

Leu Asp Gly Val Phe Met His Lys Arg Ala Gin Arg Leu Lys Phe Leu 
1250 1255 1260 

Cys Glu Arg Asn Asp Lys Val Phe Phe Ala Ser Val Arg Ser Gly Gly 
1265 1270 1275 1280 

Ser Ser Gin Val Phe Phe Met Thr Leu Asn Arg Asn Ser Met Met Asn 
1285 ■ 1290 1295 

Trp 



<210> 15 
<211> 1326 
<212> PRT 

<213> Mammalian (Human) 2C3 
<400> 15 

Ala Phe Gly Glu Val Tyr Glu Gly Arg His Val Lys Thr Gly Gin Leu 
15 10 15 

Ala Ala He Lys Val Met Asp Val Thr Glu Asp Glu Glu Glu Glu He 
20 25 30 
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Lys Gin Glu lie Asn Met Leu Lys Lys Tyr Ser His His Arg Asn lie 
35 40 45 

Ala Thr Tyr Tyr Giy Ala Phe lie Lys Lys Ser Pro Pro Giy Asn Asp 
50 55 60 

Asp Gin Leu Trp Leu Vai Met Glu Phe Cys Giy Ala Giy Ser Val Thr 
65 70 75 80 

Asp Leu Val Lys Asn Thr Lys Giy Asn Ala Leu Lys Glu Asp Cys He 
85 90 95 

Ala Tyr He Cys Arg Glu He Leu Arg Giy Leu Ala His Leu His Ala 
100 105 110 

His Lys Val lie His Arg Asp He Lys Giy Gin Asn Vai Leu Leu Thr 
115 120 125 

Glu Asn Ala -Glu Val Lys Leu Val Asp Phe Giy Val Ser Ala Gin Leu 
130 135 140 

Asp Arg Thr Val Giy Arg Arg Asn Thr Phe He Giy Thr Pro Tyr Trp 

150 155 160 

Met Ala Pro Glu Val He Ala Cys Asp Glu Asn Pro Asp Ala Thr Tyr 
165 170 175 

Asp Tyr Arg Ser Asp He Trp Ser Leu Giy He Thr Ala He Glu Met 
180 185 190 

Ala Glu Giy Ala Pro Pro Leu Cys Asp Met His Pro Met Arg Ala Leu 
195 200 205 

Phe Leu He Pro Arg Asn Pro Pro Pro Arg Leu Lys Ser Lvs Lys Trp 
210 215 220 

Ser Lys Lys Phe He Asp Phe He Asp Thr Cys Leu He Lys Thr Tyr 
225 230 235 240 

Leu Ser Arg Pro Pro Thr Glu Gin Leu Leu Lys Phe Pro Phe He Arg 
245 250 255 

Asp Gin Pro Thr Glu Arg Gin Val Arg He Gin Leu Lys Asp His He 
260 265 270 

Asp Arg Ser Arg Lys Lys Arg Giy Glu Lys Glu Glu Thr Glu Tyr Glu 
275 280 285 

Tyr Ser Giy Ser Glu Glu Glu Asd Asp Ser His Giy Glu Glu Giy Glu 
290 295 300 

Pro Ser Ser He Met Asn Vai Pro Giy Glu Ser Thr Leu Arg Arg Glu 
305 310 315 320 

Phe Leu Arg Leu Gin Gin Glu Asn Lys Ser Asn Ser Glu Ala Leu Lys 
325 330 335 
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Gin Gin Gin Gin Leu Gin Gin Gin Gin Gin Arg Asp Pro Giu Aia His 
340 345 350 

lie Lys His Leu Leu His Gin Arg Gin Arg Arg lie Giu Giu Gin Lys 
355 360 365 

Giu Giu Arg Arg Arg Vai Giu Giu Gin Gin Arg Arg Giu Arg Giu Gin 
370 375 380 

Arg Lys Leu Gin Giu Lys Giu Gin Gin Arg Arg Leu Giu Asp Met Gin 
385 390 395 400 

Aia Leu Arg Arg Giu Giu Giu Arg Arg Gin Aia Giu Arg Giu Gin Giu 
405 4iO 4i5 

Tyr lie Arg His Arg Leu Giu Giu Giu Gin Arg Gin Leu Giu lie Leu 
420 425 430 

Gin Gin Gin Leu Leu Gin Giu Gin Aia Leu Leu Leu Giu Tyr Lys Arg 
435 440 445 

Lys Gin Leu Giu Giu Gin Arg Gin Ser Giu Arg Leu Gin Arg Gin Leu 
4.50 455 460 

Gin Gin Giu His Aia Tyr Leu Lys Ser Leu Gin Gin Gin Gin Gin Gin 
465 470 475 480 

Gin Gin Leu Gin Lys Gin Gin Gin Gin Gin Leu Leu Pro Giy Asp Arg 
485 490 495 

Lys Pro Leu Tyr His Tyr Giy Arg Giy Met Asn Pro Aia Asp Lys Pro 
500 505 5i0 

Aia Trp Aia Arg Giu Vai Giu Giu Arg Thr Arg Met Asn Lys Gin Gin 
515 520 525 

Asn Ser Pro Leu Aia Lys Ser Lys Pro Giy Ser Thr Giy Pro Giu Pro 
530 535 540 

Pro lie Pro Gin Ala Ser Pro Giy Pro Pro Giy Pro Leu Ser Gin Thr 
545 550 555 560 

Pro Pro Met Gin Arg Pro Vai Giu Pro Gin Giu Giy Pro His Lys Ser 
565 570 ' 575 

Leu Vai Aia His Arg Vai Pro Leu Lys Pro Tyr Aia Ala Pro Vai Pro 
580 585 590 



Arg Ser Gin Ser Leu Gin Asp Gin Pro Thr Arg Asn Leu Aia Aia Phe 
595 600 605 



Pro Aia Ser His Asp Pro Asp Pro Aia lie Pro Aia Pro Thr Aia Thr 
610 615 620 
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Pro Ser Aia Arg Gly Ala Val lie Arg Gin Asn Ser Asp Pro Thr Ser 
625 630 635 ' 640 

Glu Gly Pro Gly Pro Ser Pro Asn Pro Pro Ala Trp Val Arg Pro Asp 
645 650 655 

Asn Glu Ala Pro Pro Lys Val Pro Gin Arg Thr Ser Ser lie Ala Thr 
660 665 670 

Ala Leu Asn Thr Ser Gly Ala Gly Gly Ser Arg Pro Ala Gin Ala Val 
675 680 685 

Arg Ala Arg Pro Arg Ser Asn Ser Ala Trp Gin He Tyr Leu Gin Arg 
690 695 700 

Arg Ala Glu Arg Gly Thr Pro Lys Pro Pro Gly Pro Pro Ala Gin Pro 
705 710 715 720 

Pro Gly Pro Pro Asn Ala Ser Ser Asn Pro- Asp Leu Arg Ara Ser Asp 
725 730 ^ 735 

Pro Gly Trp Glu Arg Ser Asp Ser Val Leu Pro Ala Ser His Gly Ris 
740 745 750 

Leu Pro Gin Ala Gly Ser Leu Glu Arg Asn Arg Val Gly Val Ser Ser 
755 760 765 

Lys Pro Asp Ser Ser Pro Val Leu Ser Pro Gly Asn Lys Aia Lys Pro 
770 775 780 

Asp Asp His Arg Ser Arg Pro Gly Arg Pro Ala Asd Phe Val Leu Leu 
785 790 795 800 

Lys Glu Arg Thr Leu Asp Glu Ala Pro Arg Pro Pro Lys Lys Ala Met 
805 810 815 

Asp Tyr Ser Ser Ser Ser Glu Glu Val Glu Ser Ser Glu Asp Asp Glu 
820 825 830 

Glu Glu Gly Glu Gly Gly Pro Ala Glu Gly Ser Arg Asp Thr Pro Gly 
835 840 845 

Gly Arg Asp Gly Asp Thr Asp Ser Val Ser Thr Met Val Val His Asp 
850 855 860 



Val Glu Glu He Thr 
865 

Val Val Gin Arg Thr 
885 

Ser Asn Gly Tyr Thr 
900 

Pro Thr Glu Asn Ser 



Gly Thr Gin Pro Pro 
870 

Pro Glu Glu Glu Arg 
890 

Asn Leu Pro Asp Val 
905 

Lys Gly Gin Ser Pro 



Tyr Gly Gly Gly Thr Met 
875 880 

Asn Leu Leu His Ala Asp 
895 

Val Gin Pro Ser His Ser 
910 

Pro Ser Lys Asp Gly Ser 
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S15 920 925 

Gly Asp Tyr Gin Ser Arg Giy Leu Vai Lys Ala Pro Gly Lys Ser Se^ 
930 935 9A0 

Phe Thr Met Phe Val Asp Leu Gly He Tyr Gin Pro Glv Gly Ser Gly 
945 950 955 ' 960 

Asp Ser lie Pro He Thr Ala Leu Val Gly Gly Glu Gly Thr Arg Leu 
965 970 975 

Asp Gin Leu Gin Tyr Asp Val Arg Lys Gly Ser Val Vai Asn Val Asn 
980 985 990 

Pro Thr Asn Thr Arg Ala His Ser Glu Thr Pro Glu He Arg Lys Tyr 
995 1000 1005 

Lys Lys Arg Phe Asn Ser Glu He Leu Cys Ala Ala Leu Trp Gly Val 
1010 1015 1020 

Asn Leu Leu Val Gly Thr Glu Asn Giy Leu Met Leu Leu Asp Arg Ser 
1025 1030 1035 " 1040 

Gly Gin Giy Lys Val Tyr Gly Leu He Gly Arg Arg Arg Phe Gin Gin 
1045 1050 1055 

Met Asp Val Leu Glu Gly Leu Asn Leu Leu He Thr He Ser Gly Lys 
1060 1065 1070 

Arg Asn Lys Leu Arg Val Tyr Tyr Leu Ser Trp Leu Arg Asn Lys He 
1075 1080 1085 

Leu His Asn Asp Pro Glu Val Glu Lys Lys Gin Gly Trp Thr Thr Val 
109G 1095 HOG 

Gly Asp Met Glu Gly Cys Giy His Tyr Arg Vai Val Lys Tyr Glu Arg 
1105 1110 1115 1120 

He Lys Phe Leu Val He Ala Leu Lys Ser Ser Val Glu Val Tyr Ala 
1125 1130 . 1135 



Trp Ala Pro Lys Pro Tyr His Lys Phe Met Ala Phe Lys Ser Phe Ala 
1140 1145 1150 

Asp Leu Pro His Arg Pro Leu Leu Val Asp Leu Thr Val Glu Glu Gly 
1155 1160 1165 

Gin Arg Leu Lys Val He Tyr Gly Ser Ser Ala Gly Phe His Ala Val 
11*70 1175 1180 

Asp Val Asp Ser Gly Asn Ser Tyr Asp He Tyr He Pro Vai His He 
1185 1190 1195 1200 



Gin Ser Gin He Thr Pro His Ala He He Phe Lei 



eu Pro Asn Thr Asp 
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1205 1210 1215 

Gly Met Giu Met Leu Leu Cys Tyr Glu Asp Glu Gly Val Tvr Val Asn 
1220 1225 123C 

Thr Tyr Gly Arg lie He Lys Asp Val Vai Leu. Gin Trp Gly Giu Met 
1235 1240 1245 

Pro Thr Ser Val Ala Tyr He Cys Ser Asn Gin He Met Gly Trp Gly 
1250 1255 1260 

Glu Lys Aia lie Glu He Arg Ser Vai Glu Thr Gly His Leu Asp Gly 
1265 1270 1275 1280 

Val Phe Met His Lys Arg Aia Gin -Arg Leu Lys Phe Leu Cys Giu Arg 
1285 1290 1295 

Asn Asp Lys Val Phe Phe Aia Ser Vai Arg Ser Gly Gly Ser Ser Gin 
1300 1305 1310 

Vai Tyr Phe Met Thr Leu Asn Arg Asn Arg He Met Asn Trp 
1315 1320 1325 



<210> 16 
<211> 349 
<212> PRT 

<213> Mammalian (Human) ZC4 
<400> 16 

Asn Vai Asn Pro Leu Tyr Vai Ser Pro Aia Cvs Lys Lys Pro Leu He 
1 5 10 " 15 

His Met Tyr Giu Lys Giu Phe Thr Ser Giu He Cys Cys Giy Ser Leu 
20 25 30 

Trp Giy Vai Asn Leu Leu Leu Giy Thr Arg Ser Asn Leu Tyr Leu Met 
35 40 45 

Asp Arg Ser Gly Lys Aia Asp He Thr Lys Leu He Arg Arg Arg Pro 
50 55 60 

Phe Arg Gin He Gin Vai Leu Giu Pro Leu Asn Leu Leu He Thr He 
65 70 75 80 

Ser Gly His Lys Asn Arg Leu Arg Vai Tyr His Leu Thr Trp Leu Arg 
85 90 95 

Asn Lys He Leu Asn Asn Asp Pro Giu Ser Lys Arg Arg Gin Glu Glu 
100 105 110 

Met Leu Lys Thr Giu Giu Aia Cys Lys Aia He Asp Lys Leu Thr Giy 
115 120 125 
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Cys Glu His Phe Ser Val Leu Gin His Glu Glu Thr Thr Tyr lie Ala 
130 135 140 

lie Ala Leu Lys Ser Ser lie His Leu Tyr Ala Trp Ala Pro Lys Ser 
145 150 155 160 

Phe Asp Glu Ser Thr Ala lie Lys Val Phe Pro Thr Leu Asp His Lys 
165 170 175 

Pro Val Thr Val Asp Leu Ala lie Gly Ser Glu Lys Arg Leu Lys lie 
180 185 190 

Phe Phe Ser Ser Ala Asp Gly Tyr His Leu lie Asp Ala Glu Ser Glu 
195 200 205 

Val Met Ser Asp Val Thr Leu Pro Lys Asn Pro Leu Glu lie lie He 
210 215 220 

Pro Gin Asn. Ile lie .He Leu Pro. Asp Cys Leu Gly He Gly Met Met 
225 230 235 240 

Leu Thr Phe Asn Ala Glu Ala Leu Ser Val Glu Ala Asn Glu Gin Leu 
245 250 255 

Phe Lys Lys He Leu Glu Met Trp Lys Asp He Pro Ser Ser He Ala 
260 265 270 

Phe Giu Cys Thr Gin Arg Thr Thr Gly Trp Gly Gin Lys Ala He Glu 
275 280 285 

Val Arg Ser Leu Gin Ser Arg Val Leu Glu Ser Glu Leu Lys Arg Arg 
290 295 300 



Ser He Lys Lys Leu Arg Phe Leu Cys Thr Arg Gly Asp Lys Leu Phe 
305 310 315 320 

Phe Thr Ser Thr Leu Arg Asn His His Ser Arg Val Tyr Phe Met Thr 
325 330 335 

Leu Gly Lys Leu Glu Giu Leu Gin Ser A.sn Tyr Asp Val 
340 345 



<210> 17 
<211> 4023 
<212> DNA 

<213> Mammalian (Human) KHS2 
<400> 17 

ccgccatgaa ccccggcttc gatttgtccc gccggaaccc gcaggaggac ttcgagcrga 60 
ttcagcgcat cggcagcggc acctacggcg acgrctacaa ggcacggaat gttaacactg 120 
gtqaactagc agcaartaaa gtaaraaaar rggaaccagg agaagacttt gcagrtgngc 180 
agcaagaaat tatcargatg aaagactgca aacacccaaa tactgttgct tattttggaa 240 
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gctatctcag gcgagataag ctttggattr gcatggagtt ttgtggaggt ggttcrttac 300 

aggatattta rcacgtaact ggacctctgt cagaactgca aattgcatat gttagcagag 360 

aaacactgca gggattatat tatcctcaca gtaaaggaaa aatgcacaga gacataaagg 420 

gagctaacat tctattaacg gataatggtc atgrgaaatt ggctgatttt ggagratctg 480 

cacagataac agctacaatt gccaaacgga agtctttcat tggcacacca tattggatgg 540 

ccccagaagt tgcagctgtr gagaggaagg ggggttacaa tcaactctgt gatctctggg 600 

cagtgggaar cacrgccara gaacttgcag agcttcagcc tcctatgttt gacttacacc 660 

caatgagagc attiatttcta acgacaaaaa gcaattttca gccrcctaaa ctaaaggata 720 

aaatigaaatg gtcaaatagt tttcatcact ttgtgaaaat ggcacttacc aaaaarccga 780 

aaaaaagacc tactgctgaa aaattattac agcatccttt tgtaacacaa catttgacac 840 

ggtctttggc aatcgagctg ttggataaag taaataatcc agatcattcc acttaccatg 900 

atttcgatga tgatgatcct gagcctcttg ttgctgtacc acatagaatt cactcaacaa 960 

gragaaacgn gagagaagaa aaaacacgct cagagataac ctttggccaa gtgaaatttg 1020 

atccacccrt aagaaaggag acagaaccac atcatgaact tcccgacagt gatggrtttt 1080 

tggacagttc agaagaaata tactacactg caagatctaa tctggatctg caactggaat 1140 

atggacaagg acaccaaggt ggttactttt taggtgcaaa caagagtctt ctcaagtctg 1200 

ttgaagaaga antgcatcag cgaggacacg tcgcacattt agaagatgar gaaggagatg 1260 

argatgaatc raaacactca actctgaaag caaaaattcc acctcctttg ccaccaaagc 1320 

craagcccat cttcaracca caggaaatgc attctactga ggatgaaaat caaggaacaa 1380 

tcaagagatg tcccatgtca gggagcccag caaagccatc ccaagttcca cctagaccac 1440 

cacctcccag attaccccca cacaaacctg ttgccttagg aaatggaatg agctccttcc 1500 

agtcaaatgg tgaacgagat ggctcattat gtcaacaaca gaatgaacat agaggcacaa 1560 

acctttcaag aaaagaaaag aaagatgtac caaagcctat tagtaatggt cttcctccaa 1620 

cacctaaagt gcatatgggt gcatgttttt caaaagtttt taatgggtgt cccttgaaaa 1680 

ttcactgtgc atcatcatgg ataaacccag atacaagaga tcagtacttg atatttggtg 1740 

ccgaagaagg gatttatacc ctcaatctta atgaacttca tgaaacatca atggaacagc 1800 

tattccctcg aaggtgtaca tggttgtatg taatgaacaa ttgcttgcta tcaatatctg I860 

gtaaagcttc tcagctttat tcccataatt taccagggct ttttgattat gcaagacaaa 1920 

tgcaaaagtt acctgttgct attccagcac acaaactccc tgacagaata ctgccaagga 1980 

aattttctgt atcagcaaaa atccctgaaa ccaaatggtg ccagaagtgt tgtgttgtaa 2040 

gaaatcctta cacgggccat aaatacctat gtggagcact tcagactagc attgttctat 2100 

tagaatgggt tgaaccaatg cagaaattta tgttaattaa gcacatagat tttcctatac 2160 

catgtccact tagaacgttt gaaatgctgg tagttcctga acaggagtac cctttagttt 2220 

gtgttggtgt cagtagaggt agagacttca accaagtggt tcgatttgag acggtcaatc 2280 

caaattctac ctcttcatgg tttacagaat cagatacccc acagacaaat gttactcatg 2340 

taacccaact ggagagagat accatccttg tatgcttgga ctgttgtata aaaatagtaa 2400 

atctccaagg aagattaaaa cctagcagga aattgtcatc agaactcacc tttgatttcc 2460 

agattgaatc aatagtgtgc ccacaagaca gtgtgctagc tttctggaaa catggaatgc 2520 

aaggtagaag ttttagatct aatgaggtaa cacaagaaat ttcagatagc acaagaattt 2580 

tcaggctgct tggatctgac agggtcgtgg ttttggaaag taggccaact gataacccca 2640 

cagcaaatag caatttgtac atcctggcgg gtcatgaaaa cagttactga gaattgttgt 2700 

gctttgacag ttaactctag aaagaaagaa cactaccact gcaacattaa tggatgcttg 2760 

aagctgtaca aaagctgcag taacctgtct tcagttactt tgtaatttat tgtggcatga 2820 

garaagatgg ggaaaatttt gttttaagtg gtatggatat atttagcata ttgaaccaca 2880 

caagtgctta attcactgtt atgtaatctt tgtacatata ggcagtattt tttcTigtgaa 2940 

acttcatatt gctgaagaca tacactaaga acttatgtag ataatgtact tttacgagat 3000 

gtacaagtaa gtgrcttatc rgtacagatg taaargttga tgaaaatgca attggggtta 3060 

atattttaag aatrctttag tatattcrtg ggtgtggcta tattacaaaa tgggatgctg 3120 

gcaatgaaac aaracattta acactatngt attrttatta tatgtaartt agtaatatga 3180 

atataaatct tgtaactttt aaaattgtaa tggaggctgt aatcatttta taatcttttt 3240 

aartttaatg caagtacact ggtgtttata ttcgcacaaa gtattgatat gtgatgtatt 3300 

aagtcacaaa agtaagctgt gacattgtct a-aagcattt ggctccacaa atgtatttgg 3360 

atcgttttcr atgrgaagca aaccaattat aattaaccac atgttgtagt aactggtctr 3420 

tttatattta agcagaatcc tgtaagattg cttgtctttg cttaaaaaca acacctttga 3480 

acattitttga atcacagaat agcggtacca tgatagaata ctgcaattgt ggtcagaatt 3540 

acagtatgca caaagaatra attagcatta ttaaagagtc crcaccaaac atttcatatg 3600 
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arcacactga 
tttacttttg 
atacatggcc 
tcacaggaac 
atataccagg 
tttaatgtga 
ttaatgacaa 
ttt 



agaactgtaa 
ttggccttat 
aaaataat t t 
ttagttttgg 
ttttagcaaa 
gtatatacac 
tggttttaaa 



cattccatag 
cttatgatcc 
agrtactacc 
tttaagtcat 
atgcacact t 
taagaacaaa 
atatgttcct 



agrgaagcgg 
ttttcatatt 
tcatacaaac 
ttrtgattgc 
ttggctcttt 
ctaaattgtg 
gattgtacat 



ttcaaatttc 
tcttttgact 
aatataatgg 
ttttttccaa 
.ttggtatatg 
atttatgatc 
attataaaat 



tcttggaatt 
tagagtatta 
t tactacaca 
tggaatatgt 
ttctttatat 
ttcatttatt 
aaacatgttt 



3660 
3720 
3780 
3840 
3900 
3960 
4020 
4023 



<210> 18 
<211> 894 
<212> PRT 

<213> Mammalian (Human) KHS2 
<400> 18 

Met Asn Pro. Gly Phe Asp Leu Ser Arg Arg Asn Pro Gin Glu Aso Phe 
15 10 15' 

Glu Leu lie Gin Arg lie Gly Ser Gly Thr Tyr Gly Asp Val Tyr Lys 
20 25 30 

Ala Arg Asn Val Asn Thr Gly Glu Leu Ala Ala lie Lys Val lie Lys 
35 40 45 

Le.u Glu Pro Gly Glu Asp Phe Ala Val Val Gin Gin Glu lie- lie Met 
50 55 60 

Met Lys Asp Cys Lys His Pro Asn lie Val Ala Tyr Phe Gly Ser Tyr 
65 70 75 80 

Leu Arc Arg Asp Lys Leu Trp lie Cys Met Glu Phe Cys Gly Gly Gly 
85 90 95 

Ser Leu Gin Asp He Tyr His Val Thr Gly Pro Leu Ser Glu Leu Gin 
100 105 110 

He Ala Tyr Val Ser Arg Glu Thr Leu Gin Gly Leu Tyr Tyr Leu His 
115 120 125 

Ser Lys Gly Lys Met His Arg Asp He Lys Gly Ala Asn He Leu Leu 
130 ^ 135 140 

Thr Asp Asn Gly His Val Lys Leu Ala Asp Phe Gly Val Ser Ala Gin 

145 150 155 

He Thr Ala Thr He Ala Lys Arg Lys Ser Phe He Gly Thr Pro Tyr 
165 170 175 

Trp Met Ala Pro Glu Val Ala Ala Val Glu Arg Lys Gly Gly Tyr Asn 
180 185 190 

Gin Leu Cys Asp Leu Trp Ala Val Gly He Thr Ala He Glu Leu Ala 
155 200 205 
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Glu Leu Gin Pro Pro Met Phe Asp Leu His Pro Met Arg Ala Leu Phe 
210 215 220 

Leu Met Thr Lys Ser Asn Phe Gin Pro Pro Lys Leu Lys Asp Lys Met 
225 230 235 240 

Lys Trp Ser Asn Ser Phe His His Phe Val Lys Met Ala Leu Thr Lys 
245 250 255 

Asn Pro Lys Lys Arg Pro Thr Ala Glu Lys Leu Leu Gin His Pro Phe 
260 265 270 

Val Thr Gin His Leu Thr Arg Ser Leu Ala lie Glu Leu Leu Asp Lys 
275 280 285 

Val Asn Asn Pro Asp His Ser Thr Tyr His Asp Phe Asd Asp Asp Asp 
290 295 300 * 

Pro Glu Pro Leu Val Ala Val Pro His Arg He His Ser Thr Ser Arg 

310 315 320 

Asn Val Arg Glu Glu Lys Thr Arg Ser Glu He Thr Phe Gly Gin Val 
325 330 335 

Lys Phe Asp Pro Pro Leu Arg Lys Glu Thr Glu Pro His His Glu Leu 
340 345 350 

Pro Asp Ser Asp Gly Phe Leu Asp Ser Ser Glu Glu He Tyr Tyr Thr 
355 360 365 

Ala Arg Ser Asn Leu Asp Leu Gin Leu Glu Tyr Gly Gin Gly His Gin 
370 375 380 

Gly Gly Tyr Phe Leu Gly Ala Asn Lvs Ser Leu Leu Lvs Ser Val Glu 
385 390 395 ' 400 

Glu Glu Leu His Gin Arg Gly His Val Ala His Leu Glu Asp Asp Glu 
405 410 415 

Gly Asp Asp Asp Glu Ser Lys His Ser Thr Leu Lys Ala Lys He Pro 
420 425 430 

Pro Pro Leu Pro Pro Lys Pro Lys Ser He Phe He Pro Gin Glu Met 
435 440 445 

His Ser Thr Glu Asp Glu Asn Gin Gly Thr He Lys Arg Cys Pro Met 

455 460 

Ser Gly Ser Pro Ala Lys Pro Ser Gin Val Pro Pro Arg Pro Pro Pro 

470 475 480 

Pro Arg Leu Pro Pro His Lys Pro Val Ala Leu Gly Asn Gly Met Ser 
485 490 ^ 495 

Ser Phe Gin Leu Asn Gly Glu Arg Asp Gly Ser Leu Cys Gin Gin Gin 
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500 505 510 

Asn Giu His Arg Gly Thr Asn Leu Ser Arg Lys Giu Lvs Lys Asp Val 
515 520 525 

Pro Lys Pro lie Ser Asn Gly Leu Pro Pro Thr Pro Lys Val His Met 
530 535 540 

Gly Ala Cys Phe Ser Lys Val Phe Asn Gly Cys Pro Leu Lys lie His 

550 555 560 

Cys Ala Ser Ser Trp He Asn Pro Asp Thr Arg Asp Gin Tyr Leu He 
565 570 575 

Phe Gly Ala Giu Giu Gly He Tyr Thr Leu Asn Leu Asn Giu Leu His 
580 585 590 

Giu Thr Ser Met GJ u Gin Leu Phe Pro Arg Arg Cys Thr Trp Leu Tyr 
595 600 605 

Val Met Asn Asn Cys Leu Leu Ser He Ser Gly Lys Ala Ser Gin Leu 
610 615 620 

Tyr Ser His Asn Leu Pro Gly Leu Phe Asp Tyr Ala Arg Gin Met Gin 
625 630 635 640 

Lys Leu Pro Val Ala He Pro Ala His Lys Leu Pro Asp Arg He Leu 
645 650 655 

Pro Arg Lys Phe Ser Val Ser Ala Lys He Pro Giu Thr Lys Trp Cys 
660 665 670 

Gin Lys Cys Cys Val Val Arg Asn Pro Tyr Thr Gly His Lys Tyr Leu 
675 680 685 

Cys Gly Ala Leu Gin Thr Ser He Val Leu Leu Giu Tro Val Giu Pro 
690 695 700 

Met Gin Lys Phe Met Leu He Lys His He Asp Phe Pro He Pro Cys 
■^05 "710 715 720 

Pro Leu Arg Met Phe Giu Met Leu Val Val Pro Giu Gin Giu Tyr Pro 
■725 730 735 

Leu Val Cys Val Gly Val Ser Arg Gly Arg Asp Phe Asn Gin Val Val 
740 745 750 

Arg Phe Giu Thr Val Asn Pro Asn Ser Thr Ser Ser Trp Phe Thr Giu 
755 760 765 

Ser Asp Thr Pro Gin Thr Asn Val Thr His Val Thr Gin Leu Giu Arg 
770 775 780 

Asp Thr lie Leu Val Cys Leu Asp Cys Cys He Lvs He Val Asn Leu 

790 795 " 800 • 
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Gin Giy Arg Leu Lys Ser Ser Arg Lys Leu Ser Ser Glu Leu Thr Phe 
805 810 815 

Asp Phe Gin He Glu Ser He Val Cys Leu Gin Asp Ser Val Leu Ala 
820 825 830 

Phe Trp Lys His Gly Met Gin Gly Arg Ser Phe Arg Ser Asn Glu Val 
835 840 845 

Thr Gin Glu He Ser Asp Ser Thr Arg lie Phe Arg Leu Leu Gly Ser 
850 855 860 

Asp Arg Val Val Val Leu Glu Ser Arg Pro Thr Asp Asn Pro Thr Ala 
865 870 875 880 

Asn Ser Asn Leu Tyr He Leu Ala Gly His Glu Asn Ser Tyr 



<210> 19 
<211> 4196 
<212> DNA 

<213> Mammalian (Human) SOLUl 
<400> 19 

gggagggtcc ttgtggcgcc gggcggcggg gtcctgcgtg gagagtggga cgcaacgccg 60 

agaccgcgag cagaggctgc gcacagccgg atccggcact cagcgaccgg acccaaggat 120 

ccgccgggga acaagccaca ggagagcgac tcaggaacaa gtgtgggaga ggaagcggcg 180 

gcggcggcgc cgggcccggg ggtggtgaca gcaggtctga ggttgcatca taaatacaaa 240 

ggactgaagt tataaaagag aaaagagaag tttgctgcra aaatgaatct gagcaatatg 300 

gaatattttg tgccacacac aaaaaggtac tgaagattta ccccccaaaa aaaattgtca 360 

atgagaaata aagctaactg atatcaaaaa gcagagcctg ctctactggc catcatgcgt 420 

aaaggggtgc tgaaggaccc agagattgac gatctattct acaaagatga tcctgaggaa 480 

ctttttatcg gtttgcatga aattggacat ggaagttttg gagcagttta ttttgctaca 540 

aatgctcaca ccaargaggt ggtggcaatt aagaagatgt cctatagtgg gaagcagacc 600 

catgagaaat ggcaagatat tcttaaggaa gttaaatttt tacgacaatt gaagcatcct 660 

aatactattg agtacaaagg ctgttacttg aaagaacaca ctgcttggtt ggtgatggaa 720 

tattgcttag gctcagcctc tgatttatta gaagttcata aaaaaccact tcaggaagtg 780 

gagatcgctg ccattactca cggagccttg catggactag cctacctaca ttctcatgca 840 

ttgattcata gggatattaa agcaggaaat attcttctaa cagagccagg tcaggtaaaa 900 

ctagctgatt ttggatctgc ttcaatggct tctcctgcca actccttcgt gggcacaccr 960 

tactggatgg ctccagaggt gatcttagct arggatgaag gacagtatga tgggaaagtt 1020 

gatatttggt cacttggcat cacttgtatt gaattggcgg aacggaagcc gcccctttrc 1080 

aacatgaa-g caatgagtgc cttatatcac attgcccaga atgactcccc aacattacaa 1140 

tctaatgaat ggacagactc ctttaggaga tttgctgatr actgcttgca gaaaatacct 1200 

caggaaaggc caacatcagc agaacuatta aggcatgact ttgttcgacg agaccggcca 1260 

ctacgtgtcc tcattgacct catacagagg acaaaagatg cagttcgtga gctagataac 1320 

ctacagtacc gaaaaargaa aaaaatacrt trccaagaga cacggaatgg acccrtgaat 1380 

gagtcacagg aggatgagga agacagtgaa carggaacca gccrgaacag ggaaatggac 1440 

agcctgggca gcaaccartc catuccaagc atgcccgtga -gcacaggcag ccagagcagc 1500 

agtgtgaaca gcatgcagga agtcanggac gagagcagtt ccgaacttgr catgatgcac 1560 

gatgacgaaa gcacaatcaa trccagctcc tccgtcgrgc araagaaaga tcatgtattc 1620 
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acaaggga tg 
cagagccagg 
ttggttacac 
ggttataagc 
aaggctgaga 
aactcgr cca 
gcaaaggtag 
aaagatttga 
ataaaagagg 
tccaaacata 
caacagagac 
cggcacgagg 
gagatggagc 
cagctgcaca 
ctggaaaacc 
atgggacttc 
t trcaggaca 
gaagccactc 
agaaaacctg 
caagcgttac 
cagcaggaaa 
caacatgaac 
gagcagaaga 
aacctattgg 
ggatttggga 
ttttgccatt 
aaccacattc 
acatcgtgtc 
cgtgtcaact 
cagtattgat 
aataaaccaa 
aatctgtgaa 
tatccgcagc 
gtatttattc 
ttttacagaa 
acggggggcg 
aacttgtctt 
gaatacactt 
tattttgtgt 
ctgtttctct 
tccagttcag 
ttatctccct 
tt tcaaacrg 



aggcgggcca 
ccctccacta 
gacagatcca 
ggatgcggcg 
tggacgagca 
tcgagctgga 
ctgcagcaga 
caactttctt 
aaargaatga 
aagagaacrt 
tgtactacga 
tggagcagca 
atgccatgct 
cgttacagaa 
agctggagta 
ggcaacagcc 
ctrgcaaagt 
caaagaatga 
ccattttggc 
ggctagatga 
tggagctgct 
gtgagctcca 
ttgaagagga 
aaaggcaaga 
atttggttac 
tacaaaaaaa 
cccattttaa 
ggactagtgc 
ttcagtgctg 
gtgattcaaa 
caacaaaaaa 
agtaaagcaa 
cttacacctt 
atcaggaata 
actaattgtg 
ggtggggtga 
cagcatatrc 
aaggagctca 
atagaaggaa 
gatcaaattt 
tatttggagt 
tccccttcct 
tgtattttct 



cggcgatccc 
ccggaacaga 
tgagcatgag 
ccagcaccag 
ccgcctcaag 
gaagccggcc 
tgagaagaag 
agaaagccag 
ggaccatagc 
gcagcacaca 
caaaaattgt 
gaacattcgg 
aatccggcac 
gctacgcatg 
caataagagg 
aaaaaactta 
acagaccaaa 
gcacaaaaca 
agagcagtat 
ggctcaagaa 
caacgcctac 
gaagctagag 
gctggctgcc 
gcgagagatt 
attagatttt 
aaaaaaaaaa 
cgggcgttgc 
ctgtrtattc 
gccaaaacct 
atgtttcagt 
agtggatgta 
aaaa raataa 
aactagctgc 
aaaaaaacaa 
atgtacatat 
agagatcttg 
t gattgtatc 
cagaacccca 
tttaaggaga 
ttttacttcc 
tcaacactgt 
aagtcccatt 
tacaataaaa 



aggcctgagc 
gagcgctttg 
caggagaacg 
aagcagctga 
ctacagaagg 
aagaagcaag 
ttccagcaac 
aagaagcagt 
acacccaaga 
caggctgaag 
cgtttcttca 
gaggaactaa 
gacgagtcca 
gatctgatcc 
cgagaaagag 
aaggccatgg 
cagtataaag 
atcttaaaga 
gaacagagta 
gcagaatgcc 
cagagcaaaa 
cagagagtgt 
cttcagaagg 
gaaacttttg 
cctaaggagg 
agaaaacaga 
tctcactctc 
ttactcca tc 
ggccgtctct 
gaaaactttg 
tattgcttta 
taataaatgc 
tgcattattt 
agttttatna 
cagtggtgac 
tgatttttaa 
ataarcattt 
gtagcacaaa 
ggtattactt 
tcctcctgtt 
ctctcaatca 
tcttggtcat 
aatgatgaaa 



cgcggcctac 
ccacgatcaa 
agttgcggga 
tcgccccgga 
aggtggagac 
cggctatcat 
agatcttggc 
ataagatttg 
aagagaagca 
aggaagccca 
agcggaaaat 
ataaaaagag 
cccgagagct 
gtttacagca 
aactgcacag 
aaatgcaaat 
cactcaagaa 
cactgaaaga 
taaargaaat 
aggccttgag 
tcaagatgca 
ctctgcgcag 
aacgcagcga 
acatggagag 
actacagat g 
aaaaaattca 
tctctctctt 

^ggggccccc 

tctattcaca 
gagacagttt 
agcaatcact 
caagggggag 
tattttattt 
aagattgaaa 
atattattac 
gaacctgctg 
tctgctgttg 
ttgggctttg 
attttcatat 
cctccccacc 
gatcatcttg 
aaatattgca 
aaaaaaaaaa 



ccagtcagtt 
atcagcatcr 
acagatgtca 
gaacaagctg 
gcatgccaac 
agaaaaggag 
ccagcagaag 
taaggaaaaa 
agagcggatc 
ccttctcact 
aatgatcaag 
gacccagaag 
agagracagg 
ccagacggaa 
aaagcatgtc 
taaaaaacag 
tcaccagttg 
tgagcagaca 
gatggcctCu 
gctacagctc 
aacagaggca 
agcacacctt 
gagaataaag 
cctcagaatg 
agartaaatt 
gaccctgcaa 
actcttactg 
tt cct ccccc 
gtacacgtca 
taacaaaacc 
cattaccacc 
agagacacaa 
tatttttttg 
attrgataca 
ttttttgggg 
gcaagagttt 
cagaggatgt 
gcaaatcgtg 
tgtattttaa 
tccctccttt 
atctttttct 
ttat tcacac 
aaaaaa 



1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4196 



<210> 20 
<211> 3824 
<212> DNA 

<213> Mammalian (Human) SULU3 
<400> 20 



tattgaactg gcggaacgga. agcctccttt atttaatacg aatgcaatga gtgcctta 
rcacaragcc caaaatgaar cccctacact acagzczaat gaatggtcrg attatttt 
caacrttgta gattcttgcc tccagaaaat ccctcaagat cgaccracat cagaggaact 



ata 
eg 



60 
120 
180 
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tttaaagcac atatttgtrc ttcgggagcg ccctgaaacc gtgttaacag atctcattca 
gaggacaaag gatgcagtaa gagagctgga caatctgcag tatcgaaaga tgaagaaact 
ccttttccag gaggcacata atggaccagc agtagaagca caggaagaag aagaggaaca 
agatcacggt gttggccgga caggaacagt taatagtgtt ggaagtaatc aatccattcc 
cagcatgtcc atcagtgcca gcagccaaag cagtagtgtt aacagtctrc cagatgtctc 
agatgacaag agtgagctag acatgatgga gggagaccac acagtgatgt ctaacagttc 
tgttatccat tnaaaaccag aggaagaaaa ttacagagaa gagggagatc ctagaacaag 
agcatcagar ccacaatctc caccccaagt atctcgtcac aaatcacacc atcgtaatcg 
agaacactrt gctactatac ggacagcatc actggttacg aggcaaatgc aagaacatga 
gcaggactct gagcttagag aacaaatgtc tggctataag cgaatgaggc gacaacatca 
aaagcaactg atgactctgg aaaacaagct aaaggctgag atggatgaac atcgcctcag 
attagacaaa gatcttgaaa ctcagcgtaa caattttgct gcagaaatgg agaaacttat 
caagaaacac caggctgcca cggagaaaga ggctaaagtg atgtccaatg aagagaaaaa 
atttcagcaa catattcagg cccaacagaa gaaagaactg aatagttttc tcgagtccca 
gaaaagagag tataaacttc gaaaagagca gcttaaagag gagctaaatg aaaaccagag 
tacccccaaa aaagaaaaac aggagtggct ttcaaagcag aaggagaata tacagcattt 
ccaagcagaa gaagaagcta accttcttcg acgtcaaaga caatacctag agctggaatg 
ccgtcgcttc aagagaagaa tgttacttgg gcgtcataac ttagagcagg accttgtcag 
ggaggagtta aa.caaaagac agactcagaa ggacttagag catgccatgc tactccgaca 
gcatgaatct atgcaagaac cggagttccg ccacctcaac acaatrcaga agatgcgctg 
tgagttgaTic agactacagc atcaaactga gctcactaac cagctggaat ataataagcg 
aagagaacga gaactaagac gaaagcatgt catggaagtt cgacaacagc ctaagagttt 
gaagtctaaa gaactccaaa taaaaaagca gtttcaggat acctgcaaaa tccaaaccag 
acagtacaaa gcattaagaa atcacctgct ggagactaca ccaaagagtg agcacaaagc 
tgttctgaaa cggctcaagg aggaacagac ccggaaatta gctatcttgg ctgagcagta 
tgatcacagc attaatgaaa tgctctccac acaagccctg cgtttggatg aagcacagga 
agcagagtgc caggttttga agatgcagct gcagcaggaa ctggagctgt tgaatgcgta 
tcagagcaaa atcaagatgc aagctgaggc acaacatgat cgagagcttc gcgagcttga 
acagagggtc tccctccgga gggcactctt agaacaaaag attgaagaag agatgttggc 
tttgcagaat gagcgcacag aacgaatacg aagccrgttg gaacgtcaag ccagagagat 
tgaagctttt gactctgaaa gcatgagacc aggttttagt aatatggtcc tttctaatct 
ctcccctgag gcatrcagcc acagctaccc gggagcttct ggttggtcac acaaccctac 
tgggggtcca ggacctcact ggggrcatcc catgggtggc ccaccacaag cttggggcca 
tccaatgcaa ggtggacccc agccargggg tcacccttca gggccaatgc aaggggtacc 
tcgaggtagc agtatgggag tccgcaatag cccccaggct ctgaggcgga cagcttctgg 
gggacggacg gagcagggca tgagcagaag cacgagtgtc acttcacaaa tatccaatgg 
gtcacacatg tcttatacat aacttaataa ttgagagtgg caattccgct ggagctgtct 
gccaaaagaa actgcctaca gacatcatca cagcagcctc ctcacttggg tactacagtg 
tggaagctga gtgcatatgg tatattttat tcatttttgt aaagcgttct gttttgtgtt 
tactaattgg gatgtcatag tacttggctg ccgggtttgt .ttgtttttgg ggaaattttg 
aaaagtggag ttgatattaa aaataaatgt gtatgtgtgt acatatatat acacacacat 
acacatatat tatgcatgtg gtgaaaagaa ttggctagat aggggatttt tctgaacact 
gcaaaaatag aacgtagcaa aatggcttca gttatcactt ttgggtgtct gtatcctaag 
aagtttctga aaagatctaa agccttttta tcccatatcc caaattctta tgagccactc 
acagcaggca gcatang-rg aaataagtta ttactggtac acacctgcat tgcctcacca 
gtgtatrrat ttgttattaa attgatctga cttctcagcc tcatttggac taaaaaaaga 
aagcagaaat ccatgaacac attgcttctc ggccrtttgg ctaagatcaa gtgtagaaat 
ccatgaacac taaaggactr catngatttt ttcagagagt agaaaacaac ttagtttttc 
ttttttccTig aacgcgtcat aggcttgrga gtgatttttg tccattcaa:: tgtgccttct 
ttgtattatg acaagatggg ggtacttaag gagatcacaa gttgtgtgag gattgcatta 
acaaacctat gagcct-tcaa tggggaagac cagaagggtg agaggggccc tgaaagttca 
tatggtgggt atgtcccgca gcagagtgag gagatgaagc ttacgtgtcc tgacgttttg 
ttgcttacac tg-gatatct catcctagct aagctcnata atgcccaaga ccccaaacag 
tacttttact ttgtttgtac aaaaacaaag acatatagcc aatacaaatc aaatgccgga 
ggtgtttgar gccatatttg caaatrgcca tctattgaaa ttctcgtcac actacataga 
cataattgtt atctcctuct ggcrtatgrg attttctgtt tacaagtaga atagccaatt 



240 
3C0 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
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atttaaatgt ttagttgcca cagtgaacca ggagtcactg agccaatgac tttaccagcr 3600 

gctgactaat cttcatcacc actgtagatt rtgctgcatg tgcaggtcct ctatttttaa 3660 

ttgcngttrt cgttgctgca gtacrttaca aactitctagr tcgttgagac ttagtgacca 3720 

tttggcatca agttaacatc acacaatagg aaacaccact tccacaagtc tcaagcctca 3780 

gtgctaaagt actactgaaa aggaactagg aagtttggcc aatt 3824 



<210> 21 
<211> 2249 
<212> DNA 

<213> Mammalian (Murine) SULU3 
<400> 21 

gcaggatgcc atcaactaac agagcaggca gtccaaagga ccctgaaatt gcagagctct 60 

tcttcaaaga agatccggaa aagctcttca cagatctcag agaaatcggc catgggagct 120 

ttggagcagt atattttgca cgagatgtgc gtactaatga agtggtggcc atcaagaaaa 180 

tgrcttatag -tggaaagcag tctactgaga aatggcagga tattattaag gaagtcaagt 240 

ttctacaaag aataaaacat cccaacagta tagaatacaa aggctgctat rtacgtgaac 300 

acacagcatg gcttgtaatg gaatattgtt taggatctgc ttcagattta ttagaagttc 360 

ataaaaagcc attacaagaa gtggaaatag cagcaattac acatggtgct ctccagggac 420 

tagcttattt acattctcat accatgatcc atagagatat caaagcagga aatatccttc 480 
tgacagaacc aggccaagtg aaacttgcrg actttggatc tgcttccatg gctrcccctg ' 540 

ccaattcttt tgtgggaaca ccatattgga tggccccaga agtaatttta gccatggatg ' 600 

aaggacagta tgatggcaaa gttgatgtat ggtctcttgg aataacgtgt attgaattag 660 

ccgagaggaa gcctccttta tttaatatga atgcaatgag tgccttatat cacatagccc 720 

aaaatgaatc ccctacacta caatctaata tgaatgattc ttgcctccag aaaatccctc 780 

aagatcgccc tacatcagag gaacttttaa agcacatgtt tgttcttcga gagcgccctg ■840 

aaacagtgtt aatagatctt attcaaagga caaaggatgc agtaagagag ctggacaatc 900 

tgcagtatcg aaagatgaag aaactccttt tccaggaggc acataatggg ccagcggtag 960 

aagcacagga agaagaggag gagcaagatc arggtgttgg ccgaacagga acagtgaata 1020 

gtgttggaag caatcagtct atccctagta tgtctatcag tgccagcagt caaagcagca 1080 
gtgttaatag tcttccagat gcatcagatg acaagagtga gctagacatg atggagggag ' 1140 

accatacagt gatgtctaac agttctgtca tccacttaaa acctgaggag gaaaattacc 1200 

aggaagaagg agatcctaga acaagagcat cagacccaca gtctccccct caggtgtctc 1260 

gtcacaagtc acattatcgt aatagagaac actttgcaac catacgaaca gcatcactgg 1320 

ttacaagaca gatgcaagaa catgagcagg actctgaact tagagaacag atgtctggtt 1380 

ataagcggat gaggcgacag catcaaaagc agctgatgac gctggaaaat aaactgaagg 1440 

cagagacgga cgaacatcgg ctcagattag acaaagatct tgaaactcag cgtaacaatt 1500 

tcgctgcaga aatggagaaa ctrattaaga aacaccaagc tgctatggaa aaagaggcta 1560 

aagtgatggc caatgaggag aaaaaattcc agcaacacat tcaggctcaa cagaaaaaag 1620 

aactgaatag ctttttggag tctcaaaaaa gagaatataa acttcgcaaa gagcagctta 1680 
aggaggagct gaatgaaaac cagagcacac ctaaaaaaga aaagcaggaa tggctttcaa • 1740 

aqcagaagga gaatatacag cattttcagg cagaagaaga agctaatctt cttcgacgtc 1800 

aaaggcagta tctagagcca gaatgtcgtc gcttcaaaag aagaatgtta cttgggcgac 1860 

ataacttgga acaggacctt gtcagggagg agttaaacaa aaggcagact caaaaggact 1920 

tggaacatgc aatgctartg cgacagcatg aatcaatgca agaactggag tttcgccatc 1980 

tcaacactat tcagaagatg cgctgtgagt tgatcagacr gcagcatcaa actgagctca 2040 

ctaaccagct agagtacaat aagagaaggg aacgggaact gaggcgaaaa catgtcatgg 2100 

aagttcgaca acaacctaag agtctgaagt ccaaagaact ccaaataaaa aagcagtttc 2160 

aggatacctg caaaattcaa accaqacagt acaaagcatt aaggaatcac ctactggaga 2220 

ctacaccaaa gaatgagcac aaagcaatc 2249 

<210> 22 
<211> 898 
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<212> PRT 

<213> Mammalian (Human) SULUl 
<400> 22 

Met Arg Lys Giy Val Leu Lys Asp. Pro Glu lie Asp Asd Leu Phe Tyr 
1 5 10 ' 15 

Lys Asp Asp Pro Glu Glu Leu Phe lie Gly Leu His Glu He Gly His 
20 25 30 

Gly Ser Phe Gly Ala Val Tyr Phe Ala Thr Asn Ala His Thr Asn Glu 
35 40 45 

Val Val Ala He Lys Lys Met Ser Tyr Ser Gly Lys Gin Thr His Glu 
50 55 60 

Lys Trp Gin Asp lie Leu Lys Glu Val Lys Phe Leu Arg Gin Leu Lys 

.... 70 . . 75 80 

His Pro Asn Thr He Glu Tyr Lys Gly Cys Tyr Leu Lys Glu His Thr 
85 90 95 

Ala Trp Leu Val Met Glu Tyr Cys Leu Gly Ser Ala Ser Asp Leu Leu 
100 105 110 

Glu Val His Lys Lys Pro Leu Gin Glu Val Glu He Ala Ala He Thr 

120 125 

His Gly Ala Leu His Gly Leu Ala Tyr Leu His Ser His Ala Leu He 
130 135 

His Arg Asp He Lys Ala Gly Asn He Leu Leu Thr Glu Pro Gly Gin 

150 155 160 

Val Lys Leu Ala Asp Phe Gly Ser Ala Ser Met Ala Ser Pro Ala Asn 
165 170 175 

Ser Phe Val Gly Thr Pro Tyr Trp Met Ala Pro Glu Val He Leu Ala 
180 185 190 

Met Asp Glu Gly Gin Tyr Asp Gly Lys Val Aso He Trp Ser Leu Gly 
155 200 205 

He Thr Cys He Glu Leu Ala Glu Arg Lys Pro Pro Leu Phe Asn Met 
210 215 220 

Asn Ala Met Ser Ala Leu Tyr His He Ala Gin Asn Asp Ser Pro ^hr 
225 230 235 ,40 

Leu Gin Ser Asn Glu Trp Thr Asp Ser Phe Arg Arg Phe Val Aso Tyr 
245 250 255 

Cys Leu Gin Lys He Pro Gin Glu Arg Pro Thr Ser Ala Glu Leu Leu 
260 265 270 
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Arg His Asp Phe Vai Arg Arg Asp Arg Pro Leu Arg Val Leu lie Asp 
275 280 285 

Leu lie Gin Arg Thr Lys Asp Ala Val Arg Glu Leu Asp Asn Leu Gin 
290 295 300 

Tyr Arg Lys Met Lys Lys lie Leu Phe Gin Giu Thr Arg Asn Giy Pro 
305 310 315 320 

Leu Asn Glu Ser Gin Glu Asp Glu Glu Asp Ser Giu His Giy Thr Ser 
325 330 335 

Leu Asn Arg Glu Met Asp Ser Leu Giy Ser Asn His Ser lie Pro Ser 
340 345 350 

Met Ser Vai Ser Thr Giy Ser Gin Ser Ser Ser Val Asn Ser Met Gin 
355 360 365 

Giu Vai Met Asp Giu Sor Ser Ser Glu Leu Val Met Met His Asp Asp 
370 375 380 

Giu Ser Thr lie Asn Ser Ser Ser Ser Vai Val His Lys Lys Asp His 
385 390 395 400 

Vai Phe Thr Arg Asp Glu Ala Giy His Giy Asp Pro Arg Pro Glu Pro 
405 410 415 

Arg Pro Thr Gin Ser Val Gin Ser Gin Ala Leu His Tyr Arg Asn Arg 
420 425 430 

Glu Arg Phe Ala Thr lie Lys Ser Ala Ser Leu Val Thr Arg Gin lie 
435 440 445 

His Giu His Glu Gin Glu Asn Giu Leu Arg Glu Gin Met Ser Giy Tyr 
450 455 460 

Lys Arg Met Arg Arg Gin His Gin Lys Gin Leu lie Ala Leu Glu Asn 
465 470 475 480 

Lys Leu Lys Ala Glu Met Asp Giu His Arg Leu Lys Leu Gin Lys Giu 
485 490 495 

Vai Giu Thr His Ala Asn Asn Ser Ser He Giu Leu Glu Lys Leu Ala 
500 505 510 



Lys Lys Gin Val Ala 
515 

Asp Giu Lys Lys Phe 
530 

Leu Thr Thr Phe Leu 
545 

Glu Lys lie Lys Glu 



lie lie Glu Lys Glu 
520 

Gin Gin Gin He Leu 
535 

Glu Ser Gin Lys Lvs 
550 

Giu Met Asn Glu Asp 



Ala Lys Vai Ala Ala Ala 
525 

Ala Gin Gin Lys Lys Asp 
540 

Gin Tyr Lys lie Cys Lys 

555 560 

His Ser Thr Pro Lys Lys 
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S65 570 575 

Glu Lys Gin Glu Arg lie Ser Lys His Lys Glu Asn Leu Gin' His Thr 
580 585 590 

Gin Ala Glu-Glu Glu Ala His Leu Leu Thr Gin Gin Arg Leu Tyr Tyr 
595 600 605 

Asp Lys Asn Cys Arg Phe Phe Lys Arg Lys He Met He Lys Arg His 
610 615 620 

Glu Val Glu Gin Gin Asn He Arg Glu Glu Leu Asn Lys Lys Arg Thr 
625 630 635 640 

Gin Lys Glu Met Glu His Ala Met Leu He Arg His Asp Glu Ser Thr 
645 650 ~ 655 

Arg Glu Leu Glu Tyr Arg Gin Leu His Thr Leu Gin Lys Leu Arg Met 
6.60 665 670 

Asp Leu He Arg Leu Gin His Gin Thr Glu Leu Glu Asn Gin Leu Glu 
675 680 685 

Tyr Asn Lys Arg Arg Glu Arg Glu Leu His Arg Lys His Val Met Gly 
690 695 700 

Leu Arg Gin Gin Pro Lys Asn Leu Lys Ala Met Glu Met Gin He I,ys 
705 710 715 720 

Lys Gin Phe Gin Asp Thr Cys Lys Val Gin Thr Lys Gin Tyr Lys Ala 
725 730 735 

Leu Lys Asn His Gin Leu Glu Val Thr Pro Lys Asn Glu His Lys Thr 
740 745 750 

He Leu Lys Thr Leu Lys Asp Glu Gin Thr Arg Lys Leu Ala He Leu 
755 760 765 

Ala Glu Gin Tyr Glu Gin Ser He Asn Glu Met Met Ala Ser Gin Ala 
770 775 780 



Leu Arg Leu Asp Glu Ala 
785 ' 790 

Gin Leu Gin Gin Glu Met 
805 

Lys Met Gin Thr Glu Ala 
820 

Gin Arg Val Ser Leu Arg 
835 

Glu Leu Ala Ala Leu Gin 
850 



Gin Glu Ala Glu Cys 
795 

Glu Leu Leu Asn Ala 
810 

Gin His Giu Arg Glu 
825 

Arg Ala His Leu Glu 
840 

Lys Glu Arg Ser Glu 
855 



Gin Ala Leu Arg Leu 
800 

Tyr Gin Ser Lys He 
815 

Leu Gin Lys Leu Glu 
830 

Gin Lys He Glu Glu 
845 

Arg He Lys Asn Leu 
860 
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Leu Giu Arg Gin Giu Arg Giu lie Giu Thr Phe Asp Met Giu Ser Leu 
865 870 875 880 

Arg Met Giy Phe Giy Asn Leu Vai Thr Leu Asp Phe Pro Lys Giu Asp 
885 890 895 

Tyr Arg 



<2i0> 23 
<211> 786 
<2i2> PRT 

<2i3> Mammaiian (Human) SULU3 
<400> ,23 

lie Glu Leu Ala Giu Arg Lys Pro Pro Leu Phe Asn Met Asn Ala Met 
15 10 15 

Ser Ala Leu Tyr His lie Ala Gin Asn Glu Ser Pro Thr Leu Gin Ser 
20 25 30 

Asn Glu Trp Ser Asp Tyr Phe Arg Asn Phe Val Asp Ser Cys Leu Gin 
35 .40 45 

Lys lie Pro Gin Asp Arg Pro Thr Ser Glu Glu Leu Leu Lys His lie 
50 55 60 

Phe Vai Leu Arg Glu Arg Pro Giu Thr Vai Leu lie Asp Leu lie Gin 
^5 70 75 80 

Arg Thr Lys Asp Ala Val Arg Glu Leu Asp Asn Leu Gin Tyr Arg Lys 
85 90 95 

Met Lys Lys Leu Leu Phe Gin Glu Ala His Asn Gly Pro Ala Val Giu 
100 105 110 

Ala Gin Glu Glu Glu Glu Glu Gin Asp His Gly Val Gly Arg Thr Gly 
115 120 125 

Thr Val Asn Ser Vai Gly Ser Asn Gin Ser lie Pro Ser Met Ser lie 
■ 130 135 

Ser Ala Ser Ser Gin Ser Ser Ser Val Asn Ser Leu Pro Asp Val Ser 
^^^5 150 155 ' 160 

Asp Asp Lys Ser Glu Leu Asp Met Met Glu Gly Asp His Thr Val Met 
165 170 175 

Ser Asn Ser Ser Val lie His Leu Lys Pro Giu Glu Glu Asn Tyr Arg 
180 185 1,90 
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Glu Glu Gly Asp Pro Arg Thr Arg Ala Ser Asp Pro Gin Ser ' Pro Pro 
195 200 205 

Gin Vai Ser Arg His Lys Ser His Tyr Arg Asn Arg Glu His Phe Ala 
210 215 220 

Thr lie Arg Thr Ala Ser Leu Val Thr Arg Gin Met Gin Glu His Glu 
225 230 235 240 

Gin Asp Ser Glu Leu Arg Glu Gin Met Ser Gly Tyr Lys Arg Met Arg 
245 250 255 

Arg Gin His Gin Lys Gin Leu Met Thr Leu Glu Asn Lys Leu Lys Ala 
260 265 270 

Glu Met Asp Glu His Arg Leu Arg Leu Asp Lys Asp Leu Glu Thr Gin 
275 280 285 

Arg Asn Asn- Phe Ala Ala Glu Met Glu Lys Leu He Lys Lys His Gin 
290. 295 300 

Ala Ala Met Glu Lys Glu Ala Lys Val Met Ser Asn Glu Glu Lys Lys 
305 310 315 320 

Phe Gin Gin His He Gin Ala Gin Gin Lys Lys Glu Leu Asn Ser Phe 
325 330 335 

Leu Glu Ser Gin Lys Arg Glu Tyr Lys Leu Arg Lys Glu Gin Leu Lys 
340 345 350 

Glu Glu Leu Asn Glu Asn Gin Ser Thr Pro Lys Lys Glu Lys Gin Glu 
355 360 365 

Trp Leu Ser Lys Gin Lys Glu Asn He Gin His Phe Gin Ala Glu Glu 
370 375 380 

Glu Ala Asn Leu Leu Arg Arg Gin Arg Gin Tyr Leu Glu Leu Glu Cys 
3B5 390 395 4OO 

Arg Arg Phe Lys Arg Arg Met Leu Leu Gly Arg His Asn Leu Glu Gin 
405 410 415 

Asp Leu Val Arg Glu Glu Leu Asn Lys Arg Gin Thr Gin Lys Asp Leu 
420 425 430 

Glu His Ala Met Leu Leu Arg Gin His Glu Ser Met Gin Glu Leu Glu 
435 440 445 

Phe Arg His Leu Asn Thr He Gin Lys Met Arg Cys Glu Leu He Arg 
450 455 460 

Leu Gin His Gin Thr Giu Leu Tnr Asn Gin Leu Glu Tyr Asn Lys Arg 
^65 470 415 480 

Arg Glu Arg Glu Leu Arg Arg Lys His Val Met Glu Val Arg Gin Gin 
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485 490 495 

Pro Lys Ser Leu Lys Ser Lys Glu Leu Gin lie Lys Lys Gin Phe Gin 
500 505 510 

Asp Thr Cys Lys He Gin Thr Arg Gin Tyr Lys Ala Leu Arg Asn His 
515 520 525 

Leu Leu Glu Thr Thr Pro Lys Ser Glu His Lys Ala Val Leu Lys Arg 
530 535 540 

Leu Lys Glu Glu Gin Thr Arg Lys Leu Ala He Leu Ala Glu Gin Tyr 
545 550 555 560 

Asp His Ser lie Asn Glu Met Leu Ser Thr Gin Ala Leu Arg Leu Asp 
565 570 575 

Glu Ala Gin Glu Ala Glu Cys Gin Val Leu Lys Met Gin Leu Gin Gin 
■580 .585 590 

Glu Leu Glu Leu Leu Asn Ala Tyr Gin Ser Lys He Lys Met Gin Ala 
595 600 605 

Glu Ala Gin His Asp Arg Glu Leu Arg Glu Leu Glu Gin Arg Val Ser 
610 615 620 

Leu Arg Arg Ala Leu Leu Glu Gin Lys He Glu Glu Glu Met Leu Ala 
^25 630 635 640 

Leu Gin Asn Glu Arg Thr Glu Arg He Arg Ser Leu Leu Glu Arg Gin 
645 650 655 

Ala Arg Glu He Glu Ala Phe Asp Ser Glu Ser Met Arg Leu Gly Phe 
660 665 670 

Ser Asn Met Val Leu Ser Asn Leu Ser Pro Glu Ala Phe Ser His Ser 
675 680 685 

Tyr Pro Gly Ala Ser Gly Trp Ser His Asn Pro Thr Gly Gly Pro Gly 
690 695 700 

Pro His Trp Gly His Pro Met Gly Gly Pro Pro Gin Ala Tro Gly His 
"^05 710 715 ' 720 

Pro Met Gin Gly Gly Pro Gin Pro Trp Gly His Pro Ser Gly Pro Met 
725 730 735 

Gin Gly Val Pro Arg Gly Ser Ser Met Gly Val Arg Asn Ser Pro Gin 
740 745 750 

Ala Leu Arg Arg Thr Ala Ser Gly Gly Arg Thr Glu Gin Gly Met Ser 
755 760 765 

Arg Ser Thr Ser Val Thr Ser Gin He Ser Asn Gly Ser His Met Ser 
770 775 780 
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Tyr Thr 
785 



<210> 24 
<211> 748 
<212> PRT 

<213> Mammalian (Murine) SULU3 
<400> 24 

Met Pro Ser Thr Asn Arg Ala Gly Ser Leu Lys Asp Pro Glu lie Ala 
1 5 10 ' 15 

Glu Leu Phe Phe Lys Glu Asp Pro .Glu Lys Leu Phe Thr Asp Leu Arg 
20 25 30 

Glu He Gly His Gly Ser Phe Gly Ala Val Tyr Phe Ala Arg Asp Val 
35 40 45 

Arg Thr Asn Glu Val Val Ala He Lys Lys Met Ser Tyr Ser Gly Lys 
50 55 60 

Gin Ser Thr Glu Lys Trp Gin Asp He He Lys Glu Val Lys Phe Leu 
65 70 75 80 

Gin Arg He Lys His Pro Asn Ser He Glu Tyr Lys Gly Cys Tyr Leu 
85 90 95 

Arg Glu His Thr Ala Trp Leu Val Met Glu Tyr Cys Leu Gly Ser Ala 
100 105 110 

Ser Asp Leu Leu Glu Val His Lys Lys Pro Leu Gin Glu Val Glu He 
115 120 125 

Ala Ala He Thr His Gly Ala Leu Gin Gly Leu Ala Tyr Leu His Ser 
130 135 140 

His Thr. Met He His Arg Asp He Lys Ala Gly Asn He Leu Leu Thr 

150 155 160 

Glu Pro Gly Gin Val Lys Leu Ala Asp Phe Gly Ser Ala Ser Met Ala 
165 170 175 

Ser Pro Ala Asn Ser Phe Val Gly Thr Pro Tyr Trp Met Ala Pro Glu 
180 185 190 

Val He Leu Ala Met Asp Glu Gly Gin Tyr Asp Gly Lys Val Asd Val 
193 200 205 

Trp Ser Leu Gly He Thr Cys He Glu Leu Ala Glu Arg Lvs Pro Pro 
210 215 220 
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Leu Phe Asn Met Asn Ala Met Ser Ala Leu Tyr His He Ala Gin Asn 
225 230 235 240 

Glu Ser Pro Thr Leu Gin Ser Asn Met Asn Asp Ser Cys Leu Gin Lys 
245 250 255 

He Pro Gin Asp Arg Pro Thr Ser Glu Glu Leu Leu Lys His Met Phe 
260 265 270 

Val Leu Arg Glu Arg Pro Glu Thr Val Leu He Asp Leu He Gin Arg 
275 280 285 

Thr Lys Asp Ala Val Arg Glu Leu Asp Asn Leu Gin Tyr Arg Lys Met 
290 295 300 

Lys Lys Leu Leu Phe Gin Glu Ala His Asn Gly Pro Ala Val Glu Ala 

310 315 320 

Gin Giu Glu Glu Glu Glu Gin Asp His Gly Val Gly Arg Thr Gly Thr 
325 330 335 

Val Asn Ser Val Gly Ser Asn Gin Ser He Pro Ser Met Ser He Ser 
340 345 350 

Ala Ser Ser Gin Ser Ser Ser Val Asn Ser Leu Pro Asp Ala Ser Asp 
355 360 365 

Asp Lys Ser Glu Leu Asp Met Met Glu Gly Asp His Thr Val Met Ser 
370 375 380 

Asn Ser Ser .Val He His Leu Lys Pro Glu Glu Glu Asn Tyr Gin Glu 
385 390 395 4OO 

Glu Gly Asp Pro Arg Thr Arg Ala Ser Asp Pro Gin Ser Pro Pro Gin 
405 410 415 

Val Ser Arg His Lys Ser His Tyr Arg Asn Arg Glu His Phe Ala Thr 
420 425 430 

He Arg Thr Ala Ser Leu Val Thr Arg Gin Met Gin Glu His Glu Gin 
435 440 445 

Asp Ser Glu Leu Arg Glu Gin Met Ser Gly Tyr Lys Arg Met Arg Arg 
450 455 460 

Gin His Gin Lys Gin Leu Met Thr Leu Glu Asn Lys Leu Lys Ala Glu 

470 475 480 

Met Asp Glu His Arg Leu Arg Leu Asp Lys Asp Leu Glu Thr Gin Arg 
485 490 495 

Asn Asn Phe Ala Ala Glu Met Glu Lys Leu He Lvs " Lys His Gin Ala 
500 505 ' 510 

Ala Met Glu Lys Glu Ala Lys Val Met Ala Asn Glu Glu Lys Lys Phe 
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515 520 525 

Gin Gin His lie Gin Aia Gin Gin Lys Lys Giu Leu Asn Ser Phe Leu 
530 535 540 

Giu Ser Gin Lys Arc Giu Tyr Lys Leu Arg Lys Giu Gin Leu Lys Giu 

550 555 560 

Giu Leu Asn Giu Asn Gin Ser Thr Pro Lys Lys Giu Lys Gin Giu Trp 
565 570 575 

Leu Ser Lys Gin Lys Giu Asn lie Gin His Phe Gin Ala Giu Giu Giu 
580 585 590 

Ala Asn Leu Leu Arg Arg Gin Arg Gin Tyr Leu Giu Leu Giu Cys Arg 
595 600 605 

Arg Phe Lys. Arg Arg Met Leu Leu Gly Arg His Asn Leu Giu Gin Asp 

615 620 

Leu Val Arg Giu Giu Leu Asn Lys Arg Gin Thr Gin Lys Asp Leu Giu 
^25 630 635 640 

His Aia Met Leu Leu Arg Gin His Giu Ser Met Gin Giu Leu Giu Phe 
645 650 655 

Arg 'His Leu Asn Thr lie Gin Lys Met Arg Cys Giu Leu lie Arg Leu 
660 665 670 

Gin His Gin Thr Giu Leu Thr Asn Gin Leu Giu Tyr Asn Lys Arg Arg 
^■75 680 685 

Giu Arg Giu Leu Arg Arg Lys His Vai Met Giu Val Arg Gin Gin Pro 
690 695 700 

Lys Ser Leu Lys Ser Lys Giu Leu Gin lie Lys Lys Gin Phe Gin Asp 

710 715 720 

Thr Cys Lys lie Gin Thr Arg Gin Tyr Lys Ala Leu Arg Asn His Leu 
725 730 735 

Leu Giu Thr Thr Pro Lys Asn Giu His Lys Aia lie 
740 745 



<210> 25 
<211> 2795 
<212> DNA 

<2i3> Mammalian (Human) GEK2 
<400> 25 

cgaagccaca gcccgagccc gagcccgagc ccgagccggc gccaccgcgc ccccggccat 60 
ggcttttgcc aatctccgcc gcatcctgcg cctgtcracc ttcgagaaga gaaagtcccc 120 
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cgaaTiatgag cacgtccgcc gcgacctgga ccccaacgag gtgtgggaga tcgtgggcga 180 

gctgggcgac ggcgccttcg gcaaggttta caaggccaag aataaggaga cgggtgcttt 240 

ggctgcggcc aaagtcattg aaaccaagag tgaggaggag ctggaggacc acatcgtgga 300 

gattgagatc ctggccacct gcgaccaccc ctacattgtg aagctcctgg gagcctacta 360 

tcacgacggg aagctgrgga tcatgattga gttctgtcca gggggagccg tggacgccat 420 

catgctggag ctggacagag gcctcacgga gccccagata caggtggttt gccgccagat 480 

gctagaagcc ctcaacttcc tgcacagcaa gaggatcatc caccgagatc tgaaagctgg 540 

caacgtgctg atgaccctcg agggagacat caggctggct gactttggtg tgtctgccaa 600 

gaatctgaag actctacaga aacgagattc cttcatcggc acgccttact ggatggcccc 660 

cgaggrggtc atgtgtgaga ccatgaaaga cacgccctac gactacaaag ccgacatctg 720 

gtccctgggc atcacgccga ttgagatggc ccagatcgag ccgccacacc acgagctcaa 780 

ccccatgcgg gtcctgctaa agatcgccaa gtcggaccct cccacgctgc tcacgccctc 840 

caagtggcct gtagagctcc gtgacttcct gaagatagcc ctggataaga acccagaaac 900 

ccgacccagt gccgcgcagc tgctggagca tcccrtcgtc agcagcatca ccagtaacaa 960 

ggctctgcgg gagctggtgg ctgaggccaa ggccgaggtg atggaagaga tcgaagacgg 1020 

ccgggatgag ggggaagagg aggacgccgt ggatgccgcc tccaccctgg agaaccatac 1080 

tcagaactcc tctgaggtga gtccgccaag cctcaatgct gacaagcctc tcgaggagtc 1140 

acctrccacc ccgctggcac ccagccagtc tcaggacagt gtgaatgagc cctgcagcca 1200 

gccctcrggg gacagatccc tccaaaccac cagtccccca gtcgtggccc ctggaaatga 1260 

gaacggcctg gcagtgcctg tgcccctgcg gaagtcccga cccgtgtcaa tggatgccag 1320 

aattcaggca gcccaggaga agcaagttgc tgagcagggt ggggacctca gcccagcagc 1380 

caacagatct caaaaggcca gccagagccg gcccaacagc agcgccctgg agaccttggg 1440 

tggggagaag ctggccaatg gcagcctgga gccacctgcc caggcagctc cagggccttc 1500 

caagagggac tcggactgca gcagcctctg cacctctgag agcatggact atggtaccaa 1560 

tctctccact gacctgtcgc tgaacaaaga gatgggctct: ctgtccatca aggacccgaa 1620 

actgtacaaa aaaaccctca agcggacacg caaatttgtg gtggatggtg tggaggtgag 1680 

catcaccacc tccaagatca tcagcgaaga tgagaagaag gatgaggaga tgagatttct 1740 

caggcgccag gaactccgag agcttcggcr gctccagaaa gaagagcatc ggaaccagac 1800 

ccagctgagt aacaagcatg agctgcagct ggagcaaatg cataaacgtt ttgaacagga 1860 

aatcaacgcc aagaagaagt tctttgacac ggaattagag aacctggagc gtcagcaaaa 1920 

gcagcaagtg gagaagatgg agcaagacca tgccgtgcgc cgccgggagg aggccaggcg 1980 

gatccgcctg gagcaggatc gggactacac caggttccaa gagcagctca aactgatgaa 2040 

gaaagaggtg aagaacgagg tggagaagct cccccgacag cagcggaagg aaagcatgaa 2100 

gcagaagatg gaggagcaca cgcagaaaaa gcagcttctt gaccgggact ttgtagccaa 2160 

gcagaaggag gacctggagc tggccatgaa gaggctcacc accgacaaca ggcgggagat 2220 

ctgtgacaag gagcgcgagt gcctcatgaa gaagcaggag ctccttcgag accgggaagc 2280 

agccctgtgg gagatggaag agcaccagct gcaggagagg caccagctgg tgaagcagca 2340 

gctcaaagac cagtacttcc tccagcggca cgagctgctg cgcaagcatg agaaggagcg 2400 

ggagcagatg cagcgctaca accagcgcat gatagagcag ctgaaggtgc ggcagcaaca 2460 

ggaaaaggcg cggctgccca agatccagag gagrgagggc aagacgcgca tggccatgta 2520 

caagaagagc ctccacatca acggcggggg cagcgcagct gagcagcgtg agaagatcaa 2580 

gcagttctcc cagcaggagg agaagaggca gaagtcggag cggctgcagc aacagcagaa 2640 

acacgagaac cagatgcggg acatgctggc gcagtgcgag agcaacarga gcgagctgca 2700 

gcagctgcag aatgaaaagt gccacctcct ggtagagcac gaaacccaga aactgaaggc 2760 

cctggatgag agccataacc agaacctgaa ggaat 2795 



<210> 26 
<211> 912 
<212> PRT • 

<213> Mammalian (Human) GEK2 
<400> 26 

Met Ala Phe Ala Asn Phe Arc Arg lie Leu Arg Leu 3er Thr Phe Glu 
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Lys Arg Lys 3er Arg Giu Tyr Glu His Val Arg Arg Aso Leu As^ ^ro 
20 25 ' 30 ' 

Asn Giu Val Trp Giu He Val Gly Glu Leu Giy Asp Giy Ala Phe Gly 
35 40 ' 45 

Lys Val Tyr Lys Ala Lys Asn Lys Glu Thr Gly Ala Leu Ala Ala Ala 
50 55 60 

Lys Val He Glu Thr Lys Ser Glu Glu Glu Leu Glu Asp Tyr I^e Val 
65 70 75 * 80 

Giu He Glu lie Leu Ala Thr Cys Asp His Pro Tyr He Val Lys Leu 
85 90 95 

Leu Gly Ala Tyr Tyr His Asp Gly Lys Leu Trp He Met He Giu Phe 
•100 105 110 

Cys Pro Gly Gly Ala Val Asp Ala He Met Leu Glu Leu Asp Arg Gly 

120 125 

Leu Thr Giu Pro Gin He Gin Val Val Cys Arg Gin Met Leu Glu Ala 
1^0 135 140 

Leu Asn Phe Leu His Ser Lys Arg He He His Arg Asp Leu Lys Ala 

150 155 160 

Gly Asn Val Leu Met Thr Leu Glu Gly Asp He Arg Leu Ala Asp Phe 
165 170 175 

Giy Val Ser Ala Lys Asn Leu Lys Thr Leu Gin Lys Arg Asp Ser Phe 
180 185 190 

He Gly Thr Pro Tyr Trp Met Ala Pro Glu Val Val Met Cys Glu Thr 
1^5 200 205 

Met Lys Asp Thr Pro Tyr Asp Tyr Lys Ala Asp He Tro Ser Leu Gly 
210 215 220 

He Thr Leu He Giu Met Ala Gin He Giu Pro Pro His His Giu Leu 

230 235 240 

Asn Pro Met Arg Val Leu Leu Lys He Ala Lys Ser Asp Pro Pro Thr 
245 250 255 

Leu Leu Thr Pro Ser Lys Trp Ser Val Giu Phe Arg Asd Phe Leu Lys 
260 265 270 

He Ala Leu Asp Lys Asn Pro Glu Thr Arg Pro Ser Ala Ala Gin Leu 
275 280 285 

Leu Glu His Pro Phe Val Ser Ser He Thr Ser Asn Lys Ala L-u Ara 
250 295 300 
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Glu Leu Val Ala Glu Ala Lys Ala Glu Val Met Glu Glu lie Glu Asp 
305 310 315 320 

Gly Arg Asp Giu Gly Glu Glu Glu Asp Ala Val Asp Ala Ala Ser Thr 
325 330 335 

Leu Glu Asn His Thr Gin Asn Ser Ser Glu Val Ser Pro Pro Ser Leu 
340 345 

Asn Ala Asp Lys Pro Leu Glu Glu Ser Pro Ser Thr Pro Leu Ala Pro 
355 360 365 

Ser Gin Ser Gin Asp Ser Val Asn Glu Pro Cys Ser Gin Pro Ser Gly 
370 375 380 

Asp Arg Ser Leu Gin Thr Thr Ser Pro Pro Val Val Ala Pro Gly Asn 

390 395 400 

Glu Asn Gly. Leu Ala Val Pro Val Pro Leu Arg Lys Ser Arg Pro Val 
405 410 415 

Ser Met Asp Ala Arg He Gin Val Ala Gin Glu Lys Gin Val Ala Glu 
420 425 430 

Gin Gly Gly Asp Leu Ser Pro Ala Ala Asn Arg Ser Gin Lys Ala Ser 
435 440 445 

Gin Ser Arg Pro Asn Ser Ser Ala Leu Glu Thr Leu Gly Gly Glu Lvs 

455 460 

Leu Ala Asn Gly Ser Leu Glu Pro Pro Ala Gin Ala Ala Pro Gly Pro 

470 475 480 

Ser Lys Arg Asp Ser Asp Cys Ser Ser Leu Cys Thr Ser Glu Ser Met 
485 490 

Asp Tyr Gly Thr Asn Leu Ser Thr Asp Leu Ser Leu Asn Lys Glu Met 
500 505 510 

Gly Ser Leu Ser He Lys Asp Pro Lys Leu Tyr Lys Lys Thr Leu Lys 
515 520 525 

Arg Thr Arg Lys Phe Val Val Asp Gly Val Glu Val Se- II- Thr Th- 

535 540 

Ser Lys lie He Ser Glu Asp Glu Lys Lys Asp Glu Glu Met Arg Phe 



^45 550 555 



560 



Leu Arg Arg Gin Glu Leu Arg Giu Leu Arg Leu Leu Gin Lys Glu Glu 

565 570 575 

His Arg Asn Gin Thr Gin Leu Ser Asn Lys His Glu Leu Gin Leu Glu 
580 585 

Gin Met His Lys Arg Phe Glu Gin Glu He Asn Aia Lvs Lys Lvs Phe 
59= 600 605 
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Phe Asp Thr Glu Leu Glu Asn Leu Glu Arg Gin Gin Lys Gin Gin Val 
610 615 620 

Glu Lys Met Glu Gin Asp His Ala Val Arg Arg Arg Glu Glu Ala Arg 
625 630 635 640 

Arg He Arg Leu Glu Gin Asp Arg Asp Tyr Thr Arg Phe Gin Glu Gin 
645 650 655 

Leu Lys Leu Met Lys Lys Glu Val Lys Asn Glu Val Glu Lys Leu Pro 
660 665 670 

Arg Gin Gin Arg Lys Glu Ser Met Lys Gin Lys Met Glu Glu His Thr 
675 680 685 

Gin Lys Lys Gin Leu Leu Asp Arg Asp Phe Val Ala Lys Gin Lys Glu 
. 690 695 700 

Asp Leu Glu Leu Ala Met Lys Arg Leu Thr Thr Asp Asn Arg Arg Glu 
705 710 715 720 

He Cys Asp Lys Glu Arg Glu Cys Leu Met Lys Lys Gin Glu Leu Leu 
725 730 735 

Arg Asp Arg Glu Ala Ala Leu Trp Glu Met Glu Glu His Gin Leu Gin 
■740 745 750 

Glu Arg His Gin Leu Val Lys Gin Gin Leu Lys Asp Gin Tyr Phe Leu 
755 760 765 

Gin Arg His Glu Leu Leu Arg Lys His Giu Lys Glu Arg Glu Gin Met 
770 775 780 

Gin Arg Tyr Asn Gin Arg Met lie Glu Gin Leu Lys Val Arg Gin Gin 
785 790 795 800 

Gin Glu Lys Ala Arg Leu Pro Lys He Gin Arg Ser Glu Gly Lys Thr 
805 810 815 

Arg Met Ala Met Tyr Lys Lys Ser Leu His He Asn Gly Gly Gly Ser 
820 825 830 

Ala Ala Glu Gin Arg Glu Lys He Lys Gin Phe Ser Gin Gin Glu Glu 
635 840 845 

Lys Arg Gin Lys Ser Glu Arg Leu Gin Gin Gin Gin Lys His Giu Asn 
850 855 860 

Gin Met Arg Asp Met Leu Ala Gin Cys Glu Ser Asn Met Ser G^u Leu 
865 870 875 880 

Gin Gin Leu Gin Asn Glu Lys Cys His Leu Leu Val Glu His Glu Thr 
885 890 895 
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Gln.Lys Leu Lys Aia Leu Asp Glu Ser His Asn Gin Asn Leu Lys Glu 



<210> 27 
<211> 3604 
<2i2> DNA 

<213> Mammalian (Human) PAK4 
<400> 27 

cgttcctggg czzcccgccc cgcaggcctg cggaggactg gcccagcaag gtcccaggtc 60 

ttccctctcc ttagcgccta agagagaggc ccagtgcggg tgaggagtcg cgaggaagag 120 

gcggaaggcg ccggaaggca ccatgrtccg caagaaaaag aagaaacgcc ctgagatctc 180 

agcgccacag aacttccagc accgtgrcca cacctccttc gaccccaaag aaggcaagtt 240 

tgtgggcctc cccccacaat ggcagaacat cctggacaca ctgcggcgcc ccaagcccgt 300 

ggtggaccct tcgcgaatca cacgggtgca gctccagccc atgaagacag tggtgcgggg 360 

cagcgcgarg cctgtggatg gctacatctc ggggctgcrc aacgacatcc agaagttgtc 420 

agtcatcagc tccaacaccc tgcgtggccg cagccccacc agccggcggc gggcacagtc 480 

cctggggctg crgggggatg agcactgggc caccgaccca gacatgtacc tccagagccc 540 

ccagtctgag cgcactgacc . cccacggcct ctacctcagc tgcaacgggg gcacaccagc 600 

aggccacaag cagatgccgt ggcccgagcc acagagccca cgggtcctgc ccaatgggct 660 

ggcrgcaaag gcacagtccc tgggccccgc cgagtttcag ggtgcctcgc agcgctgtct 720 

gcagctgggt gcctgcctgc agagctcccc accaggagcc tcgcccccca cgggcaccaa 780 

taggcatgga atgaaggctg ccaagcacgg ctctgaggag gcccggccac agtcctgcct 840 

ggtgggctca gccacaggca ggccaggtgg ggaaggcagc cctagcccta agacccggga 900 

gagcagcctg aagcgcaggc tattccgaag catgttcctg tccactgctg ccacagcccc 960 

tccaagcagc agcaagccag gccctccacc acagagcaag cccaactcct ctttccgacc 1020 

gccgcagaaa gacaaccccc caagcctggt ggccaaggcc cagtccttgc cctcggacca 1080 

gccggtgggg accttcagcc ctctgaccac ttcggatacc agcagccccc agaagtccct 1140 

ccgcacagcc ccggccacag gccagcttcc aggccggtct tccccagcgg gatccccccg 1200 

cacctggcac gcccagatca gcaccagcaa cctgtacctg ccccaggacc ccacggttgc 1260 

caagggtgcc ctggctggtg aggacacagg tgttgtgaca catgagcagt tcaaggctgc 1320 

gctcaggatg gtggtggacc agggtgaccc ccggctgctg ctggacagct acgtgaagat 1380 

tggcgagggc tccaccggca tcgtccgctr ggcccgggag aagcactcgg gccgccaggt 1440 

ggccgtcaag atgatggacc tcaggaagca gcagcgcagg gagctgctct tcaacgaggt 1500 

ggtgatcatg cgggactacc agcacttcaa cgtggtggag atgtacaaga gctacctggt 1560 

gggcgaggag ctgtgggtgc tcatggagtt cctgcaggga ggagccctca cagacatcgt 1620 

ctcccaagtc aggctgaatg aggagcagat tgccactgtg tgtgaggctg tgctgcaggc 1680 

cctggcctac ctgcatgctc agggtgtcat ccaccgggac arcaagagtg actccatcct 1740 

gctgaccctc gatggcaggg cgaagctctc ggacttcgga ttctgtgctc agatcagcaa 1800 

agacgtccct aagaggaagt ccctggtggg aaccccctac tggatggctc ctgaagtgat 1860 

ctccaggtct ttgtatgcca ctgaggtgga tatctggtct ctgggcatca tggtgattga 1920 

gatggtagat ggggagccac cgtacttcag tgactcccca gtgcaagcca tgaagaggct 1980 

Gcgggacagc cccccaccca agctgaaaaa ctctcacaag gtctccccag tgctgcgaga 2040 

cttcctggag cggatgctgg tgcgggaccc ccaagagaga gccacagccc aggagctcct 2100 

agaccacccc ttcctgctgc agacagggct acctgagtgc ctggtgcccc tgarccagct 2160 

ctaccgaaag cagacctcca cccgctgagc ccaccccaag tatgcctgcc acctacgccc 2220 

acaggcaggg cacactgggc agccagcctg ccgqcaggac ttgcctgcct cctcctctca 2280 

gtattctctc caaagattga aatgtgaagc cccagcccca ccctctgccc tccagcctac 2340 

rgggccaggc cggacctgcc ccctcagtgt ctctccctcc cgagtcccca gatggagacc 2400 

cctttctaca ggatgacccc trgatatttg cacagggata tttctaagaa acgcagaggc 2460 

cagcgttcct ggcctctgca gccaacacag tagaaaaggc tgctgtggtt ttrtaaaggc 2520 

agttgrccac tagtgtccta ggccactgca gagggcagac tgctggtctc cacagaCacc 2530 
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tgctgtrctc agctccagct tcaaacctcg agtctcgaga gggccacggg gcggttttca 2640 

tgaccggaat cccgcttcct cccrcacgtc tgatgtcctg aaggtgcagr cccacctgta 2700 

cagcccctcc ccgccaagaa ctgtgaatgg cctgctccag gccatggcrg ggggcaggga 2760 

gtgaggggac aatttctgag tgaaagagaa agaatggggt cggtggtgaa ggtgcccrca 2820 

ctttacagaa tggagagaac atcgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 2880 

tgtgtgtgtg tgtgtgtgtg tgtgtgtaag gggaggaaag ccaccttgac agcccaggrc 2940 

cctccaggtc acccacagcc agtttcagga aggctgcccc tctctcccac taagttctgg 3000 

cctgaaggga cctgctttct tggcctggct tccacctctc cactccrgtg tctacctggc 3060 

cagtggagtg gtccatgcta agtctaacac tcctgggagc tcaggaggct tctgagcttc 3120 

tcctgtactg tgcatcgtga gggccagaga caggaatgta aggattggca actgtgttac 3180 

ctttcaagtt tatctcaata accaggtcat cagggaccca ttgttctctt cagaacccta 3240 

tctgggagag aaggcgaacc acctccgggt ttccatcatg tcaaggtcac aggcatccat 3300 

gtgtgcaaac catctgcccc agcrgcctcc acagactgct gtctccttgt cctcctcggc 3360 

cctgccccac ttcagggcrg crgtgagatg gaattccagg aaagaacttc aggtgtctgg 3420 

acccttrcta tctagataat atttttagat tcttctgctc cctagtgacc tacctggggg 3480 

caaagaaatc gcaaggactt ctttttaagg gtcagagttt tcaaaacaaa agcatcttcc 3540 

ctagaaattt ttgtgaattg tttgcacttg tgcctgtttt aaattaaart gagtgttcaa 3600 

^^^^ 3604 



<210> 28 
<211> 2050 
<212> DNA 

<213> Mammalian (Human) PAK5 
<400> 28' 

ggccagtggg gcgaaactgg cagctggccg gccctttaac acctacccga gggctgacac 60 

ggaccaccca tcccggggtg cccaggggga gcctcatgac gtggccccta acgggccatc 120 

agcggggggc ctggccatcc cccagtcctc ctcctcctcc tcccggcctc ccacccgagc 180 

ccgaggtgcc cccagccctg gagrgctggg accccacgcc tcagagcccc agctggcccc 240 

tccagcccgc acccccgccg cccctgctgt tcctgggccc cctggccccc gctcaccaca 300 

gcgggagcca cagcgagtat cccatgagca gttccgggct gccctgcagc tggtggtgga 360 

cccaggcgac ccccgctcct accrggacaa cttcatcaag attggcgagg gctccacggg 420 

catcgtgtgc arcgccaccg tgcgcagctc gggcaagctg gtggccgtca agaagatgga 480 

cctgcgcaag cagcagaggc gcgagctgct cttcaacgag gtggtaatca tgagggacta 540 

ccagcacgag aatgtggtgg agatgtacaa cagctacctg gtgggggacg agctctgggt 600 

ggtcatggag trcctggaag gaggcgccct caccgacatc gtcacccaca ccaggatgaa 660 

cgaggagcag atcgcggccg tgtgccttgc agtgctgcag gccctgtcgg tgctccacgc 720 

ccagggcgtc atccaccggg acatcaagag cgactcgatc ctgctgaccc atgatggcag 780 

ggtgaagctg tcagactttg ggttctgcgc ccaggtgagc aaggaagtgc cccgaaggaa 840 

gtcgctggtc ggcacgccct actggatggc cccagagctc atctcccgcc ttccctacgg 900 

gccagaggta gacatctggt cgctggggat aatggtgatt gagatggtgg acggagagcc 960 

cccctactrc aacgagccac ccctcaaagc catgaagatg attcgggaca acctgccacc 1020 

ccgactgaag aacctgcaca aggrgtcgcc atccctgaag ggcttcctgg accgcctgct 1080 

ggtgcgagac cctgcccagc gggccacggc agccgagccg ctgaagcacc cattcctggc 1140 

caaggcaggg ccgcctgcca gcatcgtgcc ccrcatgcgc cagaaccgca ccagatgagg 1200 

cccagcgccc ttcccctcaa ccaaagagcc cccccgggtc acccccgccc cactgaggcc 1260 

agtagggggc caggcctccc actccrccca gcccgggaga tgctccgcgt ggcaccaccc 1320 

tccttgcrgg gggtagatga gaccctacra ctgaactcca gttttgatct cgtgactttt 1380 

agaaaaacac agggactcgt gggagcaagc gaggctccca ggacccccac cctcrgggac 1440 

aggcccrccc ccatgttctt ctgtctccag gaagggcagc ggcccrccca tcacrggaag 1500 

tctgcagtgg gggtcgctgg gggrggagag aacacuaaga ggtgaacatg tatgagtgtg 1560 

tgcacgcgtg rgagtgtgca tgtgtgtgtg tgrgcaaagg tccagccacc ccgtcctcca 1620 

gcccgcaagg ggrgtctggc gccttgcctg acacccagcc ccctcrcccc ctgagccatt 1680 
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grgggggtcg 
ccgcggctgg 
gactactgca 
acactctagc 
gcagaaccac 
ccccrgcctc 
aaaaaaaaaa 



arcatgaatg 
atggggagac 
cctggacagc 
ccctgccctt 
tgcctgaaga 
gagttagtt t 



uccgaagagt 
aggtcagggc 
ctcctctttt 
attgggggac 
accaggttcc 
tacaattaaa 



ggcctt t tec 
cccccaccct 
ctagaagtct 
agatggtccc 
tgcccggtca 
acattgtctt 



cgtagccctg 
ctccagcccc 
atttatattg 
tgrcctgcgg 
gcgcagcccc 
gttttgtgaa 



cgcccccttt 
tgcagcaaat 
tcattttata 
ggtggccctg 
agcccgccca 
aaaaaaaaaa 



1740 
1800 
1860 
1920 
1980 
2040 
2050 



<210> 29 
<211> 681 
<212> PRT 

<213> Mammalian (Human) PAK4 
<400> 29 

Met Phe Arg Lys Lys Lys Lys Lys Arg Pro Glu lie Ser Ala Pro Gin 

1. .... 5 . 10 15 

Asn Phe Gin His Arg Val His Thr Ser Phe Asp Pro Lys Glu Gly Lys 
20 26 30 

Phe Val Gly Leu Pro Pro Gin Trp Gin Asn lie Leu Asp Thr Leu Arg 
35 40 45 

Arg Pro Lys Pro Val Val Asp Pro Ser Arg lie Thr Arg Val Gin Leu 
50 55 60 

Gin Pro Met Lys Thr Val Val Arg Gly Ser Ala Met Pro Val Asp Gly 
65 70 75 80 

Tyr lie Ser Gly Leu Leu Asn Asp lie Gin Lys Leu Ser Val lie Ser 
85 90 95 

Ser Asn Thr Leu Arg Gly Arg Ser Pro Thr Ser Arg Arg Arg Ala Gin 
100 105 110 

Ser Leu Gly Leu Leu Gly Asp Glu His Trp Ala Thr Asp Pro Asp Met 
115 120 125 

Tyr Leu Gin Ser Pro Gin Ser Glu Arg Thr Asp Pro His Gly Leu Tyr 
130 135 140 

Leu Ser Cys Asn Gly Gly Thr Pro Ala Gly His Lys Gin Met Pro Trp 
145 150, 155 160 

Pro Glu Pro Gin Ser Pro Arg Val Leu Pro Asn Gly Leu Ala Ala Lys 
165 170 175 

Ala Gin Ser Leu Gly Pro Ala Glu Phe Gin Gly Ala Ser Gin Arg Cys 
180 185 190 

Leu Gin Leu Gly Ala Cys Leu Gin Ser Ser Pro Pro Gly Ala Ser Pro 
195 200 205 
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Pro Thr Giy Thr Asn Arg His Gly Met Lys Ala Ala Lys His Gly Ser 
210 215 220 

Glu Glu Ala Arg Pro Gin Ser Cys Leu Val Giy Ser Ala Thr Gly Arg 
225 230 235 ' 240 

Pro Gly Gly Glu Gly Ser Pro Ser Pro Lys Thr Arg Glu Ser Ser Leu 
245 250 255 

Lys Arg Arg Leu Phe Arg Ser Met Phe Leu Ser Thr Ala Ala Thr Ala 
260 265 270 

Pro Pro Ser Ser Ser Lys Pro Gly Pro Pro Pro Gin Ser Lys Pro Asn 
275 280 285 

Ser Ser Phe Arg Pro Pro Gin Lys Asp Asn Pro Pro Ser Leu Val Ala 
290 295 . 300 

Lys Ala Gin Ser Leu Pro Ser Asp Gin Pro Val Gly Thr Phe Ser Pro 
305 310 315 320 

Leu Thr Thr Ser Asp Thr Ser Ser Pro Gin Lys Ser Leu Arg Thr Ala 
325 330 335 

Pro Ala Thr Gly Gin Leu Pro Gly Arg Ser Ser Pro Ala Gly Ser Pro 
340 345 350 

Arg Thr Trp His Ala Gin lie Ser Thr Ser Asn Leu Tyr Leu Pro Gin 
355 360 365 

Asp Pro Thr Val Ala Lys Gly Ala Leu Ala Gly Glu Asp Thr Gly Val 
370 375 380 

Val Thr His Glu Gin Phe Lys Ala Ala Leu Arg Met Val Val Asp Gin 

390 395 400 

Gly Asp Pro Arg Leu Leu Leu Asp Ser Tyr Val Lys lie Gly Glu Gly 
405 410 415 

Ser Thr Gly lie Val Cys Leu Ala Arg Glu Lys His Ser Gly Arg Gin 
420 425 430 

Val Ala Val Lys Met Met Asp Leu Arg Lys Gin Gin Arg Arg Glu Leu 
435 440 445 

Leu Phe Asn Glu Val Val He Met Arg Asp Tyr Gin His Phe Asn Val 
450 455 * 460 

Val Glu Met Tyr Lys Ser Tyr Leu Val Gly Glu Glu Leu Trp Val' Leu 

470 475 480 

Met Glu Phe Leu Gin Gly Gly Ala Leu Thr Asp He Val Ser Gin Val 
485 490 ' 495 

Arg Leu Asn Glu Glu Gin He Ala Thr Val Cys Glu Ala Val Leu Gin 
500 505 ' 510 
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Ala Leu Ala Tyr Leu His Ala Gin Gly Val lie His Arg Asp lie Lys 
515 520 525 

Ser Asp Ser lie Leu Leu Thr Leu Asp Gly Arg Val Lys Leu Ser Asp 
530 535 540 

Phe Gly Phe Cys Ala Gin lie Ser Lys Asp Val Pro Lys Arg Lys Ser 
545 550 555 560 

Leu Val Gly Thr Pro Tyr Trp Met Ala Pro Glu Val lie Ser Arg Ser 
565 570 575 

Leu Tyr Ala Thr Glu Val Asp lie Trp Ser Leu Gly lie Met Val lie 
580 585 590 

Glu Met Val Asp Gly Glu Pro Pro Tyr Phe Ser Asp Ser Pro Val Gin 
595 600 605 

Ala Met Lys Arg Leu Arg Asp Ser Pro Pro Pro Lys Leu Lys Asn Ser 
610 615 620 

His Lys Val Ser Pro Vai Leu Arg Asp Phe Leu Glu Arg Met Leu Val 
625 630 635 640 

Arg Asp Pro Gin Glu Arg Ala Thr Ala Gin Glu Leu Leu Asp His Pro 
645 650 655 

Phe Leu Leu Gin Thr Gly Leu Pro Glu Cys Leu Vai Pro Leu lie Gin 
660 665 670 

Leu Tyr Arg . Lys Gin Thr Ser Thr Cys 
675 • 680 



<210> 30 
<211> 398 
<212> PRT 

<213> Mammalian (Human) PAK5 
<400> 30 

Ala Ser Gly Ala Lys Leu Ala Ala Gly Arg Pro Phe Asn Thr Tyr Pro 
15 10 15 

Arg Ala Asp Thr Asp His Pro Ser Arg Gly Aia Gin Gly Glu Pro His 
20 25 30 

Asp Val Aia Pro Asn Gly Pro Ser Ala Gly Gly Leu Ala lie Pro Gin 
35 40 45 

Ser Ser Ser Ser Ser Ser Arg Pro Pro Thr Arg Aia Arg Gly Ala Pro 
50 55 60 
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Ser Pro Gly Vai 
65 

Pro Ala Cys Thr 



Arg Ser Pro Gin 
100 

Ala Ala Leu Gin 
115 

Asp Asn Phe lie 
130 



Leu Gly Pro His 
70 

Pro Ala Ala Pro 
85 

Arg Glu Pro Gin 



Leu Val Val Asp 
120 

Lys lie Gly Glu 
135 



Ala Ser Glu Pro 
75 

Ala Val Pro Gly 
90 

Arg Val Ser His 
105 

Pro Gly Asp Pro 



Gly Ser Thr Gly 
140 



Gin Leu Ala Pro 
80 

Pro Pro Gly Pro 
95 

Glu Gin Phe Arg 
110 

Arg Ser Tyr Leu 
125 

lie Val Cys lie 



Ala Thr Val Arg Ser Ser Gly Lys Leu Val Ala Val Lys Lys Met Asp 
1*35 150 155 160 

Leu Arg Lys. Gin Gin Arg Arg Glu Leu Leu Phe Asn Glu Val Val He 
165 170 175 

Met Arg Asp Tyr Gin His Glu Asn Val Val Glu Met Tyr Asn Ser Tyr 
180 185 190 

Leu Val Gly Asp Glu Leu Trp Val Val Met Glu Phe Leu Glu Gly Gly 
195 200 205 

Ala *Leu Thr Asp He Val Thr His Thr Arg Met Asn Glu Glu Gin He 
210 215 220 

Ala Ala Val Cys Leu Ala Val Leu Gin Ala Leu Ser Val Leu His Ala 
225 230 235 240 

Gin Gly Val He His Arg Asp He Lys Ser Asp Ser He Leu Leu Thr 
245 250 255 

His Asp Gly Arg Val Lys Leu Ser Asp Phe Gly Phe Cys Ala Gin Val 
260 265 270 

Ser Lys Glu Val Pro Arg Arg Lys Ser Leu Val Gly Thr Pro Tyr Trp 
275 280 285 

Met Ala Pro Glu Leu He Ser Arg Leu Pro Tyr Gly Pro Glu Val Asp 
290 295 300 

He Trp Ser Leu Gly He Met Val He Giu Met Val Asp Gly Glu Pro 
'■■•^ 310 315 . 320 

Pro Tyr Phe Asn Glu Pro Pro Leu Lys Ala Met Lys Met He Arg Asp 
325 330 335 

Asn Leu Pro Pro Arg Leu Lys Asn Leu His Lys Val Ser Pro Ser Leu 
340 345 350 



Lys Gly Phe Leu Asp Arg Leu Leu Val Arg Asp Pro Ala Gin Arg Ala 
355 360 365 



BMSDOCID: <WO 9953036A2_tA> 



wo 99/53036 



59 



PCT/US99/08I50 



Thr Aia Ala Glu Leu Leu Lys His Pro Phe Leu Ala Lys Ala Gly Pro 

370 375 380 

Pro Ala Ser lie Val Pro Leu Met Arg Gin Asn Arg Thr Arg 

385 390 395 



<210> 31 
<211> 1001 
<212> PRT 

<213> Mammalian (murine/human) SULU3 
<400> 31 

Met Pro Ser Thr Asn Arg Ala Gly Ser Leu Lys Asp Pro Glu lie Ala 
15 10 15 

Glu Leu Phe Phe Lys Glu Asp Pro Glu Lys Leu Phe Thr Aso Leu Arg 
20 25 30" 

Glu He Gly His Gly Ser Phe Gly Ala Val Tyr Phe Ala Arg Asp Val 
35 40 45 

Arg Thr Asn Glu Val Val Aia He Lys Lys Met Ser Tyr Ser Gly Lys 
50 55 60 

Gin Ser Thr Glu Lys Trp Gin Asp He He Lys Glu Val Lys Phe Leu 
65 70 75 80 

Gin Arg He Lys His Pro Asn Ser He Giu Tyr Lys Gly Cys Tyr Leu 
35 90 " 95 

Arg Giu His Thr Ala Trp Leu Val Met Glu Tyr Cys Leu Gly Ser Aia 
100 105 110 

Ser Asp Leu Leu Glu Val His Lys Lys Pro Leu Gin Glu Val Glu He 
115 120 125 

Ala Ala He Thr His Gly Ala Leu Gin Gly Leu Ala Tyr Leu His Ser 
130 135 140 

His Thr Met He His Arg Asp He Lys Ala Gly Asn He Leu Leu Thr 
1^5 150 155 160 

Glu Pro Gly Gin Val Lys Leu Ala Asp Phe Giy Ser Ala Ser Met Ala 
165 170 . 175 

Ser Pro Ala Asn Ser Phe Val Giy Thr Pro Tyr Trp Met Ala Pro Glu 
180 . 185 ' 190 

Val He Leu Ala Met Asp Giu Giy Gin Tyr Asp Giy Lys Val Asp Val 
195 200 205 

Trp Ser Leu Giy He Thr Cys He Glu Leu Aia Glu Arg Lys Pro Pro 
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Leu Phe Asn Met Asn 
225 

Glu Ser Pro Thr Leu 
245 

Phe Vai Asp Ser Cys 
260 

Glu Glu Leu Leu Lys 
275 

Val Leu lie Asp Leu 
290 



60 

215 

Aia Met Ser Ala Leu Tyr 
230 235 

Gin Ser Asn Giu Trp Ser 
250 

Leu Gin Lys lie Pro Gin 
265 

His lie Phe Val Leu Arg 
280 

lie Gin Arg Thr Lys Asp 
295 



220 

His lie Aia Gin Asn 
240 

Asp Tyr Phe Arg Asn 
255 

Asp Arg Pro Thr Ser 
270 

Giu Arg Pro Giu Thr 
285 

Aia Val Arg Giu Leu 
300 



Asp Asn Leu Gin Tyr Arg Lys Met Lys Lys Leu Leu Phe Gin Giu Aia 
305 310 315 320 

His Asn Giy Pro Aia Val Giu Aia Gin Glu Giu Glu Glu Giu Gin Asp 
325 330 335 

His Giy Vai Giy Arg Thr Giy Thr Vai Asn Ser Vai Giy Ser Asn Gin 
340 345 350 

Ser lie Pro Ser Met Ser lie Ser Aia Ser Ser Gin Ser Ser Ser Val 
355 ' 360 365 

Asn Ser Leu Pro Asp Vai Ser Asp Asp Lys Ser Giu Leu Asp Met Met 
370 375 380 

Glu Giy Asp His Thr Vai Mer Ser Asn Ser Ser Vai lie His Leu Lys 
385 390 395 400 

Pro Giu Giu Giu Asn Tyr Arg. Giu Giu Giy Asp Pro Arg Thr Arg Aia 
405 410 415 

Ser Asp Pro Gin Ser Pro Pro Gin Val Ser Arg His Lys Ser His Tyr 
420 425 430 

Arg Asn Arg Glu His Phe Ala Thr lie Arg Thr Ala Ser Leu Vai Thr 
435 440 445 

Arg Gin Met Gin Glu His Giu Gin Asp Ser Giu Leu Arg Glu Gin Met 
450 455 460 

Ser Giy Tyr Lys Arg Met Arg Arg Gin His Gin Lys Gin Leu Met Thr 
465 470 475 480 

Leu Giu Asn Lys Leu Lys Ala Glu Met Asp Giu His Arg Leu Arg Leu 
435 490 495 

Asp Lys Asp Leu Glu Thr Gin Arg Asn Asn Phe Ala Ala Glu Met Giu 
500 505 510 
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Lys Leu lie Lys Lys His Gin Aia Aia Met Glu Lys Giu Aia Lys Val 
5i5 520 525 

Met Ser Asn Giu Giu Lys Lys Phe Gin Gin His lie Gin Aia Gin Gin 
530 535 540 

Lys Lys Giu Leu Asn Ser Phe Leu Giu Ser Gin Lys Arg Glu Tyr Lys 

550 555 560 

Leu Arg Lys Giu Gin Leu Lys Giu Giu Leu Ash Giu Asn Gin Ser Thr 
565 570 575 

Pro Lys Lys Giu Lys Gin Giu Trp Leu Ser Lys Gin Lys Glu Asn lie 
580 585 590 

Gin His Phe Gin Aia Giu Giu Giu Aia Asn Leu Leu Arg Arg Gin Arg 
595 600 605 

Gin Tyr Leu Giu Leu Giu Cys Arg Arg Phe Lys Arg Arg Met Leu Leu 
610 615 620 

Gly Arg His Asn Leu Giu Gin Asp Leu Val Arg Glu Glu Leu Asn Lys 
625 630 635 640 

Arg Gin Thr Gin Lys Asp Leu Glu His Aia Met Leu Leu Arg Gin His 
645 650 655 

Giu Ser Met Gin Glu Leu Glu Phe Arg His Leu Asn Thr lie Gin Lys 
660 665 670 

Met Arg Cys Glu Leu lie Arg Leu Gin His Gin Thr Giu Leu Thr Asn 
675 680 685 

Gin Leu Giu Tyr Asn Lys Arg Arg Glu Arq Giu Leu Arg Arg Lys His 
690 695 700 

Val Met Glu Val Arg Gin Gin Pro Lys Ser Leu Lys Ser Lys Glu Leu 
"705 710 715 720 

Gin lie Lys Lys Gin Phe Gin Asp Thr Cys Lys lie Gin Thr Arg Gin 
725 730 735 

Tyr Lys Ala Leu Arg Asn His Leu Leu Glu Thr Thr Pro Lys Ser Glu 
740 745 750 

His Lys Ala Vai Leu Lys Arg Leu Lys Giu Giu Gin Thr Arg Lys Leu 
755 760 765 

Ala lie Leu Aia Giu Gin Tyr Asp His Ser lie Asn Glu Met Leu Ser 
770 775 780 

Thr Gin Aia Leu Arg Leu Asp Giu Aia Gin Glu Aia Giu Cys Gin Vai 
"785 790 795 800 

Leu Lys Met Gin Leu Gin Gin Giu Leu Glu Leu Leu Asn Aia Tvr Gin 
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805 810 815 

Ser Lys lie Lys Met Gin Ala Glu Ala Gin His Asp Arg Glu Leu Arg 
820 825 * 830 

Giu Leu Glu Gin Arg Vai Ser Leu Arg Arg Ala Leu Leu Glu Gin Lys 
835 840 845 

lie Glu Glu Glu Met Leu Ala Leu Gin Asn Glu Arg Thr Glu Arg lie 
850 855 860 

Arg Ser Leu Leu Glu Arg Gin Ala Arg Glu lie Glu Ala Phe Asp Ser 
865 870 875 880 

Glu Ser Met Arg Leu Gly Phe Ser Asn Met Val Leu Ser Asn Leu Ser 
885 890 895 

Pro Glu Ala Phe Ser His Ser Tyr Pro Gly Ala Ser Gly Trp Ser His 
■900 .905 ' 910 

Asn Pro Thr Gly Gly Pro Gly Pro His Trp Gly His Pro Met Gly Gly 
915 920 925 

Pro Pro Gin Ala Trp Gly His Pro Met Gin Gly Gly Pro Gin Pro Trp 
930 935 940 

Gly His Pro Ser Gly Pro Met Gin Gly Val Pro Arg Gly Ser Ser Met 
945 950 955 960 

Gly Val Arg Asn Ser Pro Gin Ala Leu Arg Arg Thr Ala Ser Gly Gly 
965 970 " 975 

Arg Thr Glu Gin Gly Met Ser Arg Ser Thr Ser Val Thr Ser Gin lie 
980 985 990 

Ser Asn Gly Ser His Met Ser Tyr Thr 
995 1000 



<210> 32 . 

<211> 25 

<212> DNA 

<2i3> TRKl 

<220> 
<223> "n" 

"y" 
»i ^ it 

<400> 32 
ctgaacccgg ngcnttygcn aargt 



stands for a, c, g or t. 
stands for c or t . 
stands for a or g . 
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<210> 33 
<211> 24 
<212> DNA 
<213> TRK4 

<220> 

<223> "n" stands for a, c, g or t. 
*'y" stands for c or t . 

<400> 33 

gctggatccy t cnggnggca tcca 



<210> 34 
<211> 23 
<212> DNA 
<213>.ROSl 

<220> 

<223> "n" stands for a, c, g or t. 

"y" stands for c or t. 

"r" stands for a or g. 

■<400> 34. 

gcnttyggng argtntayga- rgg 



<210> 35 
<211> 24 
<212> DNA 
<213> CCK4b 

<220> 

<223> "n" stands for a, c, g or t. 

"y" stands for c or t. 

"s" stands for c or g. 

"w" stands for a or t. 

<400> 35 
gcrggatccy tcnggnswca tcca 



<210> 36 
<211> 23 
<212> DNA 
<213> CCK4C 
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<220> 

<223> "n" stands for a, c, g or t . 
"y" stands for c or t . 
"r" stands for a or g. 

<400> 36 
gagttyggng argtnttyyt ngc 



<210> 37 

<211> 6 

<212> PRT 

<213> TRKl 

<400> 37 

Gly Ala Phe. .G.iy Lys Val 
1 5 



<210> 38 

<211> 5 

<212> PRT 

<213> TRK4 

<400> 38 

Trp Met Pro Pro- Giu 
1 5 



<210> 39 

<211> 8 

<212> PRT 

<213> ROSl 

<400> 39 

Ala Phe Gly Glu Val Tyr Glu Gly 
1 5 



<210> 40 

<211> 5 

<212> PRT 

<213> CCK4b 

<400> 40 

Trp Met Ser Pro Glu 
1 5 
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<210> 41 
<211> 8 
<2i2> PRT 
<213> CCK4C 

<400> 41 

Glu Phe Gly Giu Val Tyr Glu Gly 
1 5 



• <210> 42 
<211> 21 
<212>.DNA 

<213> Arrificiai Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 42 

cacagaaacg gtcagattca c 21 



<210> 43 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 43 

gatcagggtg acatcaaggg ac 22 



<210> 44 
<211> 22 
<212> DNA 

<213> Arrificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
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<400> 44 
ctcatctqta cacacttcat gg 



<210> 45 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic- acid molecule 
<400> 45 

gattcccaca ctgragatgt c 21 

<210> 46 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 46 

ggccctcgac tacatccacc acat 24 

<210> 47 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic a.id molecule 
<400> 47 

caacgaaact aacacagcar aagg 24 



<210> 48 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Synthesized nucleic acid molecule 
<4C0> 48 

atggcgaacg actctcccgc gaa 25 

<210> 49 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Synthesized nucleic acid molecule 
<4C0> 49 

acaccaaaa t c.aacaagtt t cacct c 2 6 



<210> 50 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 5.0 

agttacaagg aattccaagt tct 23 



<210> 51 
<211> 23 
<2i2> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 51 

atgaagagga agaaatcaaa ctg 23 



<210> 52 
<211> 20 
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<2i2> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 52 

agatggaccg tactgggagg 20 

<210> 53 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 53 

actttgtgca gctctgtggg 20 



<210> 54 
<211> 21 
<212> DNA 

<2i3> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 



<400> 54 

aaggttatgg atgtcacagg g 21 

<210> 55 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 55 

ctcacaagct tgccaacagg 20 
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<2i0> 56 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 56 

agtccccacc agaaggttta c 21 



<210> 57 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 57 
tcaggggtca gaggtcacg 



<210> 58 
<211> 21 
<212> DNA 

<2i3> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid mciecule 
<400> 58 

cccaaaccct accacaaatt c 21 



<210> 59 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 59 
cccccgggaa acgatgacca 
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<210> 60 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 60 
agccgctgcc ccucctctac rgt 



<210> 61 
<211> 23 
<212> DMA 
, <213> Artificial Sequence 

<220> 

<223> Synthesized nucleic acid molecule 
<400> 61 
accgcaacat cgccacctac zac 



<210> 62 
<211> 19 
<212> DNA 

<213> Arrificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 62 

ctcgacgtcg tggaccacc j^g 



<210> 63 
<211> 24 
<212> DNA 

<2i3> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 63 
caatgttaac ccactctatg tczc 
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<210> 64 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<4C0> 64 
agttrgccga tgrttttctt ttc 



<210> 65 
<211> 19 
<212> .DNA 

<213*> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 65 
ccgccatgaa ccccggctt 



<210> 66 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 66 

cgattgccaa agaccgtgtc a "21 



<210> 67 
<211> 23 
<212> DNA. 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 67 

agaagttgca gctgrtgaga gga 23 
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<210> 66 
<211> 21 
<212> DNA 

<2i3> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 68 

tatggcccgr gtaaggattt c 21 

<210> 69 
<211> 23 
<212> .DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 69 

gtgccagaag tgttgtgttg taa 23 

<210> 70 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthesized nucleic acid molecule 
<400> 70 

tattgaattg gcggaacgga ag 22 

<210> 71 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 71 

ttgt-rtgrg ctcattcttt ggag 24 
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<210> 72 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 72 
gattgctttg rgctcatrct ttgg 



<210> .73 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 73 

ttgtrctaag agtgccctcc g 2i 



<210> 74 
<211> 19 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 74 
aagaccatgc cgtgcgccg 



<210> 75 
<2li> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 75 
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attcczzaaq grtctggtra tgg 25 

<2i0> 76 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 76 

agtccgccaa gcctcaatg 19 



<210>-77 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 77 

caccttcagc tgctctatca t 21 



<210> 78 
<21i> 12 
<212> ?RT 

<213> Peptide 540A 
<400> 78 

His Gly Asp Pro Arg Pro Glu Pro Arg Pro Thr Gin 
15 10 



<210> 79 

<211> 10 

<212> PRT 

<213> Peptide 539A 

<400> 79 

Cys Leu Asp Phe Pro Lys Glu Asp Tyr Arg 
1 5 10^ 
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<2i0> 80 

<2il> 19 

<212> PRT 

<213> Peptide 541A 



<400> 80 

Asp Pro Arg Thr Arg Ala Ser Asp Pro Gin Ser Pro Pro Gin Val Ser 
15 10 15 

Arg His Lys 



<210> 81 
<211>..14 
<212> PRT 

<213> Peptide 542A 
<400> 81 

Pro Ser Thr Asn Arg Ala Gly Ser Leu Lys Asp Pro Giu Cys 
1 5 10 



<210> 82 

<211> 15 

<212> PRT 

<213> Peptide 554A 

<400> 82 

Cys Leu Val Pro Leu lie Gin Leu Tyr Arg Lys Gin Thr Ser Thr 
15 10 15 



<210> 33 
<211> IC 
<212> PRT 

<213> Peptide 579A 
<400> 83 

Cys Pro Leu Met Arg Gin Asn Arg Thr Arg 



1 



10 



<210> 84 
<211> 426 



BNSDOCID: <WO 9953036A2JA> 



wo 99/53036 PCT/US99/08 1 50 



<212> ?RT 
<213> STE:20h 



<400> 64 

Mez Ala His Leu Arg Giy Phe Aia Asn Gin His Ser Arg Val Asp Pro 
1 5 10 ^ 15 

Glu Giu Leu Phe Thr Lys Leu Asp Arg lie Gly Lys Giy Ser Phe Giy 
20 25 ' ' 30 

Glu Val Tyr Lys Giy lie Asp Asn His Thr Lys Giu Vai Val Aia lie 
35 40 45 

Lys lie lie Asp Leu Giu Giu Aia Glu Asp Glu lie Giu Asp lie Gin 
50 55 -60 

Gin Glu lie -Thr Vai Leu Ser Gin Cys Asp Ser Pro Tyr lie Thr Arg 
6^ 70 75 80 

Tyr Phe Giy Ser Tyr Leu Lys Ser Thr Lys Leu Trp lie lie Met Giu 
85 90 95 

Tyr Leu Giy Giy Giy Ser Aia Leu Asp Leu Leu Lys Pro Giy Pro Leu 
100 105 110 

Giu Glu Thr Tyr lie Aia Thr lie Leu Arg Giu lie Leu Lys Gly Leu 
115 120 125 

Asp Tyr Leu His Ser Giu Arg Lys lie His Arg Asp lie Lys Aia Aia 
130 135 140 

A.sn Val Leu Leu Ser Giu Gin Giy Asp Vai Lys Leu Aia Asn Phe Giy 

150 ' 155 ' 160 

Vai Aia Giy Gin Leu Thr Asp Thr Gin lie Lys Arg Asn Thr Phe Val 
165 170 175 

Gly Thr Pro Phe Trp Met Aia Pro Giu Vai lie Lys Gin Ser Aia Tyr 
180 185 - 190 

Asp Phe Lys Aia Asp lie Trp Ser Leu Giy lie Thr Aia lie Giu Leu 
195 200 205 

Aia Lys Gly Giu Pro Pro Asn Ser Asp Leu His Pro Met Arg Vai Leu 
210 215 220 

Phe Leu lie Pro Lys Asn Ser Pro Pro Thr Leu Giu Giy Gin His Ser 
225 230 235 240 

Lys Pro Phe Lys Giu Phe Vai Giu Aia Cys Leu Asn Lys Asp Pro Ara • 
245 250 255 

Phe Arg Pro Thr Ala Lys Giu Leu Leu Lys His Lys Phe lie Thr Arg 
260 265 270 
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Tyr Thr Lys Lys Thr Ser Phe Leu Thr Giu Leu lie Asp Arg Tyr Lys 
275 280 285 

Arg Trp Lys Ser Giu Giy His Gly Giu Giu Ser Ser Ser Giu Asp Ser 
290 295 300 

Asp lie Asp Gly Giu Ala Giu Asp Giy Giu Gin Gly Pro lie Trp Thr 
305 310 315 320 

Phe Pro Pro Thr lie Arg Pro Ser Pro His Ser Lys Leu His Lys Giy 
325 330 335 

Thr Ala Leu His Ser Ser Gin Lys Pro Ala Giu Pro Vai Lvs Arg Gin 
340 345 350 

Pro Arg Ser Gin Cys Leu Ser Thr Leu Vai Arg Pro Vai Phe Giy Giu 
355 360 365 

Leu Lys Giu Lys His Lys Gin Ser Gly Giy Ser Vai Gly Ala Leu Giu 
370 375 380 

Giu Leu Giu Asn Ala Phe Ser Leu Ala Giu Giu Ser Cys Pro Giy lie 
385 390 395 ^qq 

Ser Asp Lys Leu Met Vai His Leu Vai Giu Arg Vai Gin Arg Phe Ser 
405 410 " 415 

His Asn Arg Asn His Leu Thr Ser Thr Arg 
420 425 



<210> 85 
<211> 431 
<212> PRT 
<213> MST3 

<400> 85 

Met Ala His Ser Pro Vai Gin Ser Giy Leu Pro Giy Met Gin Asn Leu 
1 5 10 15 

Lys Ala Asp Pro Giu Giu Leu Phe Thr Lvs Leu Giu Lys lie Gly Lys 
20 25 " 30 

Giy Ser Phe Giy Giu Vai Phe Lys Giy lie Asp Asn Arg Thr Gin Lys 
35 40 45 

Vai Vai Ala lie Lys lie lie Aso Leu Giu Giu Ala Giu Asp Giu "^i^ 
50 55 60 

Giu Asp lie Gin Gin Giu He Thr Vai Leu Ser Gin Cys Asp Ser Pro 
^5 70 75 ' SO 



:-yr Vai Thr Lys Tyr Tyr Giy Ser Tyr Leu Lys Asp Thr Lys L 



eu Tro 
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lie lie Met Glu Tyr Leu 
100 

Pro Giy Pro Leu Asp Glu 
115 

Leu Lys Gly Leu Asp Tyr 
130 

lie Lys Ala Ala Asn Val 
145 ' 150 

Ala Asp Phe Gly Val Ala 
165 

Asn Thr Phe Vai Gly Thr 
130 

Gin Ser Ala Tyr Asp Ser 
195 

Ala lie Glu Leu Ala Arg 
210 

Met Lys Val Leu Phe Leu 
225* 230 

Gly Asn Tyr Ser Lys Pro 
245 

Lys Glu Pro S^sr Phe Arg 
2 60 

Phe He Leu Arg Asn Ala 
275 

Asp Arg Tyr Lys Arg Trp 
290 

Ser Glu Asp Ser Asp Ala 
305 ' 310 

Asp Ser Gly Asp Trp lie 
325 



78 
90 

Giy Gly Gly Ser Ala Leu 
105 

Thr Gin lie Ala Thr lie 
120 

Leu His Ser Glu Lys Lys 
135 140 

Leu Leu Ser Glu His Giy 
155 

Gly Gin Leu Thr Asp Thr 
170 

Pro Phe Trp Met Ala Pro 
185 

Lys Ala Asp lie Trp Ser 
200 

Gly Glu Pro Pro His Ser 
215 220 

lie Pro Lys Asn A.sn Pro 
235 

Leu Lys Glu Phe Val Glu 
250 

Pro Thr Ala Lys Glu Leu 
265 

Lys Lys Thr Ser Tyr Leu 
280 

Lys Ala Glu Gin Ser His 
295 300 

Glu Thr Asp Gly Gin Ala 
315 

Phe Thr lie Arg Glu Lys 
330 
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Asp Leu Leu Glu 
110 

Leu Arg Glu He 
125 

lie His Arg Asp 



Glu Val Lys Leu 
160 

Gin He Lys Arg 
175 

Glu Val lie Lys 
190 

Leu Gly lie Thr 
205 

Glu Leu His Pro 



Pro Thr Leu Glu 
240 

Ala Cys Leu Asn 
255 

Leu Lys His Lys 
270 

Thr Glu Leu He 
285 

Asp Asp Ser Ser 



Ser Gly Gly Ser 
320 

Asp Pro Lys Asn 
335 



Leu Glu Asn Giy Ala Leu Gin Pro Ser Asp Leu Asp Arg Asn Lvs Met 

340 345 ^ 350 

Lys Asp He Pro Lys Arg Pro Phe Ser Gin Cys Leu Ser Thr He He 

355 360 ^ -365 

Ser Pro Leu- Phe Ala Glu Leu Lys Glu Lys Ser Gin. Ala Cys Gly Gly 

370 375 380 
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Asn Leu Gly Ser lie Glu Glu Leu 
385 390 

Glu Ala Cys Pro Giy lie Ser Asp 
405 

Arg Leu Gin Arg Tyr Ser Leu Ser 
420 



Arg Giy Ala lie Tyr Leu Ala Glu 
395 400 

Thr Met Val Ala Gin Leu Val Gin 
410 415 

Gly Gly Gly Thr Ser Ser His 
425 430 



<210> 86 

<211> 443 

<212> PRT 

<213> T19A5.2_ce 

<220> 

<223> "Xaa" stands for any amino acid, 
<400> 86 

Met Thr Thr . Thr Ser Ser Asp Glu Leu Pro Arg Gin Ala Asp Aso Asp 
^ 5 10 15^ 

Ser Met Lys Trp Asp Arg lie Tyr lie Gin Lys Leu Asd Pro Glu Vai 
20 25 ' 30 

lie Phe Thr Lys Gin Glu Arg lie Gly Arg Gly Ser Phe Gly Glu Val 
35 40 45 

Tyr Lys Giy lie Asp Asn Arg Thr Glv Arg Vai Val Ala lie Lys lie 
50 55 60 

lie Asp Leu Glu Gin Ala Giu Asp Giu lie Giu Asp lie Gin Gin Giu 

^0 75 80 

lie Gin Val Leu Ser Gin Cys Asp Ser Gin Tyr Val Thr Lys Tyr Phe 
85 90 95 

Gly Ser Phe Leu Lys Giy Ser Lys Leu Trp lie lie Met Giu Tyr Leu 
100 105 lie 

Gly Gly Giy Ser Ala Leu Asp Leu Thr Lys Ser Giy Lys Leu Aso Giu 
115 120 ' 125 

Ser His lie Ala Val lie Leu Arq Giu lie Leu Lys Giy Leu Giu Tyr 
130 135 

Leu His Ser Glu Arg Lys lie His Arg Aso lie Lys Ala Ala Asn Vai 

150 155 160 

Leu Val Ser Giu His Giy Asp Vai Lys Val Ala Asp Phe Gly Val Ala 
165 170 175 
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Gly Gin Leu Thr Giu Thr Vai Lys Lys Arg lie Thr Phe Vai Gly Ser 
180 185 190 

Pro Phe Trp Met Ala Pro Giu Leu lie Lys Gin Ser Ser Tyr Asp Tyr 
195 200 205 

Lys Ala Asp lie Trp Ser Leu Gly He Thr Ala He Giu Leu Ala Asn 
210 . 215 220 

Gly Giu Pro Pro His Ser Asp Leu His Pro Met Arg Vai Leu Phe Leu 
225 230 235 240 

He Pro Lys Asn Pro Pro Pro Vai Leu Gin Gly Ser Gin Trp Ser Lys 
245 250 ' 255 

Pro Phe Lys Giu Phe Vai Giu Met Cys Leu Asn Lys Asp Pro Giu Asn 
260 265 270 

Arg Pro Ser. Ala Ser Thr Leu Leu Lys His Gin Phe He Lys Arg Ala 
275 280 ■ ' 285 

Lys Lys Asn Ser lie Leu Vai Asp Leu He Giu Arg Ala Ala Giu Tyr 
290 295 300 

Arg Leu Arg Thr Gly Vai Ser Ser Asp Ser Asp Leu Asp Giu Asp Ser 
305 310 315 320 

Asp Gly Gly Gly Gly Thr Ser Lys Trp Asp Tyr Pro Thr Vai Arg Gly 
325 330 335 

Pro Arg Vai Ser Ala Asp Asp Asp Gly Thr Vai Arg Gin Arg Thr Asp 
340 345 350 

Arg Pro Arg Ala Gin Vai Asp Arg Arg Ser Pro Ser Gly Ser Pro Gly 
355 360 365 

Gly Thr He Vai Arg Gly Ser Pro Gin Vai Ala Ala Vai Ala Giu Gin 
370 375 380 



Leu Arg Asn Ser Xaa Xaa Ala Leu Asp Gin Leu Arg His Vai Phe Arg 
385 390 395 400 

Asp Vai Giu Asp Ser Cys Pro Giy He Cys Asn Giu Leu He Giu Giu 
405 410 415 

Leu Met Gin Arg He Ala Vai Pro Gin Vai Ser Gin Ser Asp Leu Asp 
420 425 430 

Ala Ala He Arg Arg Leu Thr Thr Pro Pro Ser 
435 440 



<21C> 87 
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<211> 275 
<212> PRT 
<213> Pak_sp 

<400> 87 

Leu Leu Tyr Arg Asn Phe Val Lys lie Giy Gin Giy Ala Ser Giy Asp 
1 5 10 15 ' 

Vai Tyr Ser Ala Arg Gin Val Giy Thr Asn Leu Ser Val Aia lie Lys 
20 25 30 ' 

Lys Met Asn lie Asn Gin Gin Pro Lys Lys Giu Phe He Val Asn Glu 
35 40 45 

lie Leu Vai iyiet Lys Ser His His His Lys Asn He Val Asn Phe He 
50 55 60 

Asp Thr Phe, Phe Tyr Lys Ser Giu Leu Trp Met Val Met Glu Tyr Met 
o5 70 75 80 

Arg Giy Giy Ser Leu Thr Giu Vai Val Thr Asn Asn Thr Leu Ser Glu 
85 90 95 

Giy Gin He Aia Aia He Cys Lys Giu Thr Leu Giu Giy Leu Gin His 
100 105 110 

Leu His Giu Asn Giy lie Val His Arg Asp He Lys Ser Asp Asn He 
115 120 ' "* 125 

Leu Leu Ser Leu Gin Giy Asp He Lys Leu Thr Asp Phe Giy Phe Cys 
130 135 140 

Aia Gin lie Asp Ser Asn Met Thr Lys Arg Thr Thr Met Val Giy Thr 

150 155 160 

Pro Tyr Trp Met Ala Pro Giu Val Vai Thr Arg Lys Giu Tyr Giy Phe 
165 170 175 

Lys Vai Asp Vai Trp Ser Leu Giy He Met Aia He Giu Met Val Giu 
180 185 190 

Giy Giu Pro Pro Tyr Leu Asn Glu Asn Pro Leu Arg Ala Leu Tyr Leu 
195 200 205 

He Aia Thr He Giy Thr Pro Lys He Ser Arg Pro Glu Leu Leu Ser 
210 215 220 

Ser Val Phe His Asp Phe Leu Ser Lys Ser Leu Thr Val Asn Pro Lys 
225 230 235 240 

Gin Arg Pro Ser Ser Giy Giu Leu Leu Arg His Pro Phe Leu Lys Gin 
245 250 255 

Aia Vai Pro Val Ser Ser Leu He Pro Leu He Lys Ser He His His 
260 265 ' 270 
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Ser Gly Lys 
275 



<210> 88 

<211> 1109 

<212> PRT 

<213> ZC504.4_ce 

<400> 88 

Me- Ser Ser Ser Gly Leu Asp Glu He Asp Leu Asn Ser Leu Arg Asp 
1 5 10 15 

Pro Ala Gly He Phe Glu Leu He Glu Vai Val Gly Asn Gly Thr Tyr 
20 25 30 

Gly Gin Val Tyr Lys Gly Arg His Val Lys Thr Ala Gin Leu Ala Ala 
35 40 45 

lie Lys He Met Asn He Asn Glu Asp Glu Glu Asp Glu lie Lys Leu 
50 55 60 

Glu He Asn Met Leu Lys Lys His Ser His His Arg Asn Val Ala Thr 
65 70 75 80 

Tyr Tyr Gly Ala Phe He Lys Lys Leu Pro Ser Ser Thr Gly Lys His 
85 90 95 



Asp Gin Leu Trp 
100 

Asp Leu Val Lys 
115 

Ala Tyr lie Cys 
130 

Ser Lys Val He 
145 

Asp Ser Ala Glu 



Asp Lys Thr Vai 
180 

Met Ala Pro Glu 
195 

Asp Ser Arg Ser 
210 



Leu Val Met Glu 



Asn Thr Lys Gly 
120 

Arg Giu He Leu 
135 

His Arg Asp He 
150 

Val Lys Leu Val 
165 

Gly Arg Arg Asn 



Vai He Ala Cys 
200 

Asp Leu Trp Ser 
215 



Phe Cys Gly Ser 
105 

Gly Ser Leu Lys 



Arg Gly Leu Tyr 
140 

Lys Gly Gin Asn 
155 

Asp Phe Gly Val 
170 

Thr Phe He Gly 
185 

Asp Glu Ser Pro 



Leu Gly He Thr 
220 



Gly Ser He Thr 
110 

Glu Glu Trp He 
125 

His Leu His Gin 



Vai Leu Leu Thr 
160 

Ser Ala Gin Leu 
175 

Thr Pro Tyr Trp 
190 

Giu Ala Thr Tyr 
205 

Ala Leu Glu Met 
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Aia Glu Giy His Pro Pro Leu Cys Asp Met His Pro Met Arg Ala Leu 
225 230 235 240 

Phe Leu He Pro Arg Asn Pro Pro Pro Lys Leu Lys Arg Asn Lys Lys 
245 250 255 

Trp Thr Lys Lys Phe Glu Thr Phe He Glu Thr Val Leu Val Lys Asp 
260 265 270 

Tyr His Gin Arg Pro Tyr Thr Gly Ala Leu Leu Arg His Pro Phe He 
275 280 285 

Lys Glu Gin Pro His Glu Gin Thr He Arg His Ser He Lys Glu His 
290 295 300 

He Asp Arg Asn Arg Arg Val Lys Lys Asp Asp Ala Asp Tvr Glu Tyr 
305 310 315 ^ 320 

Ser Gly Ser .Glu Asp Asp Glu Pro Ser Pro Asn Asn Arg Asd Aso Ser 
325 330 ' 335 

Glu Ser Ser Ser Met He Pro Met Asp Asn Thr Leu Arg Lys Gly Phe 
340 345 . 350 

Gin Lys Leu Gin Glu Ser Ser Arg Gly Phe Ala Glu Pro Gly Ala Gin 
355 360 365 



Gin Leu Arg Arg 
370 

Gin Gin Ser Arg 
385 

Leu Arg Ala Val 



Pro Ala Ser Arg 
420 

Ala Pro His Leu 
435 

Glu Arg Glu Glu 
450 

Pro He Ala Arg 
465 

Lys Met Ser Glu 



Asp Val Leu Ala 
500 

Arg Ser Arg Glu 



Leu Pro Gin Gin 
375 

Tyr Val Glu Pro 
390 

Ser Ser Arg Giy 
405 

Pro Arg Pro Arg 



Ala Asp Leu Ala 
440 

Arg Arg Glu Arg 
455 

Val Ser Ala Ser 
470 

Pro Leu Leu He 
485 

Ser Glu Leu Ser 



Glu Ser Met Ser 



Pro Ala Pro Ala 
380 

Arg Arg Glu Ser 
395 

Ala Aia Asp Gly 
410 

Ser Pro Gin Gin 
425 

Asn Tyr Glu Lys 



Glu Arg Gin Ala 
4 60 

Val Pro Ala Pro 
475 

Thr His Val Lys 
490 

Lys Hec Gly Giy 
505 

Pro Pro Pro Pro 



Pro Phe Gin Tyr 



Ser Glu Val Lys 
400 

Pro Arg His Ser 
415 

Ser His Pro Ala 
430 

Arg Arg Arg Ser 
445 

His His Ala Me- 



Gin Gin Ser Arg 
480 

Pro Glu Asp Leu 
495 

His His Asn Gly 
510 

Ala Pro Pro Pro 
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515 520 525 

Arg Glu Ala Ser He Ser Ser He Thr Asp Thr He Asp Val Giy Glu 
530 ' 535 540 

Leu Asp Asn Giy Ala Asp Ala Glu Trp Asp Asp Leu Lys Asp lie Met 
345 550 555 ' 560 



Met Asn Giy Giu Giy Thr Leu Arg Giy Pro Asn Lys Pro Leu Pro Pro 

565 570 575 

Thr Pro Thr Asp Giy Giu Asn Thr Leu Val Ser Asp Val Arg Arg Asn 

580 585 590 



Giy Asn Giy Asn Ser Giy His Giy Ala Tyr Lys Giy Lys Lys He Pro 

595 600 605 

Glu He Arg Pro Giy He He Ser Leu Asp Asp Asp Asp Ser Asp Ser 

610 615 ' 620 

Asp Asn Glu Glu Giy Asn Glu Pro Leu Met Phe Lys Pro He Val Arg 

625 630 635 640 



Cys Pro Phe Ser 



Ser Val Asp Giy 
660 

Asn Ala Ser Ser 
675 

Fro Asp Leu Arg 
690 - 

Asp Arg Ala Asp 
705 

Ser Ser He Gin 



Val Giy Tyr Phe 
740 

Arg Pro Giy Arg 
755 

Pro Asn Ser Asn 
770 

Lys Tyr Lys Lys 
735 

Giy Vai Asn Leu 



He Phe Phe Trp 
645 

Ser He Pro Leu 



Ser Arg Giy Ala 
680 

Arg Gin He Asn 
695 

Glu Gin Pro Asn 
710 

His Ser Phe Ser 
725 

Giy Giy Giy Ala 



Pro Gin Asp He 
760 

Giy Thr Pro Ala 
775 

Lys Phe Ser Giy 
790 

Leu He Giy Thr 
805 



Phe Leu Ser Ala 
650 

Val Lys His Leu 
665 

Leu Pro Asp Leu 



Asp Gin Thr Arg 
700 

Giy Phe Gin Asn 
715 

Asn Arg Asp Arg 
730 

Giy Ala Giy Giy 
745 

Asn Gin Val Gin 



Glu Asn Asp Ala 
780 

Glu He Leu Cys 
795 

Asp Ser Giy Leu 
810 



Asn Val He His 
655 

He Trp Phe Gin 
670 

Leu Pro Lys Ser 
685 

Gin Mer Ser Asp 



Ser Asp Ser Arg 
720 

Glu Lys Ser Phe 
735 

Giy Thr Val Asn 
750 

Val Asn Vai Thr 
765 

Pro Glu He Arg 



Ala Ala Leu Trp 
800 

Met Leu Leu Asp 
815 
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Arg Ser Giy Gl-n Gly Lys Vai Tyr Pro Leu lie Ser Arg Arg Arg Phe 
32C 825 83C 

Asp Gin Met Thr Val Leu Giu Gly Gin Asn lie Leu Ala Thr He Ser 
835 840 845 

Gly Arg Lys Arg Arg He Arg Val Tyr Tyr Leu Ser Trp Leu Arg Gin 

850 855 860 

Lys He Leu Arg Thr Glu Gly Ala Gly Ser Ala Asn Thr Thr Glu Lys 
865 870 875 880 

Arg Asn Gly Trp Val Asn Val Giy Asp Leu Gin Gly Ala lie His Phe 
885 890 895 

Lys He Vai Arg Tyr Glu Arc He Lys Phe Leu Vai Val Gly Leu Glu 
- 900 905 910 



Ser Ser He Glu He Tyr Ala Trp Ala Pro Lys Pro Tyr His Lys Phe 
915 920 925 

Met Ser Phe Lys Ser Phe Gly Ser Leu Ser His Val Pro Leu He Val 
930 935 940 

Asp Leu Thr Val Glu Asp Asn Ala Arg Leu Lys Val Leu Tyr Gly Ser 
945 950 955 960 

Thr Gly Giy Phe His Ala He Asp Leu Asp Ser Ala Ala Val Tyr Asp 
965 970 975 

He Tyr Thr Pro Ala Gin Ser Giy Gin Thr Thr Thr Pro His Cys He 
980 985 990 

Val Vai Leu Pro Asn Ser Asr. Giy Met Gin Leu Leu Leu Cys Tyr Asp 
995 1000 1005 

Asn Glu Gly Val Tyr Val Asn Thr Tyr Gly Arg Met Thr Lys Asn Val 
1010 1015 ' 1020 

Val Leu Gin Trp Gly Giu Met Pro Ser Ser Val Ala Tyr He Ser Thr 
1025 1030 1035 1040 

Gly Gin He Met Gly Trp Giy Asn Lys Ala He Glu He Arg Ser Val 
1045 1050 1055 

Asp Thr Giy His Leu Asp Giy Vai Phe Met His Lys Lys Ala Gin Lys 
1060 1065 1070 

Leu Lys Phe Leu Cys Giu Arg Asn Asp Lys Val Phe Phe Ser Ser Ala 
1075 1030 1085 

Lys Giy Gly Gly Ser Cys Gin He Tyr Phe Met Thr Leu Asn Lys Pro 
1090 1095 1100 
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Gly Leu Thr Asn Trp 
1105 



<210> 89 
<211> 1233 
<212> PRT 
<213> NIK_m 

<400> 89 

Met Ala Asn Asp Ser Pro Ala Lys Ser Leu Val Asp lie Asp Leu Ser 
1 5 10 ' 15 

Ser Leu Arg Asp Pro Ala Gly lie Phe Glu Leu Val Glu Val Val Gly 
20 25 30 

Asn Gly Thr Tyr Gly Gin Val Tyr Lys Gly Arg His Val Lys Thr Val 
35 40 45 

Thr Ala Ala He Lys Val Met Asp Val Thr Glu Asp Glu Glu Glu Glu 
50 55 60 

lie Thr Leu Glu lie Asn Met Leu Lys Lys Tyr Ser His His Arg Asn 
65 70 75 80 

lie Ala Thr Tyr Tyr Gly Ala Phe lie Lys Lys Ser Pro Pro Glv His 
85 90 95' 

Asp Asp Gin Leu Trp Leu Val Mer Glu Phe Cys Gly Ala Giy Ser He 
100 105 " ' 110 

Thr Asp Leu Val Lys Asn Thr Lys Gly Asn Thr Leu Lvs Glu Asp Trp 
115 120 125 

He Ala Tyr lie Ser Arg Glu He Leu Arg Gly Leu Ala His Leu His 
130 135 140 

He His His Val He His Arg Asp He Lys Gly Gin Asn Val Leu Leu 

150 155 160 

Thr Glu Asn Ala Glu Val Lys Leu Val Asp Phe Gly Val Ser Ala Gin 
165 170 175 

Leu Asp A.rg Thr Val Giy Arc Arg Asn Thr Phe He Gly Thr Pro Tyr 
180 185 * 190 

Trp Met Ala Pro Glu Val He Ala Cys Asp Glu Asn Pro Aso Ala Thr 
195 200 205 

Tyr Asp Tyr Arg Ser Asp Leu Trp Ser Cys Glv He Thr Ala He Glu 
210 215 ' 220 

Me- Ala Glu Gly Gly Pro Pro Leu Cys Asp Met His Pro Met Ara Ala 
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225 230 235 240 

Leu ?he Leu He Pro Arg Asn Pro Pro Pro Arg Leu Lys Ser Lys Lys 
245 250 255 

Trp Ser Lys Lys Phe Phe Ser Phe lie Giu Gly Cys Leu Val Lys Asn 
260 265 270 

Tyr Met Gin Arg Pro Ser Thr Glu Gin Leu Leu Lys His Pro Phe He 
275 280 285 



Arg Asp Gin Pro Asn Giu Arg Gin Val Arg lie Gin Leu Lys Asp His 
290 295 300 

lie Asp Arg Thr Arg Lys Lys Arg Giy Giu Lys Asp Glu Thr Giu Tyr 

305 310 315 320 

Giu Tyr Ser .Gly Ser Giu Giu Glu Glu Glu Giu Val Pro Giu Gin Glu 
325 330 335 

Gly Giu Pro Ser Ser lie Val Asn Val Pro Giy Giu Ser Thr Leu Arg 
340 345 350 

Arg Asp Phe Leu Arg Leu Gin Gin Giu Asn Lys Giu Arg Ser Giu Ala 

355 360 365 

Leu Arg Arg Gin Gin Leu Leu Gin Glu Gin Gin Leu Arg Glu Gin Glu 
370 375 380 



Glu Tyr Lys Arg 
385 

Gin Lys Giu Gin 



Giu Ala Arg Arg 
420 

Giu Lys Arg Arg 
435 

Giu Arg Arg Arg 
450 

Giu Tyr lie Arg 

465. 

Leu Gin Gin Gin 



Arg Arg Pro His 
500 

Arg Ser Lys Pro 
515 



Gin Leu Leu Ala 
390 

Arg Arg Arg Leu 
405 

Gin Gin Giu Arg 



Leu Giu Giu Leu 
440 

Ala Giu Glu Giu 
455 

Arg Gin Leu Glu 
470 

Leu Leu Gin Giu 
485 

Ala Gin Gin Gin 



Ser Phe His Ala 
520 



Giu Arg Gin Lys 
395 

Glu Glu Gin Gin 
410 

Giu Gin Arg Arg 
425 

Giu Arg Arg Arg 



Lys Arg Arg Val 
460 

Glu Giu Gin Arg 
475 

Gin Ala Met Leu 
490 

Pro Pro Pro Pro 
505 

Pro Glu Pro Lvs 



Arg lie Glu Gin 
400 

Arg Arg Glu Arg 
415 

Arg Giu Gin Giu 
430 

Lys Giu Giu Giu 
445 

Glu Arg Glu Gin 



His Leu Giu He 
480 

Leu His Asp His 
495 

Gin Gin Gin Asp 
510 

Pro His Tyr Asp 
52 5 
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Pro Aia Asp Arg Ala Arg Glu Vai Gin Trp Ser His Leu Ala Ser Leu 

530 535 ' 540 

Lys Asn Asn Val Ser Pro Val Ser Arg Ser His Ser Phe Ser Asp Pro 

545 550 555 ' 560 



Ser Pro Lys Phe Ala His His His Leu Arg Ser Gin Asd Pro Cys Pro 
565 570 ' 575 

Pro Ser Arg Ser Glu Gly Leu Ser Gin Ser Ser Asp Ser Lys Ser Glu 
580 585 590 

Val Pro Glu Pro Thr Gin Lys Ala Trp Ser Arg Ser Asp Ser Asp Glu 
595 600 605 

Val Pro Pro Arg Val Pro Val Arg Thr Thr Ser Arg Ser Pro Val Leu 
610 615 620 

Ser Arg Arg Asp Ser Pro Leu Gin Gly Gly Gly Gin Gin Asn Ser Gin 
625 630 635 640 

Ala Gly Gin Arg Asn Ser Thr Ser Ser lie Glu Pro Arg Leu Leu Trp 
645 650 655 

.Glu Arg Val Giu Lys Leu Val Pro Arg Pro Gly Ser Gly Ser Ser Ser 
660 665 670 

Gly Ser Ser Asn Ser Gly Ser Gin Pro Gly Ser His Pro Gly Ser Gin 
675 680 685 

Ser Gly Ser Gly Glu Arg Phe Arg Val Arg Ser Ser Ser Lys Ser Glu 
690 695 700 

Gly Ser Pro Ser Pro Arg Gin Glu Ser Aia Ala Lys Lys Pro Asp Asp 
705 710 715 ' 720 

Lys Lys Glu Val Phe Arg Ser Leu Lys Pro Ala Gly Glu Val Asp Leu 
725 730 735 

Thr Ala Leu Aia Lys Glu Leu Arg Ala Val Giu Aso Val Arg Pro Pro 
740 745 ' 750 

His Lys Val Thr Asp Tyr Ser Ser Ser Ser Glu Glu Ser Gly Thr Thr 
755 760 765 

Asp Glu Glu Glu Glu Asp Val Glu Gin Glu Gly Ala Asp Aso Ser Thr 
770 775 780 

Ser Gly Pro Glu Asp Thr Arg Ala Ala Ser Ser Pro Asn Leu Ser Asn 
785 790 795 800 

Gly Glu Thr Glu Ser Val Lys Thr Met lie Val His Aso Aso Val Glu 
805 810 " ' 815 
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Ser Glu Pro Ala Met Thr Pro Ser Lys Giu Gly Thr Leu lie Val Arc 
820 825 830 

Gin Thr Gin Ser Ala Ser Ser Thr Leu Gin Lys His Lys Ser Ser Ser 
835 840 845 

Ser Phe Thr Pro Phe He Asp Pro Arg Leu Leu Gin lie Ser Pro Ser 
850 855 860 

Ser Gly Thr Thr Val Thr Ser Val Val Gly Phe Ser Cys Asp Gly Leu 
865 870 875 ^ 880 

Arg Pro Giu Ala He Arg Gin Asp Pro Thr Arg Lys Gly Ser Val Val 
885 890 895 

Asn Val Asn Pro Thr Asn Thr Arg Pro Gin Ser Asp Thr Pro Glu He 
900 905 910 

Arg Lys Tyr Lys Lys Arg Phe Asn Ser Glu He Leu Cys Ala Ala Leu 
915 920 925 

Trp Gly Val Asn Leu Leu Val Gly Thr Glu Ser Gly Leu Met Leu Leu 
930 935 940 

Asp Arg Ser Gly Gin Gly Lys Val Tyr Pro Leu He Ser Arg Arg Arg 
^45 950 955 960 

Phe Gin Gin Met Asp Vai Leu Giu Gly Leu Asn Val Leu Val Thr He 
965 970 975 

Ser Gly Lys Lys Asp Lys Leu Arg Vai Tyr Tyr Leu Ser Tro Leu Arg 
980 985 990 

Asn Lys He Leu His Asn Asp Pro Glu Val Glu Lys Lys Gin Gly Trp 
595 1000 " 1005 

Thr Thr Vai Gly Asp Leu Glu Gly Cys Vai His Tyr Lys Val Val Lys 
1010 1015 1020 

Tyr Glu Arg He Lys Phe Leu Val He Ala Leu Lys Ser Ser Val Glu 
1025 1030 1035 1040 

Val Tyr Ala Trp Ala Pro Lys Pro Tyr His Lys Phe Met Ala Phe Lys 
1045 1050 1055 

Ser Phe Gly Glu Leu Leu His Lys Pro Leu Leu Vai Asp Leu Thr Vai 
1060 1065 ' 1070 

Glu Glu Gly Gin Arg Leu Lys Vai lie Tyr Gly Ser Cys Ala Glv Phe 
1075 1080 ' 1085 

His Ala Vai Asp Vai Asp Ser Gly Ser Vai Tyr Asp lie Tyr Leu Pro 
1090 1095 1100 
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Thr His lie Gin Cys Ser lie Lys Pro His Aia lie lie lie Leu Pro 

1110 1115 1120 

Asn Thr Asp Gly iMet Glu Leu Leu Val Cys Tyr Glu Asp Glu Gly Vai 
1125 1130 1135 

Tyr Val Asn Thr Tyr Giy Arg lie Thr Lys Asp Val Val Leu Gin Trp 
1140 1145 1150 

Gly Glu Met Pro Thr Ser Val Ala Tyr lie Arg Ser Asn Gin Thr Met 
1155 1160 1165 

Gly Trp Gly Glu Lys Ala lie Glu lie Arg Ser Val Glu Thr Gly His 
1170 1175 1180 

Leu Asp Gly Val Phe Met His Lys Arg Ala Gin Arg Leu Lys Phe Leu 
1185 1190 1195 1200 

Cys Gly Arc Asn Asp Lys Val Phe Phe Ser Ser Val Arg Ser Gly Giy 
1205 1210 1215 

Ser Ser Gin Val Tyr Phe Met Thr Leu Gly Arg Thr Ser Leu Leu Se^- 
1220 1225 1230 

Trp 



<210> 90 
<211> 982 
<212> PRT 
<2i3> SULU_ce 

<40C> 90 

Met Aia Pro Ala Vai Leu Asp Lys Pro Gly Vai lie Lys Asp Pro Ser 
1 5 10 15 

lie Ala Ala Leu Phe Ser Asn Lys Asp Pro Glu Gin Ara Tvr Gin Asp 
20 25 ^ 30 ' 

Leu Arg Glu lie Gly His Giy Ser Phe Gly Ala Val Tyr Phe Ala Tyr 
35 40 45 

Asp Lys Lys Asn Giu Gin Thr Val Ala lie Lvs Lys Met Asn Phe 3e^ 
50 55 "60 

Gly Lys Gin Ala Val Glu Lys Trp Asn Asp lie Leu Lvs Giu Va'' Ser 
^5 70 75 ' 30 

Phe Leu Asn Thr vai Val His Pro His lie Val Asp Tyr Lys Ala Cvs 
85 90 ' ' 95 ' 

Phe Leu Lys Asp Thr Thr Cys Trp Leu Vai Met Giu Tyr Cvs -le Gly 
100 105 lio 
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Ser Ala Ala Asp lie Val Asp Val Leu Arg Lys Giy Met Arg Giu Vai 
115 120 125 

Giu lie Ala Ala lie Cys Ser Gin Thr Leu Asp Aia Leu Arg Tyr Leu 
130 135 140 

His Ser Leu Lys Arg lie His Arg Asp lie Lys Aia Giy Asn lie Leu 
1^5 150 155 160 

Leu Ser Asp His Ala lie Val Lys Leu Ala Asp Phe Giy Ser Ala Ser 
165 170 175 

Leu Val Asp Pro Ala Gin Thr Phe lie Giy Thr Pro Phe Phe Met Ala 
180 185 190 

Pro Giu Vai lie Leu Aia Met Asp Giu Giy His Tyr Thr Asp Arg Al'a 
195 200 205 

Asp He Trp Ser Leu Giy lie Thr Cys He Giu Leu Ala Giu Arg Arg 
210 215 220 

Pro Pro Leu Phe Ser Met Asn Ala Met Ser Aia Leu Tyr His lie Aia 
225 230 235 240 

Gin Asn Asp Pro Pro Thr Leu Ser Pro lie Asp Thr Ser Giu Gin Pro 
245 250 255 

Giu Trp Ser Leu Giu Phe Vai Gin Phe He Asp Lys Cys Leu Arg Lys 
260 265 270 

Pro Aia Giu Giu Arg Met Ser Aia Giu Giu Cys Phe Arg His Pro Phe 
275 280 285 

He Gin Arg Ser Arg Pro Ser Asp Thr He Gin Giu Leu He Gin Arc 
290 295 300 

Thr Lys Asn Met Vai Leu Giu Leu Aso Asn Phe Gin Tyr Lys Lys Met 
305 310 315 320 

Arg Lys Leu Met Tyr Leu Asp Giu Thr Giu Giy Lys Giu Glv Ser Giu 
325 330 " 335 

Giy Asn Giy Ala Ser Asp Asp Leu Asp Phe His Giy Asn Giu Ala Asn 
340 345 350 

Ser He Giy Arg Aia Giy Asp Ser Aia Ser Ser Arg Ser Ala Ser Leu 
355 "360 365 

Thr Ser Phe Arg Ser Met Gin Ser Ser Giy Giy Aia Giy Leu Leu Val 
370 375 380 

Ser Thr Asn Thr Thr Giy Ala Met Asp Asn Vai His Giy Ser Ser Giy 
335 390 395 400 

Tyr Giy Asn Giy Ser Ser Ser Thr Thr Ser Ser Aia Arg Arg Arg Pro 
405 410 415 
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Pro lie Pro Ser Gin Met Leu Ser Ser Thr Ser Thr Ser Giy Vai Giy 
420 425 430 

Thr Mez Pro Ser His Giy Ser Vai Giy Ala Ser lie Thr Aia lie Aia 
435 440 445 

Vai Asn Pro Thr Pro Ser Pro Ser Glu Pro lie Pro Thr Ser Gin Pro 
450 455 460 

Thr Ser Lys Ser Giu Ser Ser Ser lie Leu Giu- Thr Aia His Asp Asp 
465 470 475 480 

Pro Leu Asp Thr Ser lie Arg Aia Pro Vai Lys Asp Leu His Met Pro 
485 490 495 

His Arg Aia Vai Lys Giu Arg lie Aia Thr Leu Gin Asn His Lys Phe 
500 505 510 

Aia Thr Leu Arg Ser Gin Arg lie lie Asn Gin Giu Gin Giu Giu Tyr 
515 520 525 

Thr Lys Giu Asn Asn Met Tyr Glu Gin Met Ser Lys Tyr Lys His Leu 
530 535 540 

Arg Gin Ala His His Lys Glu Leu Gin Gin Phe Giu Giu Arg Cys Aia 

550 555 560 

Leu Asp Arg Glu Gin Leu Arg Vai Lys Met Asp Arg Giu Leu Giu Gin 
565 570 " 575 

Leu Thr Thr Thr Tyr Ser Lys Giu Lys Met Arg Vai Arg Cys Ser Gin 
580 585 " 590 

Asn Asn Giu Leu Asp Lys Arg Lys Lys Asp lie Giu Asp Giy Giu Lys 
595 600 605 

Lys Met Lys Lys Thr Lys Asn Ser Gin Asn Gin Gin Gin Met Lys Leu 
610 615 620 

Tyr Ser Aia Gin Gin Leu Lys Giu Tyr Lys Tyr Asn Lvs Giu A.ia Gin 
625 630 635 ' 640 



Lys Thr Arg Leu Arg Ser Leu Asn Met Pro Arg Ser Thr Tyr Giu Asn 
645 650 655 

Aia Met Lys Giu Vai Lys Aia Asp Leu Asn Arg Vai Lys Asp Ala Arg 
660 665 ' 670 

Glu Asn Asp Phe Asp Giu Lys Leu Arg Ala Giu Leu Giu Asp Giu lie 
675 680 685 

Vai Arg Tyr Arg Arg Gin Gin Leu Ser Asn Leu His Gin Leu Giu Giu 
690 695 700 
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Gin Leu Asp Asp Glu Asp Val Asn Val Gin Giu Arg Gin Met Asp Thr 
"705 710 715 " 720 

Arg His Giy Leu Leu Ser Lys Gin His Glu Met Thr Arg Asc Leu Giu 
725 730 ^ 735 

He Gin His Leu Asn Giu Leu His Ala Met Lys Lys Arg His Leu Glu 
740 745 750 

Thr Gin His Giu Ala Glu Ser Ala Ser Gin Asn Giu Tyr Thr Gin Arg 
755 760 765 

Gin Gin Asp Giu Leu Arg Lys Lys His Ala Met Gin Ser Arg Gin Gin 
"770 775 780 

Pro Arg Asp Leu Lys lie Gin Giu Ala Gin He Arg Lys Gin Tyr Arg 
785 790 795 aoc 

Gin Val Val Lys Thr Gin Thr Arg Gin Phe Lys Leu Tyr Leu Thr Gin 
805 810 815 

Met Val Gin Val Val Pro Lys Asp Glu Gin Lys Giu Leu Thr Ser Arg 
820 825 830 

Leu Lys Gin Asp Gin Met Gin Lys Val Ala Leu Leu Ala Ser Gin Tyr 
835 840 845 

Giu Ser Gin He Lys Lys Met Val Gin Asp Lys Thr Val Lys Leu Giu 
850 855 860 

Ser Trp Gin Giu Asp Glu Gin Arg Val Leu Ser Giu Lvs Leu Glu Lys 
865 870 875 ^ 880 

Glu Leu Glu Giu Leu He Ala Tyr Gin Lys Lys Thr Arg Ala Thr Leu 
885 890 ^ 895 

Giu Glu Gin lie Lys Lys Glu Arg Thr Ala Leu Giu Glu Arg lie Giy 
900 905 910 



Thr Arg Arg Ala Met Leu Giu Gin Lys He He Giu Giu Arg Giu Gin 
915 920 925 

Met Giy Glu Met Arg Arg Leu Lys Lys Glu Gin He Arg Asp Arg His 
930 935 940 

Ser Gin Glu Arg His Arg Leu Giu Asn His Phe Val Arg Thr Giy Ser 

950 955 960 

Thr Ser Arg Ser Ser Giy Giy He Ala Pro Giy Vai Giy Asn Ser Ser 
965 970 975 

Ser He Gin Met Ala Met 
980 



BNSOOCiD: <WO 99S3036A2_fA> 



wo 99/53036 



PCT/US99/08150 



94 



<210> 91 
<211> 842 
<212> PRT 
<213> KHSl_h 

<400> 91 

Leu Arg Pro Ala Ala Asp lie Leu Arg Arg Asn Pro Gin Gin Asp Tyr 
15 10 

Glu Leu Val Gin Arg Val Gly Ser Gly Thr Tyr Gly Asd Val Ty- Lys 
20 25 " 30 

Ala Arg Asn Val His Thr Gly Glu Leu Ala Ala Val Lys lie He Lys 



35 40 



45 



Leu Glu Pro Gly Asp Asp Phe Ser Leu lie Gin Gin Glu lie Phe Met 
. ^° 55 60 

Val Lys Glu Cys Lys His Cys Asn He Val Ala Tyr Phe Gly Ser Tv- 

65 7n -IK 

75 go 

Leu Ser Arg Glu Lys Leu Trp He Cys Met Glu Tyr Cys Gly Glv Glv 
85 90 95 

Ser Leu Gin Asp He Tyr His Val Thr Gly Pro Leu Ser Glu Leu Gin 
IOC 105 110 

He Ala Tyr Val Cys Arg Glu Thr Leu Gin Gly Leu Ala Tyr Leu His 
115 120 125 

Thr Lys Gly Lys Met His Arg Asp He Lys Gly Ala Asn He Leu Leu 
130 135 140 

Thr Asp His Gly Asp Val Lys Leu Ala Asp Phe Gly Val Ala Ala Lys 

150 155 160 

He Thr Ala Thr He Ala Lys Arg Lys Ser Phe He Gly Thr Pro Tyr 
165 170 175 

Trp Met Ala Pro Glu Val Ala Ala Val Glu Lys Asn Gly Glv Tyr Asn 
180 185 198 

Gin Leu Cys Asp He Trp Ala Val Gly He Thr Ala He Glu Leu Gly 
195 200 205 

Glu Leu Gin Pro Pro Met Phe Asp Leu His Pro Met Arg Ala Leu Phe 
210 215 220 



Leu Met Ser Lys Ser Asn Phe Gin Pro Pro Lys Leu Lys Asp Lys Thr 

240 



225 230 235 



Lys Trp Ser Ser Thr Phe His Asn Phe Val Lys He Ala Leu Thr Lys 
245 250 255 
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Asn Pro Lys Lys Arg Pro Thr Ala Glu Arg Leu Leu Thr His Thr Phe 
260 265 ■ 270 

Val Ala Gin Pro Gly Leu Ser Arg Ala Leu Ala Val Glu Leu Leu Asp 
275 280 285 

Lys Val Asn Asn Pro Asp Asn His Ala His Tyr Thr Glu Ala Asp Asp 
290 295 300 

Asp Asp Phe Glu Pro His Ala He He Arg His Thr He Arg Ser Thr 

310 315 320 

Asn Arg Asn Ala Arg Ala Glu Arg Thr Ala Ser Glu He Asn Phe Asp 
325 330 335 

Lys Leu Gin Phe Glu Pro Pro Leu Arg Lys Glu Thr Glu Ala Arg Asp 
340 345 350 

Glu Met Gly Leu Ser Ser Asp Pro Asn Phe Met Leu Gin Trp Asn Pro 
355 360 365 

Phe Val Asp Gly Ala Asn Thr Gly Lys Ser Thr Ser Lys Arg Ala He 
370 375 

Pro Pro Pro Leu Pro Pro Lys Pro Arg He Ser Ser Tyr Pro Glu Asp 
385 390 

-^^^ 395 400 

Asn Phe Pro Asp Glu Glu Lys Ala Ser Thr He Lys His Cys Pro Asp 
405 410 415 

Ser Glu Ser Arg Ala Pro Gin He Leu Arg Arg Gin Ser Ser Pro Ser 
420 425 430 

Cys Gly Pro Val Ala Glu Thr Ser Ser He Gly Asn Gly Asp Gly H- 
435 440 445 

Ser Lys Leu Met Ser Glu Asn Thr Glu Gly Ser Ala Gin Ala Pro Gin 
''50 455 460 

Leu Pro Arg Lys Asn Asp Lys Arg Asp Phe Pro Lys Pro Ala He Asn 

^70 475 480 

Gly Leu Pro Pro Thr Pro Lys Val Leu Met Gly Ala Cys Phe Ser Lys 
485 490 495 

Val Phe Asp Gly Cys Pro Leu Lys He Asn Cys Ala Thr Ser Trp He 
500 505 5j^Q 

His Pro Asp Thr Lys Asp Gin Tyr He He Phe Gly Thr Glu Asp Gly 
515 520 525 

He Tyr Thr Leu Asn Leu Asn Glu Leu His Glu Ala Thr Met Glu Gin 
530 535 
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Leu Phe Pro Arg Lys Cys Thr Trp Leu Tyr Val He Asn Asn Thr Leu 

550 555 

Met Ser Leu Ser Giu Gly Lys Thr Phe Gin Leu Tyr Ser His Asn Leu 
565 570 575 

He Aia Leu Phe Glu His Ala Lys Lys. Pro Gly Leu Ala Ala His lie 
580 585 590 

Gin Thr His Arg Phe Pro Asp Arg lie Leu Pro Arg Lys Phe Ala Leu 
595 600 605 

Thr Thr Lys He Pro Asp Thr Lys Gly Cys His Lys Cvs Cys He Val 
610 615 620 

Arg Asn Pro Tyr Thr Gly His Lys Tyr Leu Cys Gly Ala Leu Gin Ser 

"0 635 640 

Gly He Val Leu Leu Gin Trp Tyr Glu Pro Met Gin Lvs Phe Met Leu 
• 645 650 655 

He Lys His Phe Asp Phe Pro Leu Pro Ser Pro Leu Asn Val Phe Glu 
660 665 670 

Met Leu Val He Pro Glu Gin Glu Tyr Pro Met Val Cys Val Ala He 
675 680 685 

Ser Lys Gly Thr Glu Ser Asn Gin Val Val Gin Phe Glu Thr He Asn 
690 695 700 

Leu Asn Ser Ala Ser Ser Trp Phe Thr Glu He Gly Ala Gly Ser Gin 
'05 7 1 n TIC 

715 720 

Gin Leu Asp Ser He His Val Thr Gin Leu Glu Arg Asp Thr Val Leu 
725 730 735 

Val Cys Leu Asp Lys Phe Val Lys He Val Asn Leu Gin Gly Lys Leu 
740 745 750 

Lys Ser Ser Lys Lys Leu Ala Ser Glu Leu Ser Phe Asp Phe Arg He 
755 760 765 

Glu Ser Val Val Cys Leu Gin Asp Ser Val Leu Ala Phe Trp Lys His 
770 775 

Gly Met Gin Gly Lys Ser Phe Lys Ser Asp Glu Val Thr Gin Glu He 



785 . 790 



800 



Ser Asp Glu Thr Arg Val Phe Arg Leu Leu Gly Ser Asp Arg Val Vai 
805 810 815 

Val Leu Giu Ser Arg Pro Thr Glu Asn Pro Thr Ala His Se- ;^sn Leu 
820 825 830 

Tyr He Leu Ala Gly His Glu Asn Ser Tyr 
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835 840 



<210> 92 
<211> 911 
<212> PRT 
<213> LOK m 



<400> 92 

Met Ala Phe Ala Asn Phe Arg Arg lie Leu Arg Leu Ser Thr Phe Glu 
15 10 15 

Lys Arg Lys Ser Arg Glu Tyr Glu His Val Arg Arg Asp Leu Asp Pro 
20 25 30 

Asn Asp Val Trp Glu lie Val Gly Glu Leu Gly Asp Gly Ala Phe Gly 
35 40 '45 

Lys Val Tyr Lys Ala Lys Asn Lys Glu Thr Gly Ala Leu Ala Ala Ala 
50 55 60 



Lys Val lie Glu Thr Lys Ser Glu Glu Glu Leu Glu Asp Tyr lie Val 

70 75 80 

Glu' He Glu He Leu Ala Thr Cys Asp His Pro Tyr He Val Lys Leu 
85 90 95 

Leu Gly Ala Tyr Tyr Tyr Asp Gly Lys Leu Trp He Met He Glu Phe 
100 105 110 

Cys Pro Gly Gly Ala Val Asp Ala He Met Leu Glu Leu Asp Arg Gly 
115 120 125 

Leu Thr Glu Pro Gin He Gin Val Val Cys Arg Gin Met Leu Glu Ala 
130 135 140 

Leu Asn Phe Leu His Gly Lys Arg He He His Arg Asp Leu Lys Ala 

150 155 160 

Gly Asn Val Leu Met Thr Leu Glu Gly Asp He Arg Leu Ala Asp Phe 
165 170 175 

Gly Val Ser Ala Lys Asn Leu Lys Thr Leu Gin Lys Arg Asp Ser Phe 
180 185 190 

He Gly Thr Pro Tyr Trp Met Ala Pro Glu Val Val Leu Cys Glu Thr 
195 200 205 

Met Lys Asp Ala Pro Tyr Asp Tyr Lys Ala Asp He Trp Ser Leu Gly 
210 215 220 

He Thr Leu He Glu Met Ala Gin He Glu Pro Pro His His Glu Leu 
225 230 235 240 
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Asn Pro Met Arg Val Leu Leu Lys lie Ala Lys Ser Asp Pro Pro Thr 
245 25C 255 

Leu Leu Thr Pro Ser Lys Trp Ser Val Glu Phe Arg Asd Phe Leu Lys 
260 265 ' 270 

He Ala Leu Asp Lys Asn Pro Glu Thr Arg Pro Ser Ala Ala Gin Leu 
275 280 285 

Leu Gin His Pro Phe Val Ser Arg Val Thr Ser Asn Lys Ala Leu Arg 
290 295 300 

Glu Leu Val Ala Glu Ala Lys Ala Giu Val Met Glu Glu He Glu Asp 
305 310 315 320 

Gly Arg Glu Asp Gly Giu Giu Glu Asp Ala Val Asp Ala Val Pro Pro 
325 330 335 

Leu Val Asn His Thr Gin Asp Ser Ala Asn Val Thr Gin Pro Ser Leu 
340 345 350 

Asp Ser Asn Lys Leu Leu Gin Asp Ser Ser Thr Pro Leu Pro Pro Ser 
355 360 365 

Gin Pro Gin Giu Pro Val Asn Gly Pro Cys Ser Gin Pro Ser Gly Asp 
370 375 380 

Gly Pro Leu Gin Thr Thr Ser Pro Ala Asp Gly Leu Ser Lys Asn Asp 

390 395 400 

Asn Asp Leu Lys Val Pro Val Pro Leu Arg Lys Ser Arg Pro Leu Ser 
405 410 415 

Met Asp Ala Arg lie Gin Met Asp Glu Giu Lys Gin lie Pro Asp Gin 
420 425 430 

Asp Giu Asn Pro Ser Pro Ala Ala Ser Lys Ser Gin Lys Ala Asn Gin 
435 440 445 

Ser Arg Pro Asn Ser Ser Ala Leu Giu Thr Leu Gly Gly Giu Ala Leu 
450 455 460 

Thr Asn Gly Gly Leu Giu Leu Pro Ser Ser Val Thr Pro Ser His Se- 

470 475 480 

Lys Arg Aia Ser Asp Cys Ser Asn Leu Ser Thr Ser Glu Ser Met Asp 
485 490 495 

Tyr Gly Thr Ser Leu Ser Ala Asp Leu Ser Leu Asn Lys Glu Thr Gly 
500 505 510 

Ser Leu Ser Leu Lys Gly Ser Lys Leu His Asn Lvs Thr Leu Lys Arg 
515 520 525 
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Thr Arg Arg Phe 
530 

Lys lie lie Ser 
545 

Arg Arg Gin Glu 



Arg Asn Gin Thr 
580 

Met His Lys Arg 
595 



Vai Val Asp Giy 
535 

Glu Asp Glu Lys 
550 

Leu Arg Glu Leu 
565 

Gin Leu Ser Ser 



Phe Glu Gin Glu 
600 



99 

Val Glu Val Ser 
540 

Lys Asp Glu Glu 
555 

Arg Leu Leu Gin 
570 

Lys His Glu Leu 
58 5 

lie Asn Ala Lys 



lie Thr Thr Ser 



Met Arg Phe Leu 
560 

Lys Glu Glu His 
575 

Gin Leu Glu Gin 
590 

Lys Lys Phe Tyr 
605 



Asp Val Glu Leu Giu Asn Leu Glu Arg Gin Gin Lys Gin Gin Val Glu 
610 615 620 

Lys Met Glu Gin Asp His Ser Val Arg Arg Lys Glu Glu Ala Lys Arg 

630 635 640 

He Arg Leu Glu Gin Asp Arg Asp Tyr Ala Lys Phe Gin Glu Gin Leu 
645 650 655 

Lys Gin Met Lys Lys Glu Val Lys Ser Glu Val Glu Lys Leu Pro Arg 
660 665 670 

Gin Gin Arg Lys Glu Ser Met Lys .Gin Lys Met Glu Glu His Ser Gin 
675 680 685 

Lys Lys Gin Arg Leu Asp Arg Asp Phe Val Ala Lys Gin Lys Glu Asp 
690 695 700 

Leu Glu Leu Ala Met Arg Lys Leu Thr Thr Glu Asn Arg Ara Glu He 

710 715 ' 720 

Cys Asp Lys Glu Arg Asp Cys Leu Ser Lys Lys Gin Glu Leu Leu Arg 
725 730 735 

Asp Arg Glu Ala Ala Leu Trp Glu Met Glu Glu His Gin Leu Gin Glu 
740 745 750 

Arg His Gin Leu Val Lys Gin Gin Leu Lys Asp Gin Tyr Phe Leu Gin 
755 760 765 

Arg His Asp Leu Leu Arg Lys His Glu Lys Glu Arg Glu Gin Met Gin 
^'70 775 780 

Arg Tyr Asn Gin Arg Met Met Glu Gin Leu Lys Val Arg Gin Gin Gin 

790 795 800 

Glu Lys Ala Arg Leu Pro Lys He Gin Arg Ser Asp Glv Glu Thr Arg 
805 810 ' 815 

Met Ala Met Tyr Lys Lys Ser Leu His Tie Asn Giy Ala Giy Ser Ala 
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820 S25 830 

Ser Glu Gin Arg Glu Lys lie Lys Gin Phe Ser Gin Gin Glu GIu Lys 
835 840 845 

Arg Gin Lys Ala Glu Arg Leu Gin Gin Gin Gin Lys His Glu His Gin 
850 855 860 

Met Arg Asp Met Val Ala Gin Cys Glu Ser Asn Met Ser Glu Leu Gin 

870 875 880 

Gin Leu Gin Asn Glu Lys Cys Tyr Leu Leu Val Glu His Glu Thr Gin 
885 890 895 

Lys Leu Lys Ala Leu Asp Glu Ser His Asn Gin Ser Leu Lys Glu 
900 905 910 



<210> 93 
<211> 545 
<212> PRT 
<213> PAKl_h 

<400> 93 



Met Ser Asn Asn 
1 



Met Arg Asn Thr 

20 



Thr Leu Asn His 
35 



Lys Lys Lys Asp 
50 



Asn Lys Lys Lys 
65 



Phe Glu His Thr 



Thr Gly Met Pro 
100 



Thr Lys Ser Glu 
115 



Glu Phe Tyr Asn 
130 



Gly Leu Asp lie 
5 

Ser Thr Met lie 



Gly Ser Lys Pro 
.4 0 

Arg Phe Tyr Arg 
55 

Glu Lys Glu Arg 
70 

lie His Val Gly 
85 

Glu Gin Trp Ala 



Gin Lys Lys Asn 
120 

Ser Lys Lys Thr 
135 



Gin Asp Lys Pro 

lo" 

Gly Ala Gly Ser 
25 

Leu Pro Pro Asn 



Ser lie Leu Pro 
60 

Pro Glu lie Ser 
75 

Phe Asp Ala Val 
90' 

Arg Leu Leu Gin 
105 

Pro Gin Aia Vai 



Ser Asn Ser Gin 
140 



Pro Ala Pro Pro 
15 

Lys Asp Ala Gly 
30 

Pro Glu Glu Lys 
45 

Gly Asp Lys Thr 



Leu Pro Ser Asp 
80 

Thr Gly Glu Phe 
95 

Thr Ser Asn lie 
110 

Leu Asp Val Leu 
125 

Lys Tyr Met Ser 
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Phe Thr Asp Lys Ser Ala Glu Asp Tyr Asn Ser Ser Asn Ala Leu Asn 

150 155 160 

Val Lys Ala Val Ser Glu Thr Pro Ala Val Pro Pro Val Ser Glu Asp 
165 170 175 

Glu Asp Asp Asp Asp Asp Asp Ala Thr Pro Pro Pro Val lie Ala Pro 
180 185 190 

ArgPro Glu His Thr Lys Ser Val Tyr Thr Arg Ser Val He Glu Pro 
195 200 205 

Leu Pro Val Thr Pro Thr Arg Asp Val Ala Thr Ser Pro He Ser Pro 
210 215 220 

Thr Glu Asn Asn Thr Thr Pro Pro Asd Ala Leu Thr Leu Asn Thr Glu 
225 230 235 240 

Lys Gin Lys Lys .Lys Pro Lys Met Ser Asp Glu Glu lie Leu Glu Lys 
245 250 255 

Leu Arg Ser He Val Ser Val Gly Asp Pro Lys Lys Lys Tyr Thr Arg 
260 265 270 

Phe Glu Lys He Gly Gin Gly Ala Ser Gly Thr Val Tyr Thr Ala Met 
275 280 285 

Asp Val Ala Thr Gly Gin Glu Val Ala He Lys Gin Met Asn Leu Gin 
290 295 300 

Gin Gin Pro Lys Lys Glu Leu He He Asn Glu He Leu Val Met Arg 

310 315 320 

Glu Asn Lys Asn Pro Asn He Val Asn Tyr Leu Asp Ser Tyr Leu Val 
325 330 * 335 

Gly Asp Glu Leu Trp Val Val Met Glu Tyr Leu Ala Gly Gly Ser Leu 
340 345 350 

Thr Asp Val Val Thr Glu Thr Cys Met Asp Glu Gly Gin He Ala Ala 
355 360 365 

Val Cys Arg Glu Cys Leu Gin Ala Leu Glu Ser Leu His Ser Asn Gin 
370 375 380 

Val He His Arg Asp He Lys Ser Asp Asn He Leu Leu Gly Met Asp 

390 395 400 

Gly Ser Val Lys Leu Thr Asp Phe Gly Phe Cys Ala Gin He Thr Pro 
405 410 415 

Glu Gin Ser Lys Arg Ser Thr Met Val Gly Thr Pro Tyr Trp Met Ala 
420 425 430 

Pro Glu Val Val Thr Arg Lys Ala Tyr Gly Pro Lys Val Asp He Trp 
435 440 445 
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Ser Leu Giy lie Met 
450 

Leu Asn Glu Asn Pro 
465 



Thr Pro Glu Leu Gin 
485 

Phe Leu Asn Arg Cys 
500 

Lys Glu Leu Leu Gin 
515 

Ser Leu Thr Pro Leu 
530 

His 
545 



Ala lie Glu Met lie Glu 
455 

Leu Arg Ala Leu Tyr Leu 

470 475 

Asn Pro Glu Lys Leu Ser 
490 

Leu Glu Met Asp Val Glu 
505 

His Gin Phe Leu Lys lie 
520 

He Ala Ala Ala Lys Glu 
535 



Gly Glu Pro Pro Tyr 
4 60 

He Ala Thr Asn Gly 
480 

Ala He Phe Arg Asp 
495 

Lys Arg Gly Ser Ala 
510 

Ala Lys Pro Leu Ser 
525 

Ala Thr Lys Asn Asn 
540 



<210> 94 
<211> 506 
<212> PRT 
<213> PAK65_h 

<400> 94 



Met Glu Giu Thr 
1 



His Ser Leu Lys 
20 



His Lys He He 
35 



Lys Glu Lys Glu 
50 



Thr He His Val 
65 



Pro Glu Gin Trp 



Glu Gin Lys Lys 
100 



Asp Ser Asn Thr 
115 



Gin Gin Lys Ser 
5 



Pro Leu Pro Ser 



Ser He Phe Ser 
40 

Arg Pro Glu He 
55 

Gly Phe Asp Thr 
70 

Ala Arg Leu Leu 
85 



Asn Pro Gin Ala 



Val Lys Gin Lys 
120 



Asn Leu Glu Leu 
10 

Val Pro Glu Glu 
25 

Gly Thr Glu Lys 



Ser Pro Pro Ser 
60 

Val Thr Gly Glu 
75 

Gin Thr Ser Asn 
90 

Val Leu Asp Val 
105 

Tyr Leu Ser Phe 



Leu Ser Ala Asn 
15 

Lys Lys Pro Arg 
30 

Gly Ser Lys Lys 
45 

Asp Phe Glu His 



Phe Thr Gly Met 
80 

He Thr Lys Leu 
95 

Leu Lys Phe Tyr 
110 

Thr Pro Pro Glu 
125 
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Lys Asp Gly Phe Pro Ser Gly Thr Pro Ala Leu Asn Ala Lys Gly Thr 
130 135 140 

Glu Ala Pro Ala Val Val Thr Glu Glu Glu Asp Asp Asp Glu Glu Thr 
145 150 155 160 

Ala Pro Pro Val He Ala Pro Arg Pro Asp His Thr Lys Ser He Tyr 
165 170 175 

Thr Arc Ser Val He Asp Pro Val Pro Ala Pro Val Gly Asp Ser His 
180 185 190 

Val Asp Gly Ala Ala Lys Ser Leu Asp Lys Gin Lys Lys Lys Thr Lys 
195 200 205 

Met Thr Asp Glu Glu He Met Glu Lys Leu Arg Thr He Val Ser lie 
210 215 220 

Gly Asp Pro .Lys Lys Lys Tyr Thr Arg Tyr Glu Lys He Gly Gin Gly 
225 230 235 240 

Ala Ser Gly Thr Val Phe Thr Ala Thr Asp Val Ala Leu Gly Gin Glu 
245 250 255 

Val Ala He Lys Gin He Asn Leu Gin Lys Gin Pro Lys Lys Glu Leu 
260 265 270 

He He Asn Glu He Leu Val Met Lys Glu Leu Lys Asn Pro Asn He 
275 280 285 

Val Asn Phe Leu Asp Ser Tyr Leu Val Gly Asp Glu Leu Phe Val Val 
290 295 300 

Met Glu Tyr Leu Ala Gly Arg Ser Leu Thr Asp Val Val Thr Glu Thr 
305 310 315 320 

Cys Met Asp Glu Ala Gin He Ala Ala Val Cys Arg Glu Cys Leu Gin 
325 330 335 

Ala Leu Glu Phe Leu His Ala Asn Gin Val He His Arg Asp He Lys 
340 345 350 

Ser Asp Asn Val Leu Leu Gly Met Glu Gly Ser Val Lys Leu Thr Asp 
355 360 365 

Phe Gly Phe Cys Ala Gin He Thr Pro Glu Gin Ser Lys Arg Ser Thr 
370 375 380 

Met Val Gly Thr Pro Tyr Trp Met Ala Pro Glu Val Val Thr Arg Lys 
385 390 395 400 



Ala Tyr Gly Pro Lys Val Asp He Trp Ser Leu Gly He Met Ala He 
405 410 415 



Glu Met Val Glu Gly Glu Pro Pro Tyr Leu Asn Glu Asn Pro Leu Arc 
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420 

Ala Leu Tyr Leu lie Ala 
435 

Glu Lys Leu Ser Pro lie 
450 

Met Asp Val Glu Lys Arg 
465 470 

Phe Leu Lys Leu Ala Lys 
485 

Ala Ala Lys Glu Ala Met 
500 



104 

425 

Thr Asn Gly Thr Pro 
440 

Phe Arg Asp Phe Leu 
455 

Gly Ser Ala Lys Glu 
475 

Pro Leu Ser Ser Leu 
490 

Lys Ser Asn Arg 
505 



430 

Glu Leu Gin Asn Pro 
445 

Asn Arg Cys Leu Glu 
460 

Leu Leu Gin His Pro 
480 

Thr Pro Leu lie Met 
495 



<210> 95 

<211> 544 

<212> PRT 

<213> PAK3_m 

<400> 95 

Met Ser Asp Ser Leu Asp Asn Glu Glu Lys Pro Pro Ala Pro Pro Leu 
1 5 10 15 

Arg Met Asn Ser Asn Asn Arg Asp Ser Ser Ala Leu Asn His Ser Ser 
20 25 30 

Lys Pro Leu Pro Met Ala Pro Glu Giu Lys Asn Lys Lys Ala Arg Leu 
35 40 45 

Arg Ser He Phe Pro Gly Gly Gly Asp Lys Thr Asn Lys Lys Lys Glu 
^0 55 60 



Lys Glu Arg Pro Glu lie Ser Leu Pro Ser Asp Phe Glu His Thr He 

80 



65 70 75 



His Val Gly Phe Asp Ala Val Thr Gly Glu Phe Thr Gly He Pro Glu 
85 90 95 

Gin Trp Ala Arg Leu Leu Gin Thr Ser Asn He Thr Lys Leu Glu Gin 
100 105 110 

Lys Lys Asn Pro Gin Ala Val Leu Asp Val Leu Lys Phe Tyr Asp Ser 
115 120 125 

Lys Giu Thr Val Asn Asn Gin Lys Tyr Met Ser Phe Thr Ser Gly Asp 

135 140 



Lys Ser Ala His Gly Tyr He Ala Ala His Gin Ser Asn Thr Lys Thr 

160 



145 150 155 
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Gly Ser Glu Pro Pro 
165 

Glu Glu Glu Giu Glu 
180 

Pro Arg Pro Glu His 
195 

Ser lie Ala Ser Pro 
210 

Ala Glu Asn Ala Asn 
225 

Arg Lys Lys Ser Lys 
245 

Ser lie Val Ser Val 
260 

Lys lie Gly Gin Gly 
275 

Ala Thr Gly Gin Glu 
290 

Pro Lys Lys Glu Leu 
305 

Lys Asn Pro Asn lie 
325 

Glu Leu Trp Val Val 
340 

Val Val Thr Glu Thr 
355 

Arg, Glu Cys Leu Gin 
370 

His Arg Asp lie Lys 
385 

Val Lys Leu Thr Asp 
405 

Ser Lys Arg Ser Thr 
420 

Val Val Thr Arg Lys 
435 

Gly He Met Ala He 



Leu Ala Pro Pro Val Ser 
170 

Glu Asp Asp Asn Glu Pro 
185 

Thr Lys Ser He Tyr Thr 
200 

Ala Ala Pro Asn Lys Glu 
215 

Ser Thr Thr Leu Tyr Arg 
230 235 

Met Thr Asp Glu Glu He 
250 

Gly Asp Pro Lys Lys Lys 
265 

Ala Ser Gly Thr Val Tyr 
280 

Val Ala He Lys Gin Met 
295 

He He Asn Glu He Leu 
310 315 

Val Asn Tyr Leu Asp Ser 
330 

Met Glu Tyr Leu Ala Gly 
345 

Cys Met Asp Val Gly Gin 
360 

Ala Leu Asp Phe Leu His 
375 

Ser Asp Asn He Leu Leu 
390 395 

Phe Gly Phe Cys Ala Gin 
410 

Met Val Gly Thr Pro Tyr 
425 

Ala Tyr Gly Pro Lys Val 
440 

Glu Met Val Giu Gly Giu 



Glu Glu Glu Asp Glu 
175 

Pro Pro Val He Ala 
190 

Arg Ser Val Val Giu 
205 

Asp He Pro Pro Ser 
220 

Asn Thr Asp Arg Gin 
240 

Leu Glu Lys Leu Arg 
255 

Tyr Thr Arg Leu Glu 
270 

Thr Ala Leu Asp He 
285 

Asn Leu Gin Gin Gin 
300 

Val Met Arg Glu Asn 
320 

Tyr Leu Val Gly Asp 
335 

Gly Ser Leu Thr Asp 
350 

He Ala Ala Val Cys 
365 

Ser Asn Gin Val He 
380 

Gly Met Asp Gly Ser 
400 

He Thr Pro Glu Gin 
415 

Trp Met Ala Pro Giu 
430 

Asp He Trp Ser Leu 
445 

Pro Pro Tyr Leu Asn 
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450 455 460 

Glu Asn Pro Leu Arg Ala Leu Tyr Leu lie Aia Thr Asn Gly Thr ^ro 

470 475 480 

Glu Leu Gin Asn Pro Glu Arg Leu Ser Ala Val Phe His Asp Phe Leu 
485 490 495 

Asn Arg Cys Leu Glu Met Asp Val Asp Arg Arg Gly Ser Ala Lys Glu 
500 505 510 

Leu Leu Gin His * Pro Phe Leu Lys Leu Aia Lys Pro Leu Ser Ser Leu 



<400> 96 
<210> 96 
<211> 2110 
<212> DNA 

<213> Full Length Mammalian (Human) STLK5 



<400> 96 



ggccaagacg gtcggggctg cttgctaact ccaggaacag gtttaagttt ttgaaactga 60 

agtaggtcta cacagtagga actcatgtca tttcttgtaa gtaaaccaga gcgaatcagg 120 

cggtgggtct cggaaaagtt cattgttgag ggcttaagag atttggaact attrggagac 180 

caatgatgcg agctcagagt caatagcatc cttctctaaa caggaggtca tgaatagctt 240 

tctgccagag ggagggtgtt acgagctgct cacrgtgata ggcaaaggat ttgaggacct 300 

gatgactgtg aatctagcaa ggtacaaacc aacaggagag tacgtgactg tacggaggat 360 

taacctagaa gcttgttcca atgagatggt aacattcttg cagggcgagc tgcatgtctc 420 

caaactcttc aaccatccca atatcgtgcc atatcgagcc acttttattg cagacaatga 480 

gctgtgggtt gtcacatcat tcatggcata cggttctgca aaagatctca tctgtacaca 540 

cttcatggat ggcatgaatg agctggcgat tgcttacatc ctgcaggggg tgctgaaggc 600 

cctcgactac atccaccaca tgggatatgt acacaggagt gtcaaagcca gccacatcct 660 

gatctctgtg gatgggaagg tctacctgtc tggtttgcgc agcaacctca acatgataag 720 

ccatgggcag cggcagcgag tggtccacga ttttcccaag tacagtgtca aggttctgcc 780 

gtggctcagc cccgaggtcc tccagcagaa tctccagggt tatgatgcca agtctgacat 840 

cnacagtgrg ggaatcacag cctgtgaact ggccaacggc catgtcccct ttaaggatat 900 

gcctgccacc cagatgctgc tagagaaact gaacggcaca gtgccctgcc tgttggatac 960 

cagcaccarc cccgctgagg agctgaccat gagcccttcg cgctcagrgg ccaacrctgg 1020 

cctgagtgac agcctgacca ccagcacccc ccggccctcc aacggtgacr cgccctccca 1080 

cccctaccac cgaaccttct ccccccactt ccaccacttt grggagcagt gccttcagcg 1140 

caacccggat gccaggccca gtgccagcac cctcctgaac cactctttct tcaagcagat 1200 

caagcgacgt gcctcagagg ctttgcccga attgcttcgt cctgtcaccc ccatcaccaa 1260 

tttcgagggc agccagtccc aggaccacag tggaatcttt ggcctggcaa caaacctgga 1320 

agagctggag gtggacgatt gggagttctg agcctctgca aactgtgcgc attctccagc 1380 

cagggatgca gaggccaccc agaggccccr cccgagggcc ggccacattc ccgccctcct 1440 

gggcagattg ggtagaaagg acattcttcc aggaaagttg actgctgact gattgggaaa 1500 

gaaaatcctg gagagatact tcactgcrcc aaggcttrtg agacacaagg aaatctcaac 1560 

aaccagggat caggagggtc caaagccgac attcccagrc ctgtgagctc aggtgacctc 1620 
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ctccgcagaa gagagatgct gctctggccc 
ctccttaaac ccgaggaccg ccacctcttc 
aactttgctc tcagacgcct cagatgctat 
ctgcctccct tccctcagac ctctccctca 
taagcacaga ctaaggctgg aacagtccat 
acctaagtct ttcccacggt ttatgtgtgt 
ttagcgcctc ctgccagctg ccctggtgct 
cttgagggct taagtcctta tgctgtgtta 
agagacgctg 



107 

tgggagctga at-ccaagcc cagggtttgg 1680 

ccagtgcttg cgaccagcct cattctattt 1740 

aggtcagtga aagggcaagt agtaagccgc 1800 

taattccaga gaagggcatt tctgtctttt 1860 

ccttatccct cttctggctt gggccctgac 1920 

gcctcattcc tttcccacca agaatccatc 1980 

ttctccaagg gccatcagtg tcttgcctag 2040 

gtttcgttgt cagaacaaat taaaattttc 2100 

2110 



<210> 97 
<211> 373 
<212> PRT 

<213> Full Length Mammalian (Human) STLK 5 
<400> 97 



Met Ser Ser Phe Leu Pro Glu Gly Gly Cys Tyr Glu Leu Leu Thr Val 
15 10 15 

He Gly Lys Gly Phe Glu Asp Leu Met Thr Val Asn Leu Ala Arg Tyr 
20 25 30 

Lys Pro Thr Gly Glu Tyr Val Thr Val Arg Arg He Asn Leu Glu Ala 
35 40 45 

Cys Ser Asn Glu Met Val Thr Phe Leu Gin Gly Glu Leu His Val Ser 
50 55 60 

Lys Leu Phe Asn His Pro Asn He Val Pro Tyr Arg Ala Thr Phe He 

70 75 80 

Ala Asp Asn Glu Leu Trp Val Val Thr Ser Phe Met Ala Tyr Gly Ser 
85 90 95 

Ala Lys Asp Leu He Cys Thr His Phe Met Asp Gly Met Asn Glu Leu 
100 105 110 

Ala He Ala Tyr He Leu Gin Gly Val Leu Lys Ala Leu Asp Tyr He 
115 120 125 

His His Met Gly Tyr Val His Arg Ser Val Lys Ala Ser His He Leu 
130 135 140 

He Ser Val Asp Gly Lys Val Tyr Leu Ser Gly Leu Arg Ser Asn Leu 

150 155 160 

Ser Met He Ser His Gly Gin Arg Gin Arg Val Val His Asp Phe Pro 
165 170 '175 

Lys Tyr Ser Val Lys Val Leu Pro Trp Leu Ser Pro Glu Val Leu Gin 
180 185 190 
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Gin Asn Leu Gin Gly Tyr Asp Ala Lys Ser Asp lie Tyr 



195 



Ser Vai Gl\ 



200 



205 



lie Thr Ala Cys Giu Leu Ala Asn Gly His Val Pro Phe Lvs Asp Me^ 

215 220 



Pro Aia Thr Gin Met Leu Leu Giu Lys Leu Asn Gly Thr Val Pro Cvs 
225 230 



235 



240 



Leu Leu Asp Thr Ser Thr lie Pro Aia Giu Giu Leu Thr Met Ser Pro 
245 250 255 



Ser Arg Ser Vai Aia Asn Ser Gly Leu Ser Asp 



260 



265 



Ser Leu Thr Thr Se:i 
270 



Thr Pro Arg Pro Ser Asn Gly Asp Ser Pro Ser His Pro Tyr His Arg 
2"^^ 280 285 



Thr Phe Ser Pro His Phe His His Phe Vai Giu Gin 



290 



295 



300 



Cys Leu Gin Arg 



Asn Pro Asp Aia Arg Pro Ser Aia Ser Thr Leu Leu Asn His Ser Phe 
305 310 



315 



320 



Phe Lys Gin lie Lys Arg Arg Aia Ser Giu Aia Leu Pro Giu Leu Leu 
325 330 



335 



Arg Pro Val Thr Pro lie Thr Asn Phe Giu 
340 



Gly Ser Gin Ser Gin Asp 
345 350 



His Ser Gly lie Phe Gly Leu Vai Thr Asn Leu Giu Giu Leu Giu Val 
355 360 



365 



Asp Asp Trp Giu Phe 
370 



<210> 98 
<2il> 2001 
<212> DNA 

<213> Mammalian (Human) STLK6 
<400> 98 



aaggaagata aaacaaaagc cttctttgga atagatggar ttttgtcact ttctgtgtga 
actaaagtga ttcaatgtct ctttrggatt gcttctgcac ttcaagaaca caagttgaat 
cactcagacc tgaaaaacag tctgaaacca gtatccatca atacttggtt gatgagccaa 
ccctttcctg gtcacgtcca tccactagag ccagtgaagt acuatgtccc accaacgttt 
ctcactatga gctccaagta gaaataggaa gaggatttga caacttgact tctgtccatc 
ttgcacggca tactcccacg ggaacactgg taactataaa aattacaaat crggaaaa-t 
gcaatgaaga acgcctgaaa gctttacaga aagccgtgat tctatcccac tttttccggc 



120 

240 
300 
360 
420 
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atcccaatat tacaacttat tggacagttt tcactgttgg cagctggctt tgggttattt 
ctccatttat ggcctatggt tcagcaagtc aactcttgag gacctacttt cctgaaggaa 
tgagrgaaac tttaataaga aacattctct ttggagccgt gagagggttg aactatctgc 
accaaaatgg ctgtattcac aggagtatra aagccagcca tatcctcatt tctggtgatg ^ou 
gcctagrgac cctctctggc ctrtcccatc tgcatagttt ggttaagcat ggacagaggc 720 
atagggctgt gratgatttc ccacagttca gcacatcagt gcagccgtgg ctgagtccag 780 
aactactgag acaggattta cargggtata atgtgaagtc agatatttac agtgtcggga 
ttacagcatg tgaattagcc agtgggcagg tgcctttcca ggacatgcat agaactcaga 
tgctgttaca gaaactgaaa ggtcctcctt atagcccatt ggatatcagt attttccctc 
aatcagaatc cagaatgaaa aattcccagt caggtgtaga ctcrgggatt ggagaaagtg 
tgcrtgtctc cagtggaact cacacagtaa atagtgaccg attacacaca ccatcctcaa ^uou 
aaactttctc tcctgccttc tttagcttgg tacagctctg tttgcaacaa gatcctgaga 1140 
aaaggccatc agcaagcagt ttattgtccc atgttttctt caaacagatg aaagaagaaa 1200 
gccaggattc aatactttca ctgttgcctc ctgcttataa caagccatca atatcattgc 1260 
ctccagtgtt accttggact gagccagaat gtgattttcc tgatgaaaaa gactcatact 1320 
gggaattcta gggctgccaa atcattttat gtcctatata cttgacactt tctccttgct 1380 
gctttttctt ctgtatttcr aggtacaaat accagaatta tacttgaaaa tacagttggt 
gcactggaga atctattatt taaaaccact ctgttcaaag gggcaccagt ttgtagtccc 
tctgtttcgc acagagtact atgacaagga aacatcagaa ttactaatct aqctagtgrc 
at-ttattcng gaattttttt ctaagctgtg actaactctt tttatcrctc aatataattr 
ttgagccagt taattttttt cagtattttg ctgtcccttg ggaatgggcc ctcagaggac xoau 
agtgcttcca agtacatctt ctcccagatt ctctggcctt tttaatgagc tattgttaaa 1740 
ccaacaggct agtttatctt acatcagacc cttttctggt agagggaaaa tgtttgtgct 1800 
ttcccttttt cttctgttaa tacttatggt aacacctaac tgagcctcac tcacattaaa 
tgattcactt gaaatatata cagaaattgt aatttgcttt tttttaaaaa agggggctaa 
agtaacactt tcctacttat gtaaattata gatcctaaat tcacgcaccc cgtgggagct 
caataaagat ttactgaatt g 



480 
540 
600 
660 



840 
900 
960 
1020 
1080 



1440 
1500 
1560 
1620 
1680 



1860 
1920 
1980 
2001 



<210> 99 
<211> 373 
<212> PRT 

<213> Mammalian (Human) STLK6 
<400> 99 

Met Ser 3er Phe Leu Pro Giu Gly Giy Cys Tyr Glu Leu Leu Thr Val 
1 5 10 15 

He Gly Lys Gly Phe Glu Asp Leu Met Thr Val Asn Leu Ala Ara Tvr 
20 25 30 

Lys Pro Thr Gly Glu Tyr Val Thr Val Arg Arg He Asn Leu Glu Ala 
35 40 45 

Cys Ser Asn Glu Met Val Thr Phe Leu Gin Gly Glu Leu His Val Ser 
^0 55 60 

Lys Leu Phe Asn His Pro Asn He Val Pro Tyr Arg Ala Thr Phe He 

70 75 80 

Ala Asp Asn Glu Leu Trp Val Val Thr Ser Phe Met Ala Tyr Gly Ser 
85 90 95 
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Ala Lys Asp Leu He Cys Thr His Phe Met Asp Gly Met Asn Glu Leu 
100 105 210 

Ala lie Ala Tyr He Leu Gin Gly Val Leu Lys Ala Leu Asd Ty- He 
115 120 125 

His His Met Gly Tyr Vai His Arg Ser Val Lys Ala Ser His He Leu 
130 135 

He Ser Val Asp Gly Lys Val Tyr Leu Ser Gly Leu Arg Ser Asn Leu 

150 155 160 

Ser Met He Ser His Gly Gin Arg Gin Arg Val Val His Asp Phe Pro 
165 170 175 

Lys Tyr Ser Val Lys Val Leu Pro Trp Leu Ser Pro Glu Val Leu Gin 
180 185 190 

Gin Asn Leu Gin Gly Tyr Asp Ala Lys Ser Asp He Tyr Ser Val 
195 • 200 • 205 

He Thr Ala Cys Glu Leu Ala Asn Gly His Val Pro Phe Lys Asp Me^ 
210 215 220 

Pro Ala Thr Gin Met Leu Leu Glu Lys Leu Asn Gly Thr Val Pro 
2" 230 235 240 

Leu Leu Asp Thr Ser Thr He Pro Ala Glu Glu Leu Thr Met Ser Pro 
245 250 255 

Ser Arg Ser Val Ala Asn Ser Gly Leu Ser Asp Ser Leu Thr Thr 
260 265 270 

Thr Pro Arg Pro Ser Asn Gly Asp Ser Pro Ser His Pro Tyr His Arg 
275 280 285 

Thr Phe Ser Pro His Phe His His Phe Val Glu Gin Cys Leu Gin 
290 295 300 

Asn Pro Asp Ala Arg Pro Ser Ala Ser Thr Leu Leu Asn His Ser Phe 

310 315 320 

Phe Lys Gin He Lys Arg Arg Ala Ser Glu Ala Leu Pro Glu Leu 
325 330 335 

Arg Pro Val Thr Pro He Thr Asn Phe Glu Gly Ser Gin Ser Gin Asp 
340 345 350 

His Ser Gly He Phe Gly Leu Vai Thr Asn Leu Glu Glu Leu Glu 
355 360 365 

Asp Asp Trp Glu Phe 

370 
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<210> 100 
<211> 311 
<212> DNA 

<213> Mammalian (Human) STLK7 
<400> 100 

tcaacaggga cgattacgag ctgcaggagg tgatcgggag tggagcaact gctgtagtcc 
aagcagctta ttgtgcccct aaaaaggaga aagtggcaat caaacggata aaccttgaga 
aatgtcaaac tagcatggat gaactcctga aagaaattca agccatgagt caatgccarc 180 
atcctaatat tgtatcttac tacacatctt ttgtggtaaa agatgagctg tggcttgtca 240 
tgaagctgct aagtggaggt tctgttctgg atattattaa gcacattgtg gcaaaagggg 300 
aacacaaaag t 



<210> 101 
<211> 103 
<212> PRT 

<213> Mammalian (Human) STLK7 
<400> 101 

Asn Arg Asp Asp Tyr Glu Leu Gin Glu Val lie Gly Ser Gly Ala Thr 
1 5 . 10 15 

Ala Val Val Gin Ala Ala Tyr Cys Ala Pro Lys Lys Glu Lys Val Ala 
20 25 30 

He Lys Arg He Asn Leu Glu Lys Cys Gin Thr Ser Met Asp Glu Leu 
35 40 45 

Leu Lys Glu He Gin Ala Met Ser Gin Cys His His Pro Asn He Val 
50 55 60 

Ser Tyr Tyr Thr Ser Phe Val Val Lys Asp Glu Leu Trp Leu Val Met 

70 75 80 

Lys Leu Leu Ser Gly Gly Ser Val Leu Asp He He Lys His He Val 
85 90 95 

Ala Lys Gly Glu His Lys Ser 
100 



<210> 102 
<211> 2806 
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<212> DNA 

<213> Full Length Mammalian (Human) PAK5 
<400> 102 

cgggagtgrc cgcggtggrg gcggtgcaag agagctgaag gaggcgcgac ggcgcggagt 60 

tccaggccga gcagttaggc cgcgagcgac tgcggcgccg agccgatgag taacccgaag 120 

cccctagagg agtggtcacc rgcctgaggg cacttctgtc ccaccagcat cagaccaggc 180 

cgcaccgagt ccccggcacc atgtttggga agaggaagaa gcgggrggag atctccgcgc 240 

cgtccaactt cgagcaccgc gtgcacacgg gcttcgacca gcacgagcag aagttcacgg 300 

ggctgccccg ccagtggcag agcctgatcg aggagtcggc . tcgccggccc aagcccctcg 360 

tcgaccccgc ctgcatcacc tccatccagc ccggggcccc caagaccacc gtgcggggca 420 

gcaaaggtgc caaagatggg gccctcacgc tgctgctgga cgagtttgag aacatgrcgg 480 

tgacacgctc caacrccctg cggagagaca gcccgccgcc gcccgcccgt gcccgccagg 54 0 

aaaatgggat gccagaggag ccggccacca cggccagagg gggcccaggg aaggcaggca 600 

gccgaggccg gttcgccggt cacagcgagg cgggtggcgg cagtggtgac aggcgacggg 660 

cgqggccaga gaagaggccc aagtcttcca gggagggctc agggggtccc caggagtcct 720 

cccgggacaa acgccccctc tccgggcctg atgtcggcac cccccagcct getggtctgg 780 

ccagtggggc gaaactggca gctggccggc cctttaacac ctacccgagg gctgacacgg 840 

accacccatc ccggggtgcc cagggggagc • ctcatgacgt ggcccctaac gggccatcag 900 

cggggggccc ggccatcccc cagtcctcct cctccccctc ccggcctccc acccgagccc 960 

gaggtgcccc cagccctgga gtgctgggac cccacgcctc agagccccag ctggcccctc 1020 

cagcctgcac ccccgccgcc cctgctgttc ctgggccccc tggcccccgc tcaccacagc 1080 

gggagccaca gcgagtatcc catgagcagt tccgggctgc cctgcagctg gtggtggacc 1140 

caggcgaccc ccgctcctac ctggacaact tcatcaagat tggcgagggc tccacgggca 1200 

tcgtgtgcat cgccaccgtg cgcagctcgg gcaagctggt ggccgtcaag aagatggacc 1260 

tgcgcaagca gcagaggcgc gagctgctct ccaacgaggt ggtaatcatg agggactacc 1320 

agcacgagaa tgtggtggag atgtacaaca gctacctggt gggggacgag ctctgggtgg 1380 

tcatggagtt cctggaagga ggcgccctca ccgacatcgt cacccacacc aggatgaacg 1440 

aggagcagat cgcggccgtg tgccttgcag tgctgcaggc cctgtcggtg ctccacgccc 1500 

agggcgtcat ccaccgggac atcaagagcg actcgatcct gctgacccat gatggcaggg 1560 

tgaagctgtc agactttggg ttccgcgccc aggtgagcaa ggaagtgccc cgaaggaagt 1620 

cgctggtcgg cacgccctac tggatggccc cagagctcat ctcccgcctt ccctacgggc 1680 

cagaggtaga catctggtcg ctggggataa tggtgattga gatggtggac ggagagcccc 1740 

cctacttcaa cgagccaccc crcaaagcca tgaagatgat tcgggacaac ctgccacccc 1800 

gactgaagaa cctgcacaag gtgtcgccat ccctgaaggg cttccrggac cgcctgctgg I860 

tgcgagaccc tgcccagcgg gccacggcag ccgagctgct gaagcaccca ttcctggcca 1920 

aggcagggcc gcctgccagc atcgtgcccc tcatgcgcca gaaccgcacc agatgaggcc 1980 

cagcgccctt cccctcaacc aaagagcccc cccgggtcac ccccgcccca ctgaggccag 2040 

tagggggcca ggcctcccac tcctcccagc ccgggagatg ctccgcgtgg caccaccctc 2100 

ctngctgggg gtagatgaga ccctactact gaactccagt tttgatctcg tgacttttag 2160 

aaaaacacag ggactcgtgg gagcaagcga ggctcccagg acccccaccc tctgggacag 2220 

gccctccccc acgttcttct gtctccagga agggcagcgg ccctcccatc actggaagtc 2280 

tgcagtgggg gtcgctgggg gtggagagaa cactaagagg tgaacatgta tgagtgtgtg 2340 

cacgcgtgtg agtgtgcarg tgtgtgtgtg tgcaaaggtc cagccacccc gtcctccagc 2400 

ccgcaagggg tgtctggcgc cttgcctgac acccagcccc ctctccccct gagccattgt 2460 

gggggtcgat catgaatgtc cgaagagtgg ccutttcccg tagccctgcg ccccctttct 2520 

gtggctggac ggggagacag gtcagggccc cccaccctct ccagcccctg cagcaaatga 2580 

ctactgcacc tggacagcct cctcttttct agaagtctat ttatattgtc attttataac 2640 

actctagccc ctgcccttat tgggggacag atggtccctg ccctgcgggg tggccctggc 2700 

agaaccactg cctgaagaac caggttcctg cccggtcagc gcagccccag cccgcccac- 2760 

ccrgcctcga gttagtttta caattaaaac attgtcttgt tttgtg 2806 
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<210> 103 

<211> 591 

<212> PRT 

<213> Full Length " Mammalian (Human) PAK5hu ■ 

<400> 103 

Met Phe Gly Lys Arg Lys Lys Arg Val Glu He Ser Ala Pro Ser Asn 
1 5 10 15 

Phe Glu His Arg Val His Thr Gly Phe Asp Gin His Glu Gin Lys Phe 
2C 25 30 

Thr Gly Leu Pro Arg Gin Trp Gin Ser Leu He Glu Glu Ser Ala Arg 
35 40 45 

Arg Pro Lys Pro Leu Val Asp Pro Ala Cys He Thr Ser He Gin Pro 
' 50 55 60 

Gly Ala Pro Lys Thr He Val Arg Gly Ser Lys Gly Ala Lys Asp Gly 
65 70 75 80^ 

Ala Leu Thr Leu Leu Leu Asp Glu Phe Glu Asn Met Ser Val Thr Arg 
85 90 95 

Ser Asn Ser Leu Arg Arg Asp Ser Pro Pro Pro Pro Ala Arg Ala Arg 
100 105 110 

Gin Glu Asn Gly Met Pro Glu Glu Pro Ala Thr Thr Ala Arg Gly Gly 
115 120 125 

Pro Gly Lys Ala Gly Ser Arg Gly Arg Phe Ala Gly His Ser Glu Ala 
130 135 140 

Gly Gly Gly Ser Gly Asp Arg Arg Arg Ala Gly Pro Glu Lys Arg Pro 

150 155 160 

Lys Ser Ser Arg Glu Gly Ser Gly Gly Pro Gin Glu Ser Ser Arg Asp 
165 170 175 

Lys Arg Pro Leu Ser Gly Pro Asp Val Gly Thr Pro Gin Pro Ala Gly 
180 185 190 

Leu Ala Ser Gly Ala Lys Leu Ala Ala Gly Arg Pro Phe Asn Thr Tyr 
195 200 205 

Pro Arg Ala Asp Thr Asp His Pro Ser Arg Gly Ala Gin Gly Glu Pro 
■210 215 220 

His Asp Val Ala Pro Asn Gly Pro Ser Ala Gly Gly Leu Ala He Pro 
225 230 235 240 

Gin Ser Ser Ser Ser Ser Ser Arg Pro Pro Thr Arg Ala Arg Gly Ala 
245 250 255 
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Pro Ser Pro Gly Val Leu Gly Pro His Ala Ser Glu Pro Gin Leu Ala 
260 265 270 

Pro Pro Ala Cys Thr Pro Ala Ala Pro Ala Val Pro Gly Pro Pro Gly 
275 280 285 

Pro Arg Ser Pro Gin Arg Glu Pro Gin Arg Val Ser His Glu Gin Phe 
290 295 300 

Arg Ala Ala Leu Gin Leu Val Val Asp Pro Gly Asp Pro Arg Ser Tyr 
305 310 315 320 

Leu Asp Asn Phe lie Lys He Gly Glu Gly Ser Thr Gly He Val Cys 
325 330 335 

He Ala Thr Val Arg Ser Ser Gly Lys Leu Val Ala Val Lys Lys Met 
340 345 350 

Asp Leu Arg Lys Gin Gin Arg Arg Glu Leu Leu Phe Asn Glu Val Val 
355 360 365 

He Met Arg Asp Tyr Gin His Glu Asn Val Val Glu Met Tyr Asn Ser 
370 375 380 

Tyr Leu Val Gly Asp Glu Leu Trp Val Val Met Glu Phe Leu Glu Gly 
385 390 395 400 

Gly Ala Leu Thr Asp He Val Thr His Thr Arg Met Asn Glu Glu Gin 
405 410 415 

He Ala Ala Val Cys Leu Ala Val Leu Gin Ala Leu Ser Val Leu His 
420 425 430 

Ala Gin Gly Val He His Arg Asp He Lys Ser Asp Ser He Leu Leu 
435 440 445 

Thr His Asp Gly Arg Val Lys Leu Ser Asp Phe Gly Phe Cys Ala Gin 
450 455 460 

Val Ser Lys Glu Val Pro Arg Arg Lys Ser Leu Val Gly Thr Pro Tyr 

470 475 480 

Trp Met Ala Pro Glu Leu He Ser Arg Leu Pro Tyr Gly Pro Glu Val 
485 490 495 

Asp He Trp Ser Leu Gly He Met Val He Glu Met Val Asp Gly Glu 
500 505 510 

Pro Pro Tyr Phe Asn Glu Pro Pro Leu Lys Ala Met Lys Met He Arg 
515 520 525 

Asp Asn Leu Pro Pro Arg Leu Lys Asn Leu His Lvs Val Ser Pro Ser 
530 535 540 
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Leu Lys Gly Phe Leu Asp Arg Leu Leu Val Arg Asp Pro Ala Gin Arg 

550 555 560 

Ala Thr Ala Ala Glu Leu Leu Lys His Pro Phe Leu Ala Lys Ala Gly 

565 570 575 

Pro Pro Ala Ser He Val Pro Leu Met Arg Gin Asn Arg Thr Arg 
580 585 590 



<210> 104 

<211> 3684 

<212> DNA 

<213> Full Length Mammalian (Human) 2C4 

<400> 104 



atggcgggac 

ccaactggaa 

ttgggacttc 

gatgaggaag 

aacattgtgt 

ctttggat gg 

agr aatcaga 

ttagctcacc 

ctgactcata 

actaatggaa 

gactgtgatg 

attactgcca 

gctctcttcg 

aagttccaca 

gcaaacatgc 

teat taacaa 

gagaaaaaat 

aggtrcaggg 

cgcaggcaaa 

agacaagttg 

attcaggctg 

gaggttcaag 

ttctcttcaa 

agacaaagga 

aaaattttag 

aaagaagaat 

cctgaaggcc 

actggaagga 

gacatccatg 

aacgaatggg 

gcacctgtca 

gctgatactg 

catgccaatg 

gaccatgatg 

cctagctgtc 

ggaagtcgtg 

catggaggaa 

gccaatatag 



ctgggggctg gagggacagg 
tat tctcact agataaaacc 
atgagaagac rggtgcattt 
aggatctcag gactgaactc 
ccttctatgg agcatttttc 
tgatggagtt atgtgcagca 
gtttaaaaga agattggatt 
ttcacgcaca ccgagtaatt 
atgctgaagt aaaactggtt 
gaaggaatag tttcattggg 
aggacccaag acgctcctat 
ttgaaatggc tgaaggagcc 
trattttgcg ggaatctgct 
atttcatgga aaagtgtacg 
ttcaacaccc atttgttcgg 
ggcatcttac tggaatcatt 
caaaagtttc tactctgagg 
caaagtcatc atggagacct 
ggcgccaacg cagatgggaa 
ataaagacaa agaagatgaa 
aagtccagat agagccattg 
agagatctcc ttctgtgcct 
gcgttcctca gcggtctctt 
gttcgcaaaa tcgrcaaaat 
caggcaaaac acagagctac 
ttcaagaagg aatgaatcaa 
attgtatttg gcaattgggt 
ggtctcagtc atcaccacct 
ttccagatgg atttaaagta. 
taggctataa tgcactctct 
ttcagccacc tgaagaggat 
atggtgatga tgatgatgag 
gcaatgatga cttggataac 
atgacaacaa taagtttgtt 
caagggcaag ctatggcaga 
gaaaagagga agcctacaga 
gtgcagccag tgaggacaat 
gcagrgaaag aagaggcagt 



gaggtcacgg 
attggccttg 
acagctgtta 
aaccttctga 
aagctgagtc 
ggttcggtca 
gcttatatct 
caccgggaca 
gat tttggag 
acaccatact 
gat tacagaa 
cctcctctgt 
cccacagtca 
ataaaaaatc 
gatataaaaa 
aaaaaaagac 
caagcactgg 
gaaaagct tg 
gatatcttta 
tcatcagaca 
aagccataca 
aacaaccagg 
ttggaacaag 
tggctggcag 
tgtttaacaa 
aagtgtcagg 
gaatcttctt 
tattctacta 
ggaaaaatat 
gaaatcttcc 
ggtgattatg 
tctaatgata 
caggttgatc 
gatgatgtaa 
gatggaagct 
ggctatggaa 
gcagccat tg 
gagggtgatg 



atctgggcca 
gtacttatgg 
aagtgatgaa 
ggaagtactc 
cccctggtca 
ctgatgtagt 
gccgagaaat 
tcaaaggtca 
tgagtgccca 
ggatggcacc 
gtgatgtgtg 
gtaaccttca 
aatccagcgg 
tcctgtttcg 
atgaacgaca 
agaaaaaaga 
caaaaagact 
aact ct cgga 
atcagcatga 
atgatgaagt 
tttcaaatcc 
atcatgcaca 
ctcagaagcc 
catcaggtga 
tttatatttc 
gagcccaagt 
ctgaggaaga 
ttgatcagaa 
caccccctgt 
ggaatgattg 
ttgaactcta 
ctt ttgaaga 
aggctaatga 
ataataatta 
gcaagcaaga 
gccatacagc 
gagatcagga 
gaggtaaggg 



cctgccggat 

cagaatctat 

cgctcgtaag 

tttccacaaa 

gcggcaccaa 

gagaatgacc 

ccttcagggc 

gaatgtgctg 

ggtgagcaga 

tgaggtgatt 

gtctgtggga 

acccttggaa 

atggtcccgt 

tcctacttct 

tgttgttgag 

acaggcacgg 

at caccaaag 

tt tagaagcc 

ggaagaattg 

atttcattcg 

taaaaaaatt 

tcatgtcaag 

cattgacatc 

tt caaagcac 

agaagtcaag 

aggattagga 

aagtcctgtg 

gttgctggtt 

atacttgaca 

gt taactccg 

tgatgccagt 

tacctatgat 

tgtttgtaaa 

ttatgaggcg 

tggttatgat 

caatagaagc 

agaacatgca 

agtcgt tcga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
y 1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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accagrgaag .agagtggagc ccrtggacrc aatggagaag aaaattgctc agagacagat 2340 

ggtccaggat tgaagagacc tgcgrcccag gactttgaat atctacagga ggagccaggt 2400 

ggtggaaarg aggcctcaaa tgccattgac tcaggtgctg caccgtcagc accrgatcat 2460 

gagagtgaca a-zaaggacat atcagaatca tcaacacaan cagatttttc tgccaatcac 2520 

tcatctcctt ccaaaggttc tgggatgtct gctgatgcra actttgccag tgccatctta 2580 

tacgcrggat tcgtagaagt acctgaggaa tcacctaagc aaccctctga agtcaatgtt 2640 

aacccactct atgtctcrcc tgcatgtaaa aaaccactaa tccacacgta tgaaaaggag 2700 

ttcacttctg agatctgcrg tggttctttg tggggagtca- atttgctgtt gggaacccga 2760 

tctaatctar atcrgatgga cagaagtgga aaggctgaca ttactaaact tataaggcga 2820 

agaccattcc gccagattca agtcttagag ccactcaatt tgctgattac catcccaggt 2880 

cataagaaca gacttcgggt gtarcatctg acctggttga ggaacaagat tttgaataac 2940 

gatccagaaa gtaaaagaag gcaagaagaa atgctgaaga cagaggaagc ctgcaaagct 3000 

attgataagt taacaggctg tgaacacttc agtgtcctcc aacatgaaga aacaacatat 3060 

attgcaattg ctttgaaatc atcaatrcac ctttatgcat gggcaccaaa gtccrttgat 3120 

gaaagcacrg ctattaaagt atttccaaca cttgatcata agccagtgac agttgacctg 3180 

gctattggtt ctgaaaaaag actaaagatt ttcttcagct cagcagatgg atatcacctc 3240 

atcgatgcag aatctgaggt tatgtctgat gtgaccctgc caaagaatcc cctggaaatc 3300 

attataccac agaatatcat cartttacct gattgcttgg gaattggcat gargctcacc 3360 

ttcaatgctg aagccctcrc tgtggaagca aatgaacaac tcttcaagaa gatccttgaa 3420 

atgtggaaag acataccatc ttcratagct tttgaatgta cacagcgaac cacaggatgg 3480 

ggccaaaagg ccattgaagt gcgctctttg caatccaggg ttctggaaag tgagctgaag 3540 

cgcaggtcaa ttaagaagct gagattcctg tgcacccggg gtgacaagct gttctttacc 3600 

tctaccctgc gcaatcacca cagccgggtt tacttcatga cacttggaaa acttgaagag 3660 

ctccaaagca attatgargt ctaa 3684 



<210> 105 
<211> 1227 
<212> PRT 

<213> Full Length Mammalian (Human) ZC4 
<400> 105 

Met Ala Gly Pro Gly Giy Trp Arg Asp Arg Giu Val Thr Asp Leu Gly 
^5 10 15 

His Leu Pro Asp Pro Thr Gly He Phe Ser Leu Asp Lys Thr He Gly 
20 25 30 

Leu Gly Thr Tyr Giy Arg He Tyr Leu Gly Leu His Glu Lys Thr Gly 
35 40 45 

Ala Phe Thr Ala Val Lys Val Met Asn Ala Arg Lvs Asp Glu Glu Glu 
50 55 60 

Asp Leu Arg Thr Giu Leu Asn Leu Leu Arg Lys Tyr Ser Phe His Lys 
65 70 75 80 

Asn He Val Ser Phe Tyr Gly Ala Phe Phe Lys Leu Ser Pro Pro Gly 
85 90 95 

Gin Arg His Gin Leu Trp Met Val Met Glu Leu Cys Ala Ala Gly Ser 
100 105 110 
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Val Thr Asp Val Val Arg Met Thr Ser Asn Gin Ser Leu Lys Glu Asp 
il5 120 125 

Trp lie Ala Tyr He Cys Arg Glu He Leu Gin Gly Leu Ala His Leu 
130 135 140 

His Ala His Arg Val He His Arg Asp lie Lys Gly Gin Asn Val Leu 
145 150 155 160 

Leu Thr His Asn Ala Glu Val Lys Leu Val Asp Phe Gly Val Ser Ala 
165 170 175 

Gin Val Ser Arg Thr Asn Gly Arg Arg Asn Ser Phe He Gly Thr Pro 
180 185 ' 190 

Tyr Trp Met Ala Pro Glu Val He Asp Cys Asp Glu Asp Pro Arg Arg 
195 200 205 

Ser Tyr Asp Tyr Arg Ser Asp Val Trp Ser Val Gly He Thr Ala He 
210 215 220 

Glu Met Ala Glu Gly Ala Pro Pro Leu Cys Asn Leu Gin Pro Leu Glu 
225 230 235 240 

Ala Leu Phe Val He Leu Arg Glu Ser Ala Pro Thr Val Lys Ser Ser 
245 250 255 

Gly 'Trp Ser Arg Lys Phe His Asn Phe Met Glu Lys Cys Thr He Lys 
260 265 270 

Asn Phe Leu Phe Arg Pro Thr Ser Ala Asn Met Leu Gin His Pro Phe 
275 280 285 

Val Arg Asp He Lys Asn Glu Arg His Val Val Glu Ser Leu Thr Arg 
290 295 300 

His Leu Thr Gly He He Lys Lys Arg Gin Lys Lys Glu Gin Ala Arg 

310 315 

Glu Lys Lys Ser Lys Val Ser Thr Leu Arg Gin Ala Leu Ala Lys Arg 
325 330 335 

Leu Ser Pro Lys Arg Phe Arg Ala Lys Ser Ser Trp Arg Pro Glu Lys 
340 345 350 

Leu Glu Leu Ser Asp Leu Glu Ala Arg Arg Gin Arg Arg Gin Arg Arg 
355 360 365 

Trp Glu Asp He Phe Asn Gin His Glu Glu Glu Leu Arg Gin Val Asp 
^■^0 375 380 

Lys Asp Lys Glu Asp Glu Ser Ser Asp Asn Asp Glu Val Phe His Ser 

390 395 400 
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He Gin Ala Glu Val Gin lie Glu Pro Leu Lys Pro Tyr He Ser Asn 
405 410 415 

Pro Lys Lys He Glu Val Gin Glu Arg Ser Pro Ser Val Pro Asn Asn 
420 425 430 



Gin Asp His Ala His His Val Lys Phe Ser Ser Ser Val Pro Gin Arg 



A3t 



440 



445 



Ser Leu Leu Glu Gin Ala Gin Lys Pro He Asp He Arg Gin Arg Ser 
450 455 460 

Ser Gin Asn Arg Gin Asn Trp Leu Ala Ala Ser Gly Asd Ser Lys His 
465 470 475 " 480 

Lys He Leu Ala Gly Lys Thr Gin Ser Tyr Cys Leu Thr He Tyr He 
485 490 495 

Ser Glu Val Lys Lys Glu Glu Phe Gin Glu Gly Met Asn Gin Lvs Cys 
500 505 510 

Gin Gly Ala Gin Val Gly Leu Gly Pro Glu Gly His Cys He Trp Gin 
515 520 525 

Leu Gly Glu Ser Ser Ser Glu Glu Glu Ser Pro Val Thr Gly Arg Arg 
530 535 



Ser Gin Ser Ser Pro Pro Tyr Ser Thr He Asp Gin Lys Leu Leu Val 

550 555 560 

Asp He His Val Pro Asp Gly Phe Lys Val Gly Lys He Ser Pro Pro 
565 570 575 

Val Tyr Leu Thr Asn Glu Trp Val Gly Tyr Asn Ala Leu Ser Glu He 
580 585 590 

Phe Arg Asn Asp Trp Leu Thr Pro Ala Pro Val He Gin Pro Pro Glu 
595 600 605 

Glu Asp Gly Asp Tyr Val Glu Leu Tyr Asp Ala Ser Ala Asp Thr Asp 

615 620 

Gly Asp Asp Asp Asp Glu Ser Asn Asp Thr Phe Glu Asp Thr Tyr Asp 

630 635 640 

His Ala Asn Gly Asn Asp Asp Leu Asp Asn Gin Val Asp Gin Ala Asn 
645 650 655 

Asp Val Cys Lys Asp His Asp Asp Asp Asn Asn Lys Phe Val Asp Asp 
660 665 670 

Val Asn Asn Asn Tyr Tyr Glu Ala Pro Ser Cys Pro Arg Ala Ser Tyr 
675 680 685 
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Gly Arg Asp Gly Ser Cys Lys Gin Asp Gly Tyr Asp Gly Ser Arg Giy 
690 695 700 

Lys Glu Glu Ala Tyr Arg Giy Tyr Gly Ser His Thr Ala Asn Arg Ser 
705 710 715 720 

His Gly Giy Ser Ala Ala Ser Glu Asp Asn Ala Ala lie Gly Asp Gin 
725 730 735 

Glu Glu His Ala Ala Asn lie Gly Ser Glu Arg Arg Gly Ser Glu Gly 
740 745 750 

Asp Gly Gly Lys Gly Val Val Arg Thr Ser Glu Glu Ser Gly Ala Leu 
755 760 765 

Gly Leu Asn Gly Glu Glu Asn Cys Ser Glu Thr Asp Gly Pro Gly Leu 
770 775 780 

Lys Arg Pro Ala Ser Gin Asp Phe Glu Tyr Leu Gin Glu Glu Pro Gly 
785 790 795 800 

Gly Gly Asn Glu Ala Ser Asn Ala lie Asp Ser Gly Ala Ala Pro Ser 
805 810 815 

Ala Pro Asp His Glu Ser Asp Asn Lys Asp lie Ser Glu Ser Ser Thr 
820 825 830 

Gin Ser Asp Phe Ser Ala Asn His Ser Ser Pro Ser Lys Giy Ser Gly 
835 840 845 

Met Ser Ala Asp Ala Asn Phe Ala Ser Ala lie Leu Tyr Ala Gly Phe 
850 855 860 

Val Glu Val Pro Glu Glu Ser Pro Lys Gin Pro Ser Glu Val Asn Val 
865 870 875 880 

Asn Pro Leu Tyr Val Ser Pro Aia Cys Lys Lys Pro Leu lie His Met 
885 890 895 

Tyr Glu Lys Glu Phe Thr Ser Glu lie Cys Cys Gly Ser Leu Trp Gly 
900 905 910 

Val Asn Leu Leu Leu Gly Thr Arg Ser Asn Leu Tyr Leu Met Asp Arg 
915 920 925 

Ser Gly Lys Ala Asp lie Thr Lys Leu lie Arg Arg Arg Pro Phe Arg 
930 935 " 940 

Gin lie Gin Val Leu Glu Pro Leu Asn Leu Leu lie Thr lie Ser Gly 
945 950 955 960 

His Lys Asn Arg Leu Arg Val Tyr His Leu Thr Trp Leu Arg Asn Lys 
965 970 975 

lie Leu Asn Asn Asp Pro Glu Ser Lys Arg Arg. Gin Glu Glu Met Leu 
980 985 990 
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Lys Thr Glu Glu Aia Cys Lys Ala He Asp Lys Leu Thr Giy Cys Glu 
995 1000 1005 

His Phe Ser Val Leu Gin His Glu Glu Thr Thr Tyr He Ala He Ala 
1010 1015 1Q20 

Leu Lys Ser Ser lie His Leu Tyr Ala Trp Ala Pro Lys Ser Phe Asp 
i025 1030 1035 1040 

Glu Ser Thr Ala He Lys Val Phe Pro Thr Leu Asp His Lys Pro Val 
1045 1050 1055 

Thr Val Asp Leu Ala He Gly Ser Glu Lys Arg Leu Lys He Phe Phe 
1060 1065 1070 

Ser Ser Ala Asp Gly Tyr His Leu He Asp Ala Glu Ser Glu Val Met 
1075 1080 1085 

Ser Asp Val Thr Leu Pro Lys Asn Pro Leu Glu He He He Pro Gin 
1090 1095 1100 

Asn He He He Leu Pro Asp Cys Leu Gly He Gly Met Met Leu Thr 
1^0^ 1110 1115 1120 

Phe Asn Ala Glu Ala Leu Ser Val Glu Ala Asn Glu Gin Leu Phe Lys 
1125 1130 1135 

Lys He Leu Glu Met Trp Lys Asp He Pro Ser Ser He Ala Phe Glu 
1140 1145 1150 

Cys Thr Gin Arg Thr Thr Gly Trp Gly Gin Lys Ala He Glu Val Arg 
1155 1160 1165 

Ser Leu Gin Ser Arg Val Leu Glu Ser Glu Leu Lys Ara Arg Ser He 
11^0 1175 1180 



Lys Lys Leu Arg Phe Leu Cys Thr Arg Gly Asp Lys Leu Phe Phe Thr 

1190 1195 1200 

Ser Thr Leu Arg Asn His His Ser Arg Val Tyr Phe Met Thr Leu Gly 
1205 1210 1215 

Lys Leu Glu Glu Leu Gin Ser Asn Tyr Asp Val 
1220 1225 



<210> 106 
<211> 2962 
<212> DNA 

<213> Full Length Mammalian (Human) GEK2 
<400> 106 
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cgaagccaca gcccgagccc gagcccgagc ccgagccggc gccaccgcgc ccccggccat 6C 

ggcttttgcc aatttccgcc gcatcctgcg cctgtctacc ttcgagaaga gaaagccccg 120 

cgaatatgag cacgtccgcc gcgacctgga ccccaacgag grgtgggaga tcgtgggcga 180 

gctgggcgac ggcgccttcg gcaaggttta caaggccaag aataaggaga cgggtgcttt 240 

ggctgcggcc aaagtcattg aaaccaagag tgaggaggag ctggaggacc acatcgtgga 300 

gattgagatc ctggccacct gcgaccaccc ctacattgtg aagctcctgg gagcctacta 360 

tcacgacggg aagctgtgga tcatgattga gttctgtcca gggggagccg tggacgccat 420 

catgctggag ctggacagag gcctcacgga gccccagata caggtggttt gccgccagat 480 

gctagaagcc ctcaacttcc tgcacagcaa gaggatcatc caccgagatc tgaaagctgg 540 

caacgtgctg atgacccccg agggagacat caggctggct gactttggtg tgnctgccaa 600 

gaatcrgaag actctacaga aacgagattc cttcatcggc acgccttact ggatggcccc 660 

cgaggtggtc atgtgtgaga ccatgaaaga cacgccctac gactacaaag ccgacatctg 720 

gtccctgggc atcacgctga ttgagatggc ccagatcgag ccgccacacc acgagctcaa 780 

ccccatgcgg gtccrgctaa agatcgccaa gtcggaccct cccacgctgc tcacgccctc 840 

caagtggtct gtagagttcc gtgacttcct gaagatagcc ctggataaga acccagaaac 900 

ccgacccagt gccgcgcagc tgctggagca tcecttcgtc agcagcatca ccagtaacaa 960 

ggctcrgcgg gagctggtgg ctgaggccaa ggccgaggtg atggaagaga tcgaagacgg 1020 

ccgggargag ggggaagagg aggacgccgt ggatgccgcc tccaccctgg agaaccatac 1080 

tcagaactcc tcrgaggtga gtccgccaag cctcaatgct gacaagcctc tcgaggagtc 1140 

accttccacc ccgcrggcac ccagccagtc tcaggacagt gtgaatgagc cctgcagcca 1200 

gccctctggg gacagatccc tccaaaccac cagtccccca gtcgtggccc crggaaatga 1260 

gaacggcctg gcagtigcctg tgcccctgcg gaagtcccga cccgtgtcaa tggatgccag 1320 

aattcaggta gcccaggaga agcaagttgc tgagcagggt ggggacctca gcccagcagc 1380 

caacagatct caaaaggcca gccagagccg gcccaacagc agcgccctgg agaccttggg 1440 

tggggagaag ctggccaatg gcagcctgga gccacctgcc caggcagcrc cagggccttc 1500 

caagagggac ccggactgca gcagcctctg cacctctgag agcatggact atggraccaa 1560 

tctctccact gacctgtcgc tgaacaaaga gatgggctct ctgtccatca aggacccgaa 1620 

actgtacaaa aaaaccctca agcggacacg caaatttgtg gtggatggtg tggaggtgag 1680 

catcaccacc tccaagatca tcagcgaaga tgagaagaag gatgaggaga tgagatttct 1740 

caggcgccag gaactccgag agcttcggct gctccagaaa gaagagcatc ggaaccagac 1800 

ccagctgagt aacaagcatg agctgcagct ggagcaaatg cataaacgtt ttgaacagga I860 

aatcaacgcc aagaagaagt tctttgacac ggaattagag aacctggagc gtcagcaaaa 1920 

gcagcaagtg gagaagatgg agcaagacca tgccgtgcgc cgccgggagg aggccaggcg 1980 

gatccgcctg gagcaggatc gggactacac caggttccaa gagcagctca aactgatgaa 2040 

gaaagaggtg aagaacgagg tggagaagct cccccgacag cagcggaagg aaagcatgaa 2100 

gcagaagatg gaggagcaca cgcagaaaaa gcagcttctt gaccgggact ttgtagccaa 2160 

gcagaaggag gacctggagc tggccatgaa gaggctcacc accgacaaca ggcgggagat 2220 

ctgtgacaag gagcgcgagt gcctcatgaa gaagcaggag ctccttcgag accgggaagc 2280 

agccctgtgg gagatggaag agcaccagct gcaggagagg caccagctgg tgaagcagca 2340 

gctcaaagac cagtacttcc tccagcggca cgagctgctg cgcaagcatg agaaggagcg ' 2400 

ggagcagatg cagcgctaca accagcgcat gatagagcag ctgaaggtgc ggcagcaaca 2460 

ggaaaaggcg cggctgccca agatccagag gagtgagggc aagacgcgca tggccatgta 2520 

caagaagagc ctccacatca acggcggggg cagcgcagct gagcagcgtg agaagatcaa 2580 

gcagttctcc cagcaggagg agaagaggca gaagrcggag cggctgcagc aacagcagaa 2640 

acacgagaac cagatgcggg acatgctggc gcagtgcgag agcaacatga gcgagctgca 2700 

gcagctgcag aacgaaaagt gccacctcct ggtagagcac gaaacccaga aactgaaggc 2760 

cctggatgag agccataacc agaacctgaa ggaatggcgg gacaagcttc ggccgcgcaa 2820 

gaaggctctg gaagaggatc tgaaccagaa gaagcgggag caggagatgt tctrcaagct 2880 

gagcgaggag gcggagtgcc caaacccctc caccccaagc aaggccgcca agttcttccc 2940 

ctacagctcc ggggatgctf cc 2962 
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<210> 107 
<211> 968 
<212> PRT 

<213> Full Length Mammalian (Human) GEK2 
<400> 107 



Met Ala Phe Ala 
1 

Lys Arg Lys Ser 
20 

Asn Glu Val Trp 
35 

Lys Val Tyr Lys 
50 

Lys Val He Glu 
65 

Glu He Glu He 



Leu Gly Ala Tyr 
100 

Cys Pro Gly Gly 
115 

Leu Thr Glu Pro 
130 

Leu Asn Phe Leu 
145 

Gly Asn Val Leu 



Gly Val Ser Ala 
180 

He Gly Thr Pro 
195 

Met Lys Asp Thr 
210 

He Thr Leu He 
225 

Asn Pro Met Arg 



Asn Phe Arg Arg 
5 

Arg Glu Tyr Glu 



Glu He Val Gly 
40 

Ala Lys Asn Lys 
55 

Thr Lys Ser Glu 
70 

Leu Ala Thr Cys 
85 

Tyr His Asp Gly 



Ala Val Asp Ala 
120 

Gin He Gin Val 
135 

His Ser Lys Arg 
150 

Met Thr Leu Glu 
165 

Lys Asn Leu Lys 



Tyr Trp Met Ala 
200 

Pro Tyr Asp Tyr 
215 

Glu Met Ala Gin 
230 

Val Leu Leu Lys 
245 



He Leu Arg Leu 
10 

His Val Arg Arg 
25 

Glu Leu Gly Asp 



Glu Thr Gly Ala 
60 

Glu Glu Leu Glu 
75 

Asp His Pro Tyr 
90 

Lys Leu Trp He 
105 

He Met Leu Glu 



Val Cys Arg Gin 
140 

He He His Arg 
155 

Gly Asp He Arg 
170 

Thr Leu Gin Lys 
185 

Pro Glu Val Val 



Lys Ala Asp He 
220 

lie Glu Pro Pro 
235 

He Ala Lys Ser 
250 



Ser Thr Phe Glu 
15 

Asp Leu Asp Pro 
30 

Gly Ala Phe Gly 
45 

Leu Ala Ala Ala 



Asp Tyr He Val 
80 

He Val Lys Leu 
95 

Met He Glu Phe 
110 

Leu Asp Arg Gly 
125 

Met Leu Glu Ala 



Asp Leu Lys Ala 
160 

Leu Ala Asp Phe 
175 

Arg Asp Ser Phe 
190 

Met Cys Glu Thr 
205 

Trp Ser Leu Gly 



His His Glu Leu 
240 

Asp Pro Pro Thr 
255 
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Leu Leu Thr Pro Ser Lys Trp Ser Val Giu Phe Arg Asp Phe Leu Lys 
260 265 270 

lie Ala Leu Asp Lys Asn Pro Glu Thr Arg Pro Ser Ala Ala Gin Leu 
275 280 285 

Leu Glu His Pro Phe Val Ser Ser lie Thr Ser Asn Lys Ala Leu Arq 
290 295 300 

Glu Leu Val Ala Glu Ala Lys Ala Glu Val Met Glu Glu He Glu Asp 

310 315 320 

Gly Arg Asp Glu Gly Giu Glu Glu Asp Ala Val Asp Ala Ala Ser Thr 
325 330 335 

Leu Glu Asn His Thr Gin Asn Ser Ser Glu Val Ser Pro Pro Ser Leu 
340 345 350 

Asn Ala Asp Lys Pro Leu Glu Glu Ser Pro Ser Thr Pro Leu Ala Pro 
355 360 ■ 365 

Ser Gin Ser Gin Asp Ser Val Asn Glu Pro Cys Ser Gin Pro Ser Gly 
370 375 380 

Asp Arg Ser Leu Gin Thr Thr Ser Pro Pro Val Val Ala Pro Gly Asn 

390 395 400 

Glu Asn Gly Leu Ala Val Pro Val Pro Leu Arg Lys Ser Arg Pro Val 
405 410 ' 415 

Ser Met Asp Ala Arg He Gin Val Ala Gin Glu Lys Gin Val Ala Glu 
420 425 430 

Gin Gly Gly Asp Leu Ser Pro Ala Ala Asn Arg Ser Gin Lys Ala Ser 
435 440 445 

Gin Ser Arg Pro Asn Ser Ser Ala Leu Glu Thr Leu Gly Gly Glu Lys 
450 455 

Leu Ala Asn Gly Ser Leu Glu Pro Pro Ala Gin Ala Ala Pro Gly Pro 
"^^5 470 475 . 480 

Ser Lys Arg Asp Ser Asp Cys Ser Ser Leu Cys Thr Ser Glu Ser Met 
485 490 495 

Asp Tyr Gly Thr Asn Leu Ser Thr Asp Leu Ser Leu Asn Lys Glu Met 
500 505 510 

Gly Ser Leu Ser He Lys Asp Pro Lys Leu Tyr Lys Lys Thr Leu Lys 
515 520 525 

Arg Thr Arg Lys Phe Val Val Asp Gly Val Glu Val Ser He Thr- Thr 
530 535 

Ser Lys He He Ser Glu Asp Glu Lys Lys Asp Glu Glu Met Arg Phe 

550 555 560 
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Leu Arg Arg Gin Giu Leu Arg Giu Leu Arg Leu Leu Gin Lys Glu Giu 
565 570 575 

His Arg Asn Gin Thr Gin Leu Ser Asn Lys His Giu Leu Gin Leu Giu 
580 585 590 

Gin Met His Lys Arg Phe Giu Gin Glu lie Asn Ala Lys Lvs Lys Phe 
595 600 605 

Phe Asp Thr Giu Leu Giu Asn Leu Glu Arg Gin Gin Lys Gin Gin Vai 

615 620 

Giu Lys Met Glu Gin Asp His Ala Vai Arg Arg Arg Glu Giu Ala Arg 
^25 630 635 640 

Arg lie Arg Leu Giu Gin Asp Arg Asp Tyr Thr Arg Phe Gin Giu Gin 
645 650 655 

Leu Lys Leu Met Lys Lys Giu Vai Lys Asn Giu Vai Glu Lys Leu Pro 
660 665 670 

Arg Gin Gin Arg Lys Giu Ser Met Lys Gin Lys Met Glu Glu His Thr 
675 680 685 

Gin Lys Lys Gin Leu Leu Asp Arg Asp Phe Vai Ala Lys Gin Lys Glu 
690 695 700 

Asp Leu Giu Leu Ala Met Lys Arg Leu Thr Thr Asd Asn Arg Arg Giu 

710 715 720 

lie Cys Asp Lys Giu Arg Glu Cys Leu Met Lys Lys Gin Glu Leu Leu 
725 730 735 

Arg Asp Arg Giu Ala Ala Leu Trp Giu Met Glu Glu His Gin Leu Gin 
740 745 750 

Giu Arg His Gin Leu Vai Lys Gin Gin Leu Lys Asp Gin Tyr Phe Leu 
"755 760 765 

Gin Arg His Glu Leu Leu Arg Lys His Giu Lys Giu Arg Giu Gin Met 
'^'70 775 780 

Gin Arg Tyr Asn Gin Arg Met He Giu Gin Leu Lys Vai Arg Gin Gin 

790 795 800 

Gin Giu Lys Ala Arg Leu Pro Lys He Gin Arg Ser Giu Gly Lys Thr 
805 810 815 

Arg Met Ala Met Tyr Lys Lys Ser Leu His He Asn Gly Gly Gly Ser 
820 825 830 

Ala Ala Giu Gin Arg Giu Lys lie Lys Gin Phe Ser Gin Gin Giu Giu 
835 840 845 
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Lys Arg Gin Lys 
850 

Gin Met Arg Asp 
865 

Gin Gin Leu Gin 



Gin Lys Leu Lys 
900 

Trp Arg Asp Lys 
915 

Asn Gin Lys Lys 
930 

Ala Glu Cys Pro 
945 

Pro Tyr Ser Ser 



Ser Glu Arg Leu 
855 

Met Leu Ala Gin 
870 

Asn Glu Lys Cys 
885 

Ala Leu Asp Glu 



Leu Arg Pro Arg 
920 

Arg Glu Gin Glu 
935 

Asn Pro Ser Thr 
950 

Gly Asp Ala Ser 
965 



Gin Gin Gin Gin 
860 

Cys Glu Ser Asn 
875 

His Leu Leu Vai 
890 

Ser His Asn Gin 
905 

Lys Lys Ala Leu 



Met Phe Phe Lys 
940 

Pro Ser lys Ala 
955 



Lys His Glu Asn 



Met Ser Giu Leu 
880 

Glu His Giu Thr 
895 

Asn Leu Lys Glu 
910 

Glu Glu Asp Leu 
925 

Leu Ser Glu Glu 



Ala Lys Phe Phe 
960 



<210> 108 

<211> 11 ... 
<212> PRT 

<213> Mammalian (human) 2C1 and/or 2C3 
<220> 

<223> "Xaa can stand for any amino acid 
<400> 108 

Leu Xaa Leu Xaa Leu Xaa Leu Xaa Leu Xaa Leu 
15 10 



<210> 109 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 



<400> 109 

gcagcaagtg gagaagatgg 20 



<210> 110 
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<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthesized nucleic acid molecule 
<400> 110 
ggaagcatcc ccagagctgt ag 



<210> Hi 

<211> 12 

<212> PRT 

<213> Mammalian (Human) STLK2 

<400> Hi 

Glu Lys Phe Gin Lys Cys Ser Ala Asp Glu Ser Pro 
1 5 10 



<210> 112 
<21i> 16 
<212> PRT 

<213> Mammalian (Human) STLK4 
<400> 112 

Ser He Ser Asn Ser Glu Leu Phe Pro Thr Thr Asp Pre Val Gly Thr 
1 5 10 15 



<210> 113 
<211> 9 
<212> PRT 

<213> Mammalian (Human) SULUl 
<400> 113 

Leu Asp Phe Pro Lys Glu Aso Tyr Arg 
1 5 



<210> 114 
<211> 12 
<212> PRT 

<213> Mammalian (Huirtan) SULUl 
<400> 114 
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His Giy Asp Pro Arg Pro Glu Pro Arg Pro Thr Gin 
15 10 



<210> 115 
<211> 14 
<212> PRT 

<213> Mammalian (Human) SULU3 
<400> 115 

Pro Ser Thr Asn Arg Ala Gly Ser Leu Lys Asp Pro Glu Cys 
15 10 



<210> 116 
<211> 19" 
<212> PRT 

<213> Mammalian (Human) SULU3 
<400> 116 

Asp Pro Arg Thr Arg Ala Ser Asp Pro Gin Ser Pro Pro Gin Val Ser 
15 10 15 

Arg His Lys 



<210> 117 
<211> 16 
<212> PRT 

<213> Mammalian (Human) PAK4 
<400> 117 



Cys Leu Val Pro Leu He Gin Leu Tyr Arg Lys Gin Thr Ser Thr Cys 
15 10 15 



<210> 118 
<211> 9 
<212> PRT 

<213> Mammalian (Human) PAK5 

<400> 118 

Pro Leu Met Ara Gin Asn Arg Thr Arg 
1 5 



BNSDOCID: <WO 9953036A2_tA> 



wo 99/53036 



PCT/US99/08150 



128 



<210> 119 
<211> 16 
<212> PRT 

<213> Mammalian (Human) PAK5 
<400> 119 



Ser Gly Asp Arg Arg Arg Ala Gly Pro Glu Lys Arg Pro Lys Ser Ser 
15 10 15 



<210> 120 

<21i> 16 

<212> PRT 

<213> Mammalian (Human) PAK5 

<400> 120 

Cys Arg Arg Lys Ser Leu Val Gly Thr Pro Tyr Trp Met Ala Pro Glu 
15 10 15 



<210> 121 
<211> 20 
<212> PRT 

<213> Mammalian (Human) ZCl 
<400> 121 



Arg Thr Val Gly Arg Arg Asn Thr Phe lie Gly Thr Pro Pro Tyr Trp 
1 5 10 ^ 15 



Met Ala Pro Glu 
20 



<210> 122 

<211> 15 

<212> PRT 

<213> Mammalian (Human) KHSl 

<400> 122 

Lys Arg Lys Ser Phe lie Gly Thr Pro Tyr Trp Met Ala Pro Glu 
15 10 15 
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<210> 123 
<211> 15 
<212> PRT 

<213> Mammalian (Human) STLK2 
<400> 123 

Lys Arg Asn Thr Phe Val Gly Thr Pro Phe Trp Met Ala Pro Glu 
15 10 15 



<210> 124 
<211> 15 
<212> PRT 

<213> Mammalian (Human) SULUl 
<400> 124 

Pro Ala Asn Ser Phe Val Gly Thr Pro Tyr Trp Met Ala Pro Glu 
1 5 10 15 



<210> 125 

<211> 15 

<212> PRT 

<213> Mammalian (Human) ZCl 

<400> 125 

Arg Arg Asn Thr Phe He Gly Thr Pro Tyr Trp Met Ala P^o Glu 
1 5 - . 10 15 



<210> 126 

<211> 15 

<212> PRT 

<213> Mammalian (Human) ZCl 

<400> 126 

Arg Arg Asn Thr Phe He Gly Thr Pro Tyr Trp Met Ala Pro Glu 
- 5 10 15 



<210> 127 
<211> 18 
<212> PRT 

<213> Mammalian (Human) STLK4 
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<400> 127 

Arg Asn Lys Val Arg Lys Thr Phe Val Gly Thr Pro Cys Trp Met Ala 
IS 10 15 

Pro Glu 



<210> 128 
<211> 22 
<212> DNA 

<213> Mammalian (Human) STLK3 



<400> 128 
ctcccatttc ctagcaaaat ca 



<210> 129 
<211> 22 
<212> DNA 

<213> Mammalian (Human) STLK3 
<400> 129 
agaggcagta ttgtcagatg ta 



<210> 130 
<211> 22 
<212> DNA 

<213> Mammalian (Human) STLK4 
<400> 130 
ccacacargc gtatctctgt tg 



<210> 131 
<211> 24 
<212> DNA 

<2i3> Mammalian (Human) 3TLK4 
<400> 
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<210> 132 
<211> 22 
<212> DNA 
<213> Mammalian (Human) ZCl 

<400> 132 

atccctggat cacactgctt ct 



<210> 133 
<211> 22 
<212> DNA 
<213> Mammalian (Human) ZCl 

<400> 133 

caaggtgttc tttgcctctg tt 



<210> 134 
<211> 21 
<212> DNA 
<213> Mammalian (Human) ZC2 

<400> 134 

agatggactg tacrgggagg g 



'<210> 135 
<211> 22 
<212> DNA 
<213> Mammalian (Human) ZC2 

<400> 135 
agaagagcac ttggcactca tc 



<210> 136 
<211> 20 
<212> DNA 
<213> Mammalian (Human) ZC3 
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<400> 136 
catcatgaac tggtgacggg 



<210> 137 
<211> 23 
<212> DNA 

<213> Mammalian {Human) ZC3 
<400> 137 
ccagtgaaat caaaccagta aaa 



<210> 138 
<211> 24 
<212> DNA 

<213> Mammalian (Human) SULUl 
<400> 138 
caaaacctgg ccgtctcttc tatt 



<210> 139 
<211> 23 
<212> DNA 

<213> Mammalian (Human) SULUl 
<400> 139 
atttgtgcta ctgggattct gtg 



<210> 140 
<211> 24 
<212> DNA 

<2i3> Mammalian (Human) KHS2 
<400> 140 
gaatagcggt accacgatag aata 

<2i0> 141 



20 



23 



24 



24 
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<210> 142 
<211> 23 
<212> DNA 

<213> Mammalian (Human) PAK4 
<400> 142 
ctcagtattc tctccaaaga ttg 



23 



<210> 143 
<211> 23 
<212> DNA 

<213> Mammalian (Human) PAK4 
<400> 143 
gatgttctct ccattctgta aag 



23 



<210> 144 
<211> 21 
<212> DNA 

<213> Mammalian (Human) PAK5 
<400> 144 
catcactgga agtctgcagt g 



21 



<210> 145 
<211> 20 
<212> DNA 

<213> Mammalian (Human) PAK5 
<400> 145 
caggtgcagt agtcatttgc 



20 
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<210> 146 
<211> 20 
<212> DNA 
<213> T7 phage 

<400> 146 

ggagctgtcg tatrccagtc 



<210> 147 
<211> 21 
<212> DNA 
<213> T7 phage 

<400> 147 

aacccctcaa gacccgttta g 
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